342 lines
		
	
	
		
			18 KiB
		
	
	
	
		
			HTML
		
	
	
	
	
	
			
		
		
	
	
			342 lines
		
	
	
		
			18 KiB
		
	
	
	
		
			HTML
		
	
	
	
	
	
| {% extends 'base.html' %}
 | |
| 
 | |
| {% block title %}About SSH{% endblock %}
 | |
| 
 | |
| {% block content %}
 | |
|   <div class="row full-margin"><h1>Understanding the Secure Shell Protocol (SSH)</h1></div>
 | |
| {% endblock %}
 | |
| 
 | |
| {% block subcontent %}
 | |
| <div class="long-form">
 | |
| <p>
 | |
|   In order to use our service, you will have to use the Secure Shell protocol (SSH) to connect to your capsul.
 | |
| </p>
 | |
| <p>
 | |
|   SSH is a very old tool, created back when the internet was a different place, with different use cases and concerns.
 | |
|   In many ways, the protocol has failed to evolve to meet the needs of our 21st century global internet. 
 | |
|   Instead, the users of SSH (tech heads, sysadmins, etc) have had to evolve our processes to work around SSH's limitations.
 | |
| </p>
 | |
| <p>
 | |
|   These days, we use SSH + public-key cryptography to establish secure connections to our servers. 
 | |
|   If you are not familiar with the concept of public key cryptography, cryptographic signatures, 
 | |
|   or diffie-hellman key exchange, you may wish to see 
 | |
|   <a href="https://en.wikipedia.org/wiki/Public-key_cryptography">the wikipedia article</a> for a refresher.
 | |
| </p>
 | |
| 
 | |
| <div class="row half-margin"><h1>Public Key Crypto and Key Exchange: The TL;DR</h1></div>
 | |
| 
 | |
| <p>
 | |
|   Computers can generate <b>"key pairs"</b> which consist of a public key and a private key. Given a <b>public key pair A</b>:
 | |
|   <ol>
 | |
|     <li>
 | |
|       A computer which has access to <b>public key A</b> can encrypt data, 
 | |
|       and then <b>ONLY</b> a computer which has access <b>private key A</b> can decrypt & read it
 | |
|     </li>
 | |
|     <li>
 | |
|       Likewise, a computer which has access to <b>private key A</b> can encrypt data, 
 | |
|       and any a computer which has access <b>public key A</b> can decrypt it, 
 | |
|       thus <b>PROVING</b> the message must have come from someone who posesses <b>private key A</b>
 | |
|     </li>
 | |
|   </ol>
 | |
|   Key exchange is a process in which two computers, Computer A and Computer B (often referred to as Alice and Bob)
 | |
|   both create key pairs, so you have <b>key pair A</b> and <b>key pair B</b>, for a total of 4 keys:
 | |
|   <ol>
 | |
|     <li><b>public key A</b></li>
 | |
|     <li><b>private key A</b></li>
 | |
|     <li><b>public key B</b></li>
 | |
|     <li><b>private key B</b></li>
 | |
|   </ol>
 | |
|   In simplified terms, during a key exchange, 
 | |
|   <ol>
 | |
|     <li><b>computer A</b> sends <b>computer B</b> its public key</li>
 | |
|     <li><b>computer B</b> sends <b>computer A</b> its public key</li>
 | |
|     <li><b>computer A</b> sends <b>computer B</b> 
 | |
|       a message which is encrypted with <b>computer B</b>'s public key</li>
 | |
|     <li><b>computer B</b> sends <b>computer A</b> 
 | |
|       a message which is encrypted with <b>computer A</b>'s public key</li>
 | |
|   </ol>
 | |
|   The way this process is carried out allows A and B to communicate with each-other securely, which is great, <br/><br/>
 | |
| 
 | |
|   <b><u>HOWEVER, there is a catch!!</u></b>
 | |
| </p>
 | |
| 
 | |
| <p>
 | |
|   When computers A and B are trying to establish a secure connection for the first time, 
 | |
|   we assume that the way they communicate right now is NOT secure. That means that someone on the network 
 | |
|   between A and B can read & modify
 | |
|   all messages they send to each-other! You might be able to see where this is heading... 
 | |
| </p>
 | |
| <p>
 | |
|   When <b>computer A</b> sends its public key to <b>computer B</b>, 
 | |
|   someone in the middle (lets call it <b>computer E, or Eve</b>) could record that message, save it, 
 | |
|   and then replace it with a forged message to <b>computer B</b> containing <b>public key E</b>
 | |
|   (from a key pair that <b>computer E</b> generated).
 | |
| 
 | |
|   If this happens, when <b>computer B</b> sends an encrypted message to <b>computer A</b>, 
 | |
|   B thinks that A's public key is actually <b>public key E</b>, so it will use <b>public key E</b> to encrypt.
 | |
|   And again, <b>computer E</b> in the middle can intercept the message, and they can decrypt it as well 
 | |
|   because they have <b>private key E</b>.
 | |
|   Finally, they can relay the same message to <b>computer A</b>, this time encrypted with <b>computer A</b>'s public key. 
 | |
|   This is called a <a href="https://en.wikipedia.org/wiki/Man-in-the-middle_attack">Man In The Middle (MITM)</a> attack.
 | |
| </p>
 | |
| <p>
 | |
|   Without some additional verification method, 
 | |
|   <b><u>Computer A AND Computer B can both be duped and the connection is NOT really secure</u></b>.
 | |
| </p>
 | |
| 
 | |
| <div class="row half-margin"><h1>Authenticating Public Keys: A Tale of Two Protocols</h1></div>
 | |
| 
 | |
| <p>
 | |
|   Now that we have seen how key exhange works, 
 | |
|   and we understand that in order to prevent MITM attacks, all participants have to have a way of knowing 
 | |
|   whether a given public key is authentic or not, I can explain what I meant when I said
 | |
| </p>
 | |
| <p>
 | |
|   > [SSH] has failed to evolve to meet the needs of our 21st century global internet
 | |
| </p>
 | |
| <p>
 | |
|   In order to explain this, let's first look at how a different, more modern protocol, 
 | |
|   Transport Layer Security (or TLS) solved this problem. 
 | |
|   TLS, (still sometimes called by its olde name "Secure Sockets Layer", or SSL) was created to enable HTTPS, and allow 
 | |
|   internet users to log into web sites securely and purchase things online by entering their credit card number.
 | |
|   Of course, this required security that actually works; if someone could MITM attack the connection, they could easily
 | |
|   steal tons of credit card numbers and passwords.
 | |
| </p>
 | |
| <p>
 | |
|   In order to enable this, a new sub-protocol called <a href="https://en.wikipedia.org/wiki/X.509">X.509</a> was created. 
 | |
|   X.509 is a standard related to the data format of certificates and keys (public keys and private keys), but it also defines 
 | |
|   a simple and easy way to determine whether a given certificate (public key) is authentic. 
 | |
|   X.509 introduced the concept of a Certificate Authority, or CA. 
 | |
|   These CAs were supposed to be bank-like public institutions of power which everyone could trust. 
 | |
|   The CA would create a key pair on an extremely secure computer, and then a CA Certificate (the public side of that key pair)
 | |
|   would be distributed along with every copy of Windows, Mac OS, and Linux. Then companies who wanted to run a secure web server 
 | |
|   could generate thier OWN key pair for thier web server, 
 | |
|   and pay the CA to sign thier web server's X.509 certificate (public key) with the highly protected CA private key. 
 | |
|   Critically, issue date, expiration date, and the domain name of the web server, like foo.example.com, would have to be included 
 | |
|   in the x.509 certiciate along with the public key. 
 | |
|   This way, when the user types https://foo.example.com into thier web browser:
 | |
| 
 | |
|   <ol>
 | |
|     <li>The web browser sends a TLS ClientHello request to the server</li>
 | |
|     <li>
 | |
|       The server responds with a ServerHello & ServerCertificate message
 | |
|       <ul>
 | |
|         <li>The ServerCertificate message contains the X.509 certificate for the web server at foo.example.com</li>
 | |
|       </ul>
 | |
|     </li>
 | |
|     <li>The web browser inspects the X.509 certificate
 | |
|       <ul>
 | |
|         <li>
 | |
|           Is the current date in between the issued date and expiry date of the certificate?
 | |
|           If not, display an <a href="https://expired.badssl.com/">EXPIRED_CERTIFICATE error</a>.
 | |
|         </li>
 | |
|         <li>
 | |
|           Does the domain name the user typed in, foo.example.com, match the domain name in the certificate? 
 | |
|           If not, display a <a href="https://wrong.host.badssl.com/">BAD_CERT_DOMAIN error</a>.
 | |
|         </li>
 | |
|         <li>
 | |
|           Does the certificate contain a valid CA signature? 
 | |
|           (can the signature on the certificate be decrypted by one of the CA Certificates included with the operating system?) 
 | |
|           If not, display a <a href="https://untrusted-root.badssl.com/">UNKNOWN_ISSUER error</a>.
 | |
|         </li>
 | |
|       </ul>
 | |
|     </li>
 | |
|     <li>Assuming all the checks pass, the web browser trusts the certificate and connects</li>
 | |
|   </ol>
 | |
| </p>
 | |
| <p>
 | |
|   This system enabled the internet to grow and flourish:
 | |
|   purchasing from a CA was the only way to get a valid X.509 certificate for a website, 
 | |
|   and guaranteeing authenticity was in the CA's business interest. 
 | |
|   The CAs kept their private keys behind razor wire and armed guards, and followed strict rules to ensure that only the right
 | |
|   people got thier certificates signed. 
 | |
|   Only the CA's themselves or anyone who had enough power to force them to create a fraudulent certificate 
 | |
|   would be able to execute MITM attacks.
 | |
| </p>
 | |
| <p>
 | |
|   The TLS+X.509 Certificate Authority works well for HTTP and other application protocols, because 
 | |
|   <ul>
 | |
|     <li>Most internet users don't have the patience to manually verify the authenticity of digital certificates.</li>
 | |
|     <li>Most internet users don't understand or care how it works; they just want to connect right now.</li>
 | |
|     <li>Businesses and organizations that run websites are generally willing to jump through hoops and 
 | |
|       subjugate themselves to authorities in order to offer a more secure application experience to thier users.</li>
 | |
|     <li>The centralization & problematic power dynamic which CAs represent 
 | |
|       is easily swept under the rug, if it doesn't directly or noticably impact the average person, who cares?</li>
 | |
|   </ul>
 | |
| </p>
 | |
| 
 | |
| <p>
 | |
|   However, this would never fly with SSH. You have to understand, SSH does not come from Microsoft, it does not come from Apple, 
 | |
|   in fact, it does not even come from Linux or GNU. <a href="https://www.openssh.com/">SSH comes from BSD</a>. 
 | |
|   <a href="https://en.wikipedia.org/wiki/BSD">Berkeley Software Distribution</a>. Most people don't even know 
 | |
|   what BSD is. It's <i>Deep Nerdcore</i> material. The people who maintain SSH are not playing around, they would never 
 | |
|   allow themselves to be subjugated by so-called "Certificate Authorities".
 | |
|   So, what are they doing instead? Where is SSH at? Well, back when it was created, computer security was easy — 
 | |
|   a very minimal defense was enough to deter attackers. 
 | |
|   In order to help prevent these MITM attacks, instead of something like X.509, SSH uses a policy called 
 | |
|   <a href="https://en.wikipedia.org/wiki/Trust_on_first_use">Trust On First Use (TOFU)</a>. 
 | |
| </p>
 | |
| 
 | |
| <p> 
 | |
|   The SSH client application keeps a record of every server it has ever connected to 
 | |
|   in a file <span class="code">~/.ssh/known_hosts</span>.
 | |
| </p>
 | |
| 
 | |
| <p> 
 | |
|   (the tilde <span class="code">~</span> here represents the user's home directory, 
 | |
|   <span class="code">/home/username</span> on linux, 
 | |
|   <span class="code">C:\Users\username</span> on Windows, and 
 | |
|   <span class="code">/Users/username</span> on MacOS).
 | |
| </p>
 | |
| 
 | |
| <p> 
 | |
|   If the user asks the SSH client to connect to a server it has never seen before, 
 | |
|   it will print a prompt like this to the terminal:
 | |
| </p>
 | |
| 
 | |
|   <pre class="code">The authenticity of host 'fooserver.com (69.4.20.69)' can't be established.
 | |
|     ECDSA key fingerprint is SHA256:EXAMPLE1xY4JUVhYirOVlfuDFtgTbaiw3x29xYizEeU.
 | |
|     Are you sure you want to continue connecting (yes/no/[fingerprint])?</pre>
 | |
| 
 | |
| <p>
 | |
|   Here, the SSH client is displaying the fingerprint (<a href="https://en.wikipedia.org/wiki/SHA-2">SHA256 hash</a>) 
 | |
|   of the public key provided by the server at <span class="code">fooserver.com</span>. 
 | |
|   Back in the day, when SSH was created, servers lived for months to years, not minutes, and they were installed by hand. 
 | |
|   So it would have been perfectly reasonable to call the person installing the server 
 | |
|   and ask them to log into it & read off the host key fingerprint over the phone.  
 | |
|   After verifing that the fingerprints match in the phone call, the user would type <span class="code">yes</span> 
 | |
|   to continue.
 | |
| </p>
 | |
| 
 | |
| <p>
 | |
|   After the SSH client connects to a server for the first time, it will record the server's IP address and public key in the 
 | |
|   <span class="code">~/.ssh/known_hosts</span> file. All subsequent connections will simply check the public key 
 | |
|   the server presents against the public key it has recorded in the <span class="code">~/.ssh/known_hosts</span> file. 
 | |
|   If the two public keys match, the connection will continue without prompting the user, however, if they don't match, 
 | |
|   the SSH client will display a scary warning message:
 | |
| </p>
 | |
| <pre class="code">
 | |
| @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
 | |
| @       WARNING: POSSIBLE DNS SPOOFING DETECTED!          @
 | |
| @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
 | |
| The ECDSA host key for fooserver.com has changed,
 | |
| and the key for the corresponding IP address 69.4.20.42
 | |
| is unknown. This could either mean that
 | |
| DNS SPOOFING is happening or the IP address for the host
 | |
| and its host key have changed at the same time.
 | |
| @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
 | |
| @    WARNING: REMOTE HOST IDENTIFICATION HAS CHANGED!     @
 | |
| @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
 | |
| IT IS POSSIBLE THAT SOMEONE IS DOING SOMETHING NASTY!
 | |
| Someone could be eavesdropping on you right now (man-in-the-middle attack)!
 | |
| It is also possible that a host key has just been changed.
 | |
| The fingerprint for the ECDSA key sent by the remote host is
 | |
| SHA256:EXAMPLEpDDefcNcIROtFpuTiHC1j3iNU74aaKFO03+0.
 | |
| Please contact your system administrator.
 | |
| Add correct host key in /root/.ssh/known_hosts to get rid of this message.
 | |
| Offending ECDSA key in /root/.ssh/known_hosts:1
 | |
|   remove with:
 | |
|   ssh-keygen -f "/root/.ssh/known_hosts" -R "fooserver.com"
 | |
| ECDSA host key for fooserver.com has changed and you have requested strict checking.
 | |
| Host key verification failed.
 | |
| </pre>
 | |
| 
 | |
| <p>
 | |
|   This is why it's called <b>Trust On First Use</b>: 
 | |
|   
 | |
|   SSH protocol assumes that when you type <span class="code">yes</span> in response to the prompt during your first connection, 
 | |
|   you <b>really did</b> verify that the server's public key fingerprint matches.
 | |
|   
 | |
|   If you type <span class="code">yes</span> here without checking the server's host key somehow, you could add an attackers public key to the trusted 
 | |
|   list in your <span class="code">~/.ssh/known_hosts</span> file; if you type <span class="code">yes</span> blindly, you are 
 | |
|   <b>completely disabling all security of the SSH connection</b>. 
 | |
|   It can be fully man-in-the-middle attacked & you are vulnerable to spying, command injection, result-falsification, 
 | |
|   the whole nine yards. 
 | |
| </p>
 | |
| 
 | |
| <p>
 | |
|   So what are technologists to do? Most cloud providers don't "provide" a secure and reliable way to get the SSH host public keys
 | |
|   for instances that users create on thier platform. For example, see this 
 | |
|   <a href="https://serverfault.com/questions/941915/verify-authenticity-of-ssh-host-on-digital-ocean-droplet-freebsd">
 | |
|     question posted by a frustrated user trying to secure thier connection to a digitalocean droplet
 | |
|   </a>.
 | |
| 
 | |
|   Besides using the provider's HTTPS-based console to log into the machine & directly read the public key, most of the time,
 | |
|   providers recommend using a "userdata script", which runs when the machine boots, to upload the machine's SSH public keys to a 
 | |
|   trusted location, like <a href="https://www.backblaze.com/b2/cloud-storage.html">Backblaze B2</a> or 
 | |
|   <del>Amazon S3</del><sup><a href="#ref_1">[1]</a></sup> storage for later retrieval by users. 
 | |
|   As an example, I wrote a  
 | |
|   <a href="https://git.sequentialread.com/forest/rootsystem/src/1cdbe53974d20da97d9f522d4bd62c34487817c0/terraform-modules/gateway-instance-digitalocean/upload_known_hosts.tpl#L5">
 | |
|     userdata script which does this</a>
 | |
|   for my own cloud compute management tool called 
 | |
|   <a href="https://git.sequentialread.com/forest/rootsystem">rootsystem</a>.
 | |
|   Later in the process, rootsystem will 
 | |
|   <a href="https://git.sequentialread.com/forest/rootsystem/src/1cdbe53974d20da97d9f522d4bd62c34487817c0/host-key-poller/main.go#L33">
 | |
|     download the public keys from the Object Storage provider 
 | |
|     and add them to the ~/.ssh/known_hosts file</a>
 | |
|   before finally 
 | |
|   <a href="https://git.sequentialread.com/forest/rootsystem/src/1cdbe53974d20da97d9f522d4bd62c34487817c0/terraform-modules/ansible-threshold-server/main.tf#L32">
 | |
|     invoking the ssh client against the cloud host</a>.
 | |
| </p>
 | |
| 
 | |
| <p>
 | |
|   Personally, I think it's disgusting and irresponsible to require users to go through that much work
 | |
|   just to be able to connect to their instance securely. However, this practice appears to be an industry standard. 
 | |
|   It's gross, but it's where we're at right now.
 | |
| </p>
 | |
| 
 | |
| <p>
 | |
|   So for <a href="https://capsul.org">capsul</a>, we obviously wanted to do better. 
 | |
|   We wanted to make this kind of thing as easy as possible for the user, 
 | |
|   so I'm proud to announce as of today, capsul SSH host key fingerprints will be displayed on the capsul detail page, 
 | |
|   as well as the host's SSH public keys themselves in <span class="code">~/.ssh/known_hosts</span> format.
 | |
|   Users can simply copy and paste these keys into thier <span class="code">~/.ssh/known_hosts</span> file and connect 
 | |
|   with confidence that they are not being MITM attacked. 
 | |
| </p>
 | |
| 
 | |
| <div class="row half-margin"><h1>It's 2021. Can't we do better than this? What's next?</h1></div>
 | |
| 
 | |
| <p>
 | |
|   Glad you asked 😜.
 | |
| </p>
 | |
| <p>
 | |
|   TLS is great, except it has one problem: the X.509 CA system centralizes power and structurally invites abuse.
 | |
|   Power corrupts, and absolute power corrupts absolutely. But there is hope for the future: with the invention of Bitcoin
 | |
|   in 2009, we now have a new tool to use for authority-free secure consensus. Some bright folks have forked Bitcoin to produce
 | |
|   <a href="https://www.namecoin.org/">Namecoin</a>, a DNS-like public blockchain which is 
 | |
|   <a href="https://en.bitcoin.it/wiki/Merged_mining_specification">merge-mined</a> with Bitcoin, and which allows users to 
 | |
|   <a href="https://sequentialread.com/how-to-register-a-namecoin-bit-domain-with-electrum-nmc/">
 | |
|     register and trade names, including domain names
 | |
|   </a>. In fact, Namecoin features a 
 | |
|   <a href="https://github.com/namecoin/proposals/blob/master/ifa-0003.md">
 | |
|     specification for associating public keys with domain names
 | |
|   </a>
 | |
|   and easy-to-use client software packages capable of resolving these 
 | |
|   <a href="https://www.namecoin.org/download/betas/#ncdns">names</a> 
 | |
|   &
 | |
|   <a href="https://www.namecoin.org/download/betas/#ncp11">
 | |
|     public</a>
 | |
| 
 | |
|   <a href="https://www.namecoin.org/resources/presentations/Grayhat_2020/Namecoin_TLS_Part_2_Grayhat_2020_Monero_Village.pdf">
 | |
|     keys</a>, 
 | |
|   capable of replacing both the DNS system and X.509 Certificate Authority system. 
 | |
| </p>
 | |
| 
 | |
| <p>
 | |
|   For more information on how to get started with Namecoin, see my 
 | |
|   <a href="https://sequentialread.com/how-to-register-a-namecoin-bit-domain-with-electrum-nmc/">
 | |
|     Namecoin guide for webmasters</a>.
 | |
| 
 | |
| </p>
 | |
| 
 | |
| <hr/>
 | |
| <p>
 | |
|   <sup id="ref_1">[1]</sup> <a href="https://www.doitwithoutdues.com/">fuck amazon</a>
 | |
| </p>
 | |
| 
 | |
| </div>
 | |
| {% endblock %}
 | |
| 
 | |
| {% block pagesource %}/templates/about-ssh.html{% endblock %}
 | |
| 
 |