add /about-ssh essay

2021-01-30 22:14:04 -06:00
parent c849ec3c11
commit 7217ff7a6d
4 changed files with 362 additions and 4 deletions
--- a/capsulflask/landing.py
+++ b/capsulflask/landing.py
@ -22,6 +22,10 @@ def pricing():
 def faq():
  return render_template("faq.html")

+@bp.route("/about-ssh")
+def about_ssh():
+  return render_template("about-ssh.html")
+
@bp.route("/changelog")
 def changelog():
  return render_template("changelog.html")
--- a/capsulflask/static/style.css
+++ b/capsulflask/static/style.css
@ -211,10 +211,22 @@ input[type=image].submit {
 }


-
-
-ul li {
+ul li, ol li {
  margin: 0.5em 0;
+  margin-left: 1.2rem;
+}
+
+.long-form p, .long-form li {
+  line-height: 2em;
+}
+
+.long-form p .code, .long-form li .code{
+  line-height: 1em;
+  padding: 5px;
+  padding-top: 3px;
+  margin-top: 2px;
+  padding-bottom: 4px;
+  border-radius: 4px;
 }

 hr {
--- a/capsulflask/templates/about-ssh.html
+++ b/capsulflask/templates/about-ssh.html
@ -0,0 +1,340 @@
+{% extends 'base.html' %}
+
+{% block title %}About SSH{% endblock %}
+
+{% block content %}
+  <div class="row full-margin"><h1>Understanding the Secure Shell Protocol (SSH)</h1></div>
+{% endblock %}
+
+{% block subcontent %}
+<div class="long-form">
+<p>
+  In order to use our service, you will have to use the Secure Shell protocol (SSH) to connect to your capsul.
+</p>
+<p>
+  SSH is a very old tool, created back when the internet was a different place, with different use cases and concerns.
+  In many ways, the protocol has failed to evolve to meet the needs of our 21st century global internet. 
+  Instead, the users of SSH (tech heads, sysadmins, etc) have had to evolve our processes to work around SSH's limitations.
+</p>
+<p>
+  These days, we use SSH + public-key cryptography to establish secure connections to our servers. 
+  If you are not familiar with the concept of public key cryptography, cryptographic signatures, 
+  or diffie-hellman key exchange, you may wish to see 
+  <a href="https://en.wikipedia.org/wiki/Public-key_cryptography">the wikipedia article</a> for a refresher.
+</p>
+
+<div class="row half-margin"><h1>Public Key Crypto and Key Exchange: The TL;DR</h1></div>
+
+<p>
+  Computers can generate <b>"key pairs"</b> which consist of a public key and a private key. Given a <b>public key pair A</b>:
+  <ol>
+    <li>
+      A computer which has access to <b>public key A</b> can encrypt data, 
+      and then <b>ONLY</b> a computer which has access <b>private key A</b> can decrypt & read it
+    </li>
+    <li>
+      Likewise, a computer which has access to <b>private key A</b> can encrypt data, 
+      and any a computer which has access <b>public key A</b> can decrypt it, 
+      thus <b>PROVING</b> the message must have come from someone who posesses <b>private key A</b>
+    </li>
+  </ol>
+  Key exchange is a process in which two computers, Computer A and Computer B (often referred to as Alice and Bob)
+  both create key pairs, so you have <b>key pair A</b> and <b>key pair B</b>, for a total of 4 keys:
+  <ol>
+    <li><b>public key A</b></li>
+    <li><b>private key A</b></li>
+    <li><b>public key B</b></li>
+    <li><b>private key B</b></li>
+  </ol>
+  In simplified terms, during a key exchange, 
+  <ol>
+    <li><b>computer A</b> sends <b>computer B</b> its public key</li>
+    <li><b>computer B</b> sends <b>computer A</b> its public key</li>
+    <li><b>computer A</b> sends <b>computer B</b> 
+      a message which is encrypted with <b>computer B</b>'s public key</li>
+    <li><b>computer B</b> sends <b>computer A</b> 
+      a message which is encrypted with <b>computer A</b>'s public key</li>
+  </ol>
+  The way this process is carried out allows A and B to communicate with each-other securely, which is great, <br/><br/>
+
+  <b><u>HOWEVER, there is a catch!!</u></b>
+</p>
+
+<p>
+  When computers A and B are trying to establish a secure connection for the first time, 
+  we assume that the way they communicate right now is NOT secure. That means that someone between A and B can read & modify
+  all messages they send to each-other! You might be able to see where this is heading... 
+</p>
+<p>
+  When <b>computer A</b> sends its public key to <b>computer B</b>, 
+  someone in the middle (lets call it <b>computer E, or Eve</b>) could record that message, save it, 
+  and then replace it with a forged message to <b>computer B</b> containing <b>public key E</b>
+  (from a key pair that <b>computer E</b> generated).
+
+  If this happens, when <b>computer B</b> sends an encrypted message to <b>computer A</b>, 
+  B thinks that A's public key is actually <b>public key E</b>, so it will use <b>public key E</b> to encrypt.
+  And again, <b>computer E</b> in the middle can intercept the message, and they can decrypt it as well 
+  because they have <b>private key E</b>.
+  Finally, they can relay the same message to <b>computer A</b>, this time encrypted with <b>computer A</b>'s public key. 
+  This is called a <b>Man In The Middle (MITM)</b> attack.
+</p>
+<p>
+  Without some additional verification method, 
+  <b><u>Computer A AND Computer B can both be duped and the connection is NOT really secure</u></b>.
+</p>
+
+<div class="row half-margin"><h1>Authenticating Public Keys: A Tale of Two Protocols</h1></div>
+
+<p>
+  Now that we have seen how key exhange works, 
+  and we understand that in order to prevent MITM attacks, all participants have to have a way of knowing 
+  whether a given public key is authentic or not, I can explain what I meant when I said
+</p>
+<p>
+  > [SSH] has failed to evolve to meet the needs of our 21st century global internet
+</p>
+<p>
+  In order to explain this, let's first look at how a different, more modern protocol, 
+  Transport Layer Security (or TLS) solved this problem. 
+  TLS, (still sometimes called by its olde name "Secure Sockets Layer", or SSL) was created to enable HTTPS, and allow 
+  internet users to log into web sites securely and purchase things online by entering their credit card number.
+  Of course, this required security that actually works; if someone could MITM attack the connection, they could easily
+  steal tons of credit card numbers and passwords.
+</p>
+<p>
+  In order to enable this, a new sub-protocol called <a href="https://en.wikipedia.org/wiki/X.509">X.509</a> was created. 
+  X.509 is a standard related to the data format of certificates and keys (public keys and private keys), but it also defines 
+  a simple and easy way to determine whether a given certificate (public key) is authentic. 
+  X.509 introduced the concept of a Certificate Authority, or CA. 
+  These CAs were supposed to be bank-like public institutions of power which everyone could trust. 
+  The CA would create a key pair on an extremely secure computer, and then a CA Certificate (the public side of that key pair)
+  would be distributed along with every copy of Windows, Mac OS, and Linux. Then companies who wanted to run a secure web server 
+  could generate thier OWN key pair for thier web server, 
+  and pay the CA to sign thier web server's X.509 certificate (public key) with the highly protected CA private key. 
+  Critically, issue date, expiration date, and the domain name of the web server, like foo.example.com, would have to be included 
+  in the x.509 certiciate along with the public key. 
+  This way, when the user types https://foo.example.com into thier web browser:
+
+  <ol>
+    <li>The web browser sends a TLS ClientHello request to the server</li>
+    <li>
+      The server responds with a ServerHello & ServerCertificate message
+      <ul>
+        <li>The ServerCertificate message contains the X.509 certificate for the web server at foo.example.com</li>
+      </ul>
+    </li>
+    <li>The web browser inspects the X.509 certificate
+      <ul>
+        <li>
+          Is the current date in between the issued date and expiry date of the certificate?
+          If not, display an <a href="https://expired.badssl.com/">EXPIRED_CERTIFICATE error</a>.
+        </li>
+        <li>
+          Does the domain name the user typed in, foo.example.com, match the domain name in the certificate? 
+          If not, display a <a href="https://wrong.host.badssl.com/">BAD_CERT_DOMAIN error</a>.
+        </li>
+        <li>
+          Does the certificate contain a valid CA signature? 
+          (can the signature on the certificate be decrypted by one of the CA Certificates included with the operating system?) 
+          If not, display a <a href="https://untrusted-root.badssl.com/">UNKNOWN_ISSUER error</a>.
+        </li>
+      </ul>
+    </li>
+    <li>Assuming all the checks pass, the web browser trusts the certificate and connects</li>
+  </ol>
+</p>
+<p>
+  This system enabled the internet to grow and flourish:
+  purchasing from a CA was the only way to get a valid X.509 certificate for a website, 
+  and guaranteeing authenticity was in the CA's business interest. 
+  The CAs kept their private keys behind razor wire and armed guards, and followed strict rules to ensure that only the right
+  people got thier certificates signed. 
+  Only the CA's themselves or anyone who had enough power to force them to create a fraudulent certificate 
+  would be able to execute MITM attacks.
+</p>
+<p>
+  The TLS+X.509 Certificate Authority works well for HTTP and other application protocols, because 
+  <ul>
+    <li>Most internet users don't have the patience to manually verify the authenticity of digital certificates.</li>
+    <li>Most internet users don't understand or care how it works; they just want to connect right now.</li>
+    <li>Businesses and organizations that run websites are generally willing to jump through hoops and 
+      subjugate themselves to authorities in order to offer a more secure application experience to thier users.</li>
+    <li>The centralization & problematic power dynamic which CAs represent 
+      is easily swept under the rug, if it doesn't directly or noticably impact the average person, who cares?</li>
+  </ul>
+</p>
+
+<p>
+  However, this would never fly with SSH. You have to understand, SSH does not come from Microsoft, it does not come from Apple, 
+  in fact, it does not even come from Linux or GNU. <a href="https://www.openssh.com/">SSH comes from BSD</a>. 
+  <a href="https://en.wikipedia.org/wiki/BSD">Berkeley Software Distribution</a>. Most people don't even know 
+  what BSD is. It's <i>Deep Nerdcore</i> material. The people who maintain SSH are not playing around, they would never 
+  allow themselves to be subjugated by so-called "Certificate Authorities".
+  So, what are they doing instead? Where is SSH at? Well, back when it was created, computer security was easy — 
+  a very minimal defense was enough to deter attackers. 
+  In order to help prevent these MITM attacks, instead of something like X.509, SSH uses a policy called 
+  <a href="https://en.wikipedia.org/wiki/Trust_on_first_use">Trust On First Use (TOFU)</a>. 
+</p>
+
+<p> 
+  The SSH client application keeps a record of every server it has ever connected to 
+  in a file <span class="code">~/.ssh/known_hosts</span>.
+</p>
+
+<p> 
+  (the tilde <span class="code">~</span> here represents the user's home directory, 
+  <span class="code">/home/username</span> on linux, 
+  <span class="code">C:\Users\username</span> on Windows, and 
+  <span class="code">/Users/username</span> on MacOS).
+</p>
+
+<p> 
+  If the user asks the SSH client to connect to a server it has never seen before, 
+  it will print a prompt like this to the terminal:
+</p>
+
+  <pre class="code">The authenticity of host 'fooserver.com (69.4.20.69)' can't be established.
+    ECDSA key fingerprint is SHA256:EXAMPLE1xY4JUVhYirOVlfuDFtgTbaiw3x29xYizEeU.
+    Are you sure you want to continue connecting (yes/no/[fingerprint])?</pre>
+
+<p>
+  Here, the SSH client is displaying the fingerprint (<a href="https://en.wikipedia.org/wiki/SHA-2">SHA256 hash</a>) 
+  of the public key provided by the server at <span class="code">fooserver.com</span>. 
+  Back in the day, when SSH was created, servers lived for months to years, not minutes, and they were installed by hand. 
+  So it would have been perfectly reasonable to call the person installing the server 
+  and ask them to log into it & read off the host key fingerprint over the phone.  
+  After verifing that the fingerprints match in the phone call, the user would type <span class="code">yes</span> 
+  to continue.
+</p>
+
+<p>
+  After the SSH client connects to a server for the first time, it will record the server's IP address and public key in the 
+  <span class="code">~/.ssh/known_hosts</span> file. All subsequent connections will simply check the public key 
+  the server presents against the public key it has recorded in the <span class="code">~/.ssh/known_hosts</span> file. 
+  If the two public keys match, the connection will continue without prompting the user, however, if they don't match, 
+  the SSH client will display a scary warning message:
+</p>
+<pre class="code">
+@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
+@       WARNING: POSSIBLE DNS SPOOFING DETECTED!          @
+@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
+The ECDSA host key for fooserver.com has changed,
+and the key for the corresponding IP address 69.4.20.42
+is unknown. This could either mean that
+DNS SPOOFING is happening or the IP address for the host
+and its host key have changed at the same time.
+@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
+@    WARNING: REMOTE HOST IDENTIFICATION HAS CHANGED!     @
+@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
+IT IS POSSIBLE THAT SOMEONE IS DOING SOMETHING NASTY!
+Someone could be eavesdropping on you right now (man-in-the-middle attack)!
+It is also possible that a host key has just been changed.
+The fingerprint for the ECDSA key sent by the remote host is
+SHA256:EXAMPLEpDDefcNcIROtFpuTiHC1j3iNU74aaKFO03+0.
+Please contact your system administrator.
+Add correct host key in /root/.ssh/known_hosts to get rid of this message.
+Offending ECDSA key in /root/.ssh/known_hosts:1
+  remove with:
+  ssh-keygen -f "/root/.ssh/known_hosts" -R "fooserver.com"
+ECDSA host key for fooserver.com has changed and you have requested strict checking.
+Host key verification failed.
+</pre>
+
+<p>
+  This is why it's called <b>Trust On First Use</b>: 
+  
+  SSH protocol assumes that when you type <span class="code">yes</span> in response to the prompt during your first connection, 
+  you <b>really did</b> verify that the server's public key fingerprint matches.
+  
+  If you type <span class="code">yes</span> here without checking the server's host key somehow, you could add an attackers public key to the trusted 
+  list in your <span class="code">~/.ssh/known_hosts</span> file; if you type <span class="code">yes</span> blindly, you are 
+  <b>completely disabling all security of the SSH connection</b>. 
+  It can be fully man-in-the-middle attacked & you are vulnerable to spying, command injection, result-falsification, 
+  the whole nine yards. 
+</p>
+
+<p>
+  So what are technologists to do? Most cloud providers don't "provide" a secure and reliable way to get the SSH host public keys
+  for instances that users create on thier platform. For example, see this 
+  <a href="https://serverfault.com/questions/941915/verify-authenticity-of-ssh-host-on-digital-ocean-droplet-freebsd">
+    question posted by a frustrated user trying to secure thier connection to a digitalocean droplet
+  </a>.
+
+  Besides using the provider's HTTPS-based console to log into the machine & directly read the public key, most of the time,
+  providers recommend using a "userdata script", which runs when the machine boots, to upload the machine's SSH public keys to a 
+  trusted location, like <a href="https://www.backblaze.com/b2/cloud-storage.html">Backblaze B2</a> or 
+  <del>Amazon S3</del><sup><a href="#ref_1">[1]</a></sup> storage for later retrieval by users. 
+  As an example, I wrote a  
+  <a href="https://git.sequentialread.com/forest/rootsystem/src/1cdbe53974d20da97d9f522d4bd62c34487817c0/terraform-modules/gateway-instance-digitalocean/upload_known_hosts.tpl#L5">
+    userdata script which does this</a>
+  for my own cloud compute management tool called 
+  <a href="https://git.sequentialread.com/forest/rootsystem">rootsystem</a>.
+  Later in the process, rootsystem will 
+  <a href="https://git.sequentialread.com/forest/rootsystem/src/1cdbe53974d20da97d9f522d4bd62c34487817c0/host-key-poller/main.go#L33">
+    download the public keys from the Object Storage provider 
+    and add them to the ~/.ssh/known_hosts file</a>
+  before finally 
+  <a href="https://git.sequentialread.com/forest/rootsystem/src/1cdbe53974d20da97d9f522d4bd62c34487817c0/terraform-modules/ansible-threshold-server/main.tf#L32">
+    invoking the ssh client against the cloud host</a>.
+</p>
+
+<p>
+  Personally, I think it's disgusting and irresponsible to require users to go through that much work
+  just to be able to connect to their instance securely. However, this practice appears to be an industry standard. 
+  It's gross, but it's where we're at right now.
+</p>
+
+<p>
+  So for <a href="https://capsul.org">capsul</a>, we obviously wanted to do better. 
+  We wanted to make this kind of thing as easy as possible for the user, 
+  so I'm proud to announce as of today, capsul SSH host key fingerprints will be displayed on the capsul detail page, 
+  as well as the host's SSH public keys themselves in <span class="code">~/.ssh/known_hosts</span> format.
+  Users can simply copy and paste these keys into thier <span class="code">~/.ssh/known_hosts</span> file and connect 
+  with confidence that they are not being MITM attacked. 
+</p>
+
+<div class="row half-margin"><h1>It's 2021. Can't we do better than this? What's next?</h1></div>
+
+<p>
+  Glad you asked 😜.
+</p>
+<p>
+  TLS is great, except it has one problem: the X.509 CA system centralizes power and structurally invites abuse.
+  Power corrupts, and absolute power corrupts absolutely. But there is hope for the future: with the invention of Bitcoin
+  in 2009, we now have a new tool to use for authority-free secure consensus. Some bright folks have forked Bitcoin to produce
+  <a href="https://www.namecoin.org/">Namecoin</a>, a DNS-like public blockchain which is 
+  <a href="https://en.bitcoin.it/wiki/Merged_mining_specification">merge-mined</a> with Bitcoin, and which allows users to 
+  <a href="https://sequentialread.com/how-to-register-a-namecoin-bit-domain-with-electrum-nmc/">
+    register and trade names, including domain names
+  </a>. In fact, Namecoin features a 
+  <a href="https://github.com/namecoin/proposals/blob/master/ifa-0003.md">
+    specification for associating public keys with domain names
+  </a>
+  and easy-to-use client software packages capable of resolving these 
+  <a href="https://www.namecoin.org/download/betas/#ncdns">names</a> 
+  &
+  <a href="https://www.namecoin.org/download/betas/#ncp11">
+    public</a>
+
+  <a href="https://www.namecoin.org/resources/presentations/Grayhat_2020/Namecoin_TLS_Part_2_Grayhat_2020_Monero_Village.pdf">
+    keys</a>, 
+  capable of replacing both the DNS system and X.509 Certificate Authority system. 
+</p>
+
+<p>
+  For more information on how to get started with Namecoin, see my 
+  <a href="https://sequentialread.com/how-to-register-a-namecoin-bit-domain-with-electrum-nmc/">
+    Namecoin guide for webmasters</a>.
+
+</p>
+
+<hr/>
+<p>
+  <sup id="ref_1">[1]</sup> <a href="https://www.doitwithoutdues.com/">fuck amazon</a>
+</p>
+
+</div>
+{% endblock %}
+
+{% block pagesource %}/templates/about-ssh.html{% endblock %}
+
--- a/capsulflask/templates/capsul-detail.html
+++ b/capsulflask/templates/capsul-detail.html
@ -93,7 +93,9 @@
    <pre class="code">{% for key in vm['ssh_host_keys'] %}
 SHA256:{{ key.sha256 }} ({{ key.key_type }}){% endfor %}</pre>
  </div>
-
+  <div class="row">
+    What's this? see <a href="/about-ssh">Understanding the Secure Shell Protocol (SSH)</a>
+  </div>
  <div class="row ">
    <hr/>
  </div>