Smart Computing-Editorial

Reference Series


How Computers Work, Part I
August 2001• Vol.5 Issue 3 Page(s) 150-155 in print issue
Deciphering The Secret World Of Encryption A Look At The Methods Used To Keep Your Identity & Information Safe

Trying to protect important information in a digital environment is not an easy process. You must take steps to protect yourself and your information from being intercepted. One common data defender you can use is encryption.

Encryption is the process of obscuring the data in a file to prevent others from gaining access to its contents. An encrypted file appears as a string of gibberish. For someone to read or use the file, that person needs to decrypt it. Decryption is the process of translating an encrypted file into its original, readable form. In most cases, only the users with the correct key (a type of password) can turn the file in a readable one.

In the electronic world, data encryption has a variety of purposes, including safeguarding data file integrity and protecting e-mail and e-commerce (electronic commerce) transmissions. We also use encryption to authenticate a user’s right to access a computer resource and to determine a user’s identity across a network.

The study of encryption and decryption methods is called cryptology. In the next couple of pages, we will give you a glimpse into the world of the cryptologist and show you how data is protected as it speeds across the Information Superhighway.

Clearing The Code. Before you begin your studies, you need to learn some basics. First, data that has not been encrypted is termed cleartext (also known as plaintext); encrypted data is called ciphertext.

Second, the procedure used to encrypt cleartext is a cipher, or encryption algorithm. Algorithms are numerically based lists of instructions involving mathematical functions.

Symmetric vs. Asymmetric Keys. Encryption relies heavily on encryption keys, values that affect how algorithms produce ciphertext. There are two different types of encryption keys: symmetric (secret) key and asymmetric (public) key.

A symmetric key is used to encrypt and decrypt the same message. This means both the sender and the recipient must know the key. Symmetric-key cryptography is used to encrypt data files and transmissions between computer systems. Its primary weakness is that both machines must share the key during a transmission. This makes the key vulnerable to interception and decryption. Plus, a challenge exists in finding a secure method to transfer the key.

Blowfish, Bruce Schneier’s royalty-free symmetric-key code, is a 64-bit block algorithm that can use keys up to 448 bits in length.

In asymmetric-key cryptography, a widely known key called a public key is used to encrypt a message, while the private key, known only to the recipient, is used to decrypt the message. Private keys are mathematically derived from the public key. In order to keep people from discovering the private key from a public key, the key should be a very long and unique binary number based on a complex algorithm (see below for more information on key length). Its primary weakness is the length of time it takes to encrypt and decrypt files.

Public-key cryptography is often used to create digital signatures, which verify the clear- text’s origin and integrity. Digital signatures are used in e-mail and e-commerce systems. Additionally, public-key cryptography is often used to encrypt symmetric keys, called session keys when used this way, so they safely pass between two computer systems. Public keys do not exist forever; they have expiration dates in order to keep them from being compromised.

Ancient Secrets. Once you have a key, you need an algorithm. There are two major classes of algorithms: substitution ciphers and transcription ciphers. Substitution ciphers replace elements of clear text with other values. Transcription ciphers change the order of clear text elements to produce cipher text.

To understand these algorithms and the history of cryptology, one need only to look at the art of war. Even ancient history recounts the military commanders’ needs for developing communication methods that their enemies could not understand.

The ancient Spartans, for example, used a transcription cipher. They wound a narrow strip of parchment around a staff called a scytale. Then they wrote message characters that ran down the length of the parchment spiral. If someone unwound the parchment strip from the scytale, the order of the characters became scrambled. The only way to decode the message was by wrapping the parchment over a scytale of the same circumference.

We can make a tabular transcription cipher illustrating how the Spartan’s cipher would work. Let’s assume our scytale has a circumference of five characters.

The message “ATTACK THE PERSIANS AT DAWN” would look like this:

XXXXX (our scytale’s diameter)

ATTAC

KTHEP

ERSIA

NSATD

AWN

By reading down each column of letters we get the encrypted message “AKENATTRSWTHSANAEITCPAD.” If you unwound the parchment, however, you would only be able to read a jumble of characters.

Back in the early days, they even used substitution ciphers. The Roman legions of Julius Caesar used a substitution code where they replaced each character with a letter that was three characters to the right of the original letter. For instance, they would have replaced the letter “J” by the letter “M.”

A sample message in the Caesar cipher would look like this:

plain text:ATTACK THE GAULS AT DAWN

cipher text:CWWCFNWKHJCXOVCWGCZQ

To decipher the message, the recipient would simply substitute a letter three letters to the left for every character in the ciphertext.

The Caesar cipher is still with us today in ROT13, a simple encryption algorithm. This algorithm uses the 13th letter to the right of a plaintext element in order to produce the ciphertext element. Some Usenet news-client programs have a ROT13 button or keystroke to quickly hide sensitive text from someone peering over a user’s shoulder.

From Character To Bit. The ciphers of today work at a much deeper level. Instead of treating each letter as the basic cleartext element, encryption algorithms now assign each typographic character a binary value composed of zeros and ones. This makes each bit (either a one or a zero) in the binary number represent the basic cleartext element.

For instance, if the cryptographic program assigns the cleartext letter “a” the binary value of “100000,” it can then alter each bit, making it more difficult to deduce cleartext from the ciphertext. Each bit can be changed using the XOR (exclusive-OR) mathematical operation. The following table shows the possible values for XOR bit operations. (The character “Å” represents the XOR operation.)

0 Å 0 = 0

1 Å 0 = 1

0 Å 1 = 1

1 Å 1 = 0

The number “100000” could be XORed one bit at a time to another binary number. For the sake of example, let’s use “111001” as the encryption key (which is a very short key). In practice, keys are comprised of extremely large binary numbers, the best may contain 128 bits or more that depend on the encryption algorithm. The XOR operation for our example would render the following result.

100000 (plain text)

111000 (key)

011000 (cipher text)

XOR operations are also reversible. Thus, we can decipher our ciphertext by reapplying the same key of “111000” to give us our plaintext of “100000” or “a.”

011000 (cipher text)

111000 (key)

100000 (plain text)

Optimum Key Length. Keys are an essential element of cryptography, which is the science of transforming a file into ciphertext and back into plaintext. Even though some key lengths are better than others, there is not an optimum key length. For symmetric-key cryptography, keys range in size from 56 bits to 128 bits. In asymmetric-key cryptography, keys range from 384 bits to 2,034 bits. In general, the longer the key is, the harder it is to decrypt.

Regardless of length, cryptographic keys must be unique. At the Network Security conference in June 1999, cryptologist Bruce Schneier said, “Some people obsess about key length. A long key does not equal a strong system.”

In Schneier’s book, “Applied Cryptography,” Schneier cites examples of bad keys. These include a name, initials, personal interests, words in any language, names from literature and legend, substituting numbers for letters in names and words, and word pairs. Instead, he recommends a key writer use a range of randomly ordered bits.

When a random bit key is not easily generated, Schneier says a pass phrase is the next best choice. A pass phrase is a long password that is converted into a bit pattern using a key-crunching technique, which converts the phrases into random keys. Thus a line of poetry converted to binary code stands a better chance of making a key more difficult to decode than creating a key from one word.

Symmetric-Key Algorithms. Both the Caesar cipher and the Spartan scytale cipher exemplify private-key ciphers. The keyspace (the range of variations an encryption key can produce) of each of these ciphers is fairly limited. A cryptanalyst, a person who studies the art of deducing the meaning of encrypted data, could quickly determine the clear text.

One-time pad. Even though it might appear that private-key ciphers are easy to break once the cryptanalyst knows the key, one private key algorithm scheme is unbreakable: the one-time pad. Gilbert Vernam and Major Joseph Mauborgne invented it in 1917.

In the one-time pad system, both the sender and recipient have a pad, which is a range of key values used once and then discarded. The name one-time pad recalls the days when diplomatic and military cryptologists used paper pads that contained the key or keys used at a given time period. As each period ended, the cryptologists destroyed the keys for that time range.

It can be very difficult to keep one-time pad keys synchronized. Should either sender or receiver loose track of which key was used last, then that person cannot decode the cipher text. Computer-driven one-time pad systems allow algorithms to change the key with each cleartext element. Even with complete knowledge of a cryptographic algorithm and a sample of the ciphertext, an unknown, constantly changing key can keep the cleartext secret.

Cryptologists can approach the security of the one-time pad system by using cryptographic-grade random-number generators. Symmetric algorithms can make use of a large random number as an encryption key. For secrecy’s sake, this key is changed often.

However, the random-number generator must not repeat numbers already generated. Random-number generators in computer language compilers such as BASIC are actually pseudo-random-number generators. They produce a large set of numbers that appear to be random, but in time, the numbers repeat.

Because patient cryptanalysts look for these repeats, cryptologists have to develop their own random-number generators. For an example, go to the Couterpane Web site where you can find the freely available source code belonging to Bruce Schneier’s Yarrow cryptographic random-number generator (http:// www.counterpane.com/yarrow.html).

Available Symmetric-Key Versions. There are two methods of encrypting cleartext elements using symmetric-key algorithms: the block method and the streaming method. With block ciphers, the algorithm encrypts a fixed range of bits—up to 64-bit blocks or larger are often used. With stream ciphers, algorithms work with either a one-bit or one-byte (8-bit) element at a time. Here are a few of the symmetric-key algorithms you may encounter.

The Digital Encryption Standard. The DES (Digital Encryption Standard) was developed for the U.S. Government in 1977, based on work derived from Lucifer, an IBM cipher. DES is still in use today. Some governments and the financial industry tend to favor this standard. DES is a 64-bit block algorithm that uses a 56-bit key. Because some cryptologists no longer believe DES is secure, Triple-DES (or 3DES) was created to beef up the algorithm. In this version, the plaintext is encrypted three times by three keys. The keys are used in reverse to decrypt the cipher text.

International Data Encryption Algorithm. In 1990, James L. Massey and Xuejia Lai developed IDEA (International Data Encryption Algorithm) in Zurich, Switzerland. It is one of the best-known algorithms, having been used in the popular PGP (Pretty Good Privacy) encryption program. IDEA uses a 128-bit key, is very strong, and is available free of charge to non-profit organizations. Ascom-Tech holds IDEA’s international patents.

Blowfish. Bruce Schneier’s symmetric-key algorithm is Blowfish. It is a 64-bit block algorithm that can use keys up to 448 bits in length. It is a royalty-free algorithm designed as a drop-in replacement for DES and IDEA with free source code. Additional information is available at The Blowfish Encryption Algorithm page at http:// www.counterpane.com/blowfish.html.

RC2. Ronald Rivest of RSA Data Security Inc. (http://www.rsa.com) developed RC2, a block algorithm. RC2 uses keys from one to 2,048 bits.

RC4. The stream algorithm RC4 is another of Ronald Rivest’s creations. It also allows the use of keys from one to 2,048 bits.

RC5. RC5 is a Rivest block algorithm. Published in 1995, it allows a user to determine key size, block length, and the number of encryption rounds.

Asymmetric Algorithms. Public-key cryptography uses different keys for decryption and encryption. The keys must be mathematically related in order to work. In general, these mathematical problems allow for the creation of large integers that can be easily generated, however, the numbers used to produce them are difficult to determine. Here are a few of the asymmetric-key versions.

Certicom offers security for Palm and Windows CE PDAs (personal digital assistants).

Diffie-Hellman. Whitfield Diffie and Martin Hellman developed the first public-key algorithm in 1976 at Stanford University. The Diffie-Hellman algorithm is in use today for key exchange between computers. The algorithm is based on the discrete logarithm problem. Its security comes from the difficulty of finding an exponent, given the base and the result.

RSA. RSA, developed in 1977 at MIT by Ronald Rivest, Adi Shamir, and Leonard Adelman, is the most common public-key algorithm. RSA is used for data encryption and digital signing. The algorithm is based on the difficulty of factoring large prime numbers. RSA Data Security Inc. markets RSA in the United States. The U.S. patent covering RSA expired Sept. 20, 2000.

ElGamal. ElGamal, described by T. ElGamal in 1985, is another discrete logarithm algorithm used for encryption and digital signatures. ElGamal is thought to be more secure because of its longer keys.

Elliptic Curve Public Keys. The computer power required to use either discrete logarithms or large prime factoring algorithms is very evident. RSA is supposedly 100 to 1,000 times slower than the symmetric DES algorithm. Faster approaches to public-key cryptography, however, are being investigated. One leader in this race is elliptic curve cryptography.

V. S. Miller and Neal Koblitz first independently described an application of the discrete logarithm algorithm in 1985. Elliptic curve public keys use the elliptic curve discrete log problem. It has the advantage of taking less computing power. The algebra is fairly straightforward, making software and hardware versions much easier to develop.

Elliptic curve cryptography is still fairly new. RSA and Baltimore Technologies (http://www.baltimore.com), however, have offered products using it. Certicom’s (http://www.certicom.com) version offers security for Palm and Windows CE PDAs (personal digital assistants). The U.S. Government first used this product on June 30, 1998, to send a check for $32,153 over the Internet to GTE for work done on a military contract.

Message Digests. During an encrypted session, message digests are used to check the validity or the data authenticity of the information. If the digest doesn’t match the information transmitted, it has either been scrambled by a bad network connection or someone has altered the information. Message digests use a hash function, which takes input and creates a smaller unique value. Hash functions convert text into digits, generally 128 to 160 bits in length. Message digest algorithms take input of any length, say a program file or an e-mail message, and produce a unique number representing the input data that is generally 128 to 160 bits long, depending on the message digest algorithm.

Message digest algorithms are very sensitive to minor changes in the input, making them ideal for checking against tampering. In addition, they make it very difficult to deduce the input data from the final message digest value.

Message Digest 5. Ronald Rivest created MD5 (Message Digest 5). His earlier attempts, MD2 and MD4, proved slow and not as secure. RSA Data Security distributes it free of charge.

Secure Hash Algorithm. The NIST (National Institute of Standards and Technology) and the NSA (National Security Agency) developed the SHA-1 (Secure Hash Algorithm). It was intended for use with the DSS (Digital Signature Standard) and was modeled on Rivest’s earlier MD4 algorithm, along with some improvements. SHA-1 produces a 160-bit message digest instead of a 128-bit one.

RIPE-MD. RIPE-MD is a European effort to create an MD4-derived message digest. A later version, RIPEMD-160, produces a 160-bit digest. It’s intended to replace MD4 and MD5 designs.

HAVAL. Yuliang Zheng, Josef Peiprzyk, and Jennifer Seberry developed the message digest algorithm HAVAL. It is a variation on MD5 that allows hash values from 92 bits to 256 bits in length and has a variable number of encryption rounds.

MACs. Essentially, a MAC (message authentication code) acts like a message digest, except that it is encrypted with a secret key. There are several ways to create a MAC, through a one-time key, a one-way hash function with a key (HMAC), or through stream and block algorithms. A MAC is considered a tamper-resistant method of ensuring the message has not been altered.

Signatures, Envelopes & Certificates. Taking a message digest and encrypting it with a private key is one way to make a digital signature. The use of the private key verifies the authenticity, making it difficult to claim you didn’t “sign” something.

A digital envelope is used to send a secret symmetric key (session key) to another user. It is produced by encrypting the session key with the recipient’s public key.

Digital certificates are issued from a “trusted” server called a certificate authority. The following example describes it best. Bob applies for a digital certificate and is given a certificate containing a private key. Bob’s public-key certificate is on the certificate authority. Alice and Fred, who also use the certificate authority, can encrypt items for Bob using his public certificate without ever having to contact him for it.

Digital certificates can contain the user’s name, the name of the issuing certificate authority, a serial number, the key, the date issued, the expiration date, and the issuer’s digital signature. Once issued, a CRL (certificate revocation list) tacks the digital certificate.

In a PKI (public-key infrastructure), several certificate authorities are linked. Maintaining and validating certificates in large PKI systems can become daunting. Large commercial PKI services include VeriSign (http://www.verisign.com) and GTE CyberTrust (acquired by Baltimore Technologies in January 2000).

Internet Security Protocols. You can see the importance of cryptographic algorithms in their use to secure communication on the Internet. Here are some security protocols that make e-business possible:

•The IPSec (Internet Protocol Security Protocol) is the standard for cryptographic authentication of datagrams at the network layer. It uses RSA, Diffie-Hellman, MD5, DES, Triple-DES and SHA-1.

•PPTP (Point-to-Point Tunneling Protocol) creates VPN (virtual private network) connections between computers. It uses RSA and DES.

•Developed jointly by VISA and MasterCard, the SET (Secure Electronic Transactions) security protocol protects Internet credit card transactions. It uses RSA, DES, SHA-1, and HMAC-SHA1.

•S/MIME (Secure Multipurpose Internet Message Extensions) ensures secure transmission, forwarding, and authentication of data at the application layer. It uses RSA, DES, Triple-DES, RC2, MD5, and SHA-1.

•The SSH (Secure Shell) gives users secure remote access to another computer over a network. SSH is used to replace the weak “remote services” often misused by hackers for computer break-ins. It uses RSA, RC5, RC4, RC2, DES, and 3DES.

•At the transport layer, the SSL (Secure Sockets Layer) provides a secure link between two computers, allowing authentication and encrypted data exchange. An improved version is TLS (Transport Layer Security). SSL was first developed by Netscape. SSL uses RSA, RC4, SHA-1, MD5, 3DES, DES, and Diffie-Hellman.

Stop The Hack Attack. System security hasn’t become important, it’s become vital. As the technology becomes more sophisticated, so will the codes that accompany it in an effort to stay one step ahead of the hackers.

by Bill Hayes

View the graphics that accompany this article.
(NOTE: These pages are PDF (Portable Document Format) files. You will need Adobe Acrobat Reader to view these pages. Download Adobe Acrobat Reader)

Terms To Know

algorithm —A list of mathematically based instructions used to accomplish a task such as encryption or decryption.

ciphertext—Encrypted text or data that cannot be read. The ciphertext appears as gibberish.

decryption—The process of translating encrypted data back into its original language. Only users with the correct encryption key make the file readable.

plaintext—An encryption term that refers to normal, readable text prior to encryption and following decryption. This term is used interchangeably with the term cleartext.

public-key encryption—An asymmetric scheme that uses two keys for encryption: a public key and a private key. The public key encrypts the data; the private key decrypts the data. Each person’s public key may be published and therefore known by everyone, but the private key is kept exclusive to the holder. Messages are then encrypted using the recipient’s public key and thus can only be decrypted by the recipient’s private key.

private-key encryption—A symmetric scheme that uses one secret key for encryption and decryption. This means both the sender and the recipient of the data must know the secret key. The problem with this is trying to find a secure way of passing the secret key onto the message recipient for decoding.

Want more information about a topic you found of interest while reading this article? Type a word or phrase that identifies the topic and click "Search" to find relevant articles from within our editorial database.