Cracking the Universal Product Code by Count Nibble --------------- Have you ever though of what fun you could have by altering that little set of black bars? If you were lucky enough, you might be able to slip a box of industrial size laundry detergent by that dizzy 16- year-old girl at the Safeway and have the computer charge you the price of a pack of Juicy Fruit, or some other such mischief. Well, to help you in your explorations of How To Screw Over Others In This Grand Old Computerized World of Ours, I proudly present HOW TO CRACK TO UPC CODE. Use the information contained herein as you will. You will need the file UPC.PIC, hopefully available from the same place you found this file. And so, let's begin: When the lady at the corner market runs the package over the scanner (or whatever it is they do in your area), the computerized cash register reads the UPC code as a string of binary digits. First it finds the "frame bars" - a sequence of "101" (see A on picture). There are three sets of frame bars on any given code...one on either side, and one in the center. These do nothing but set off the rest of the data, and are the same on any UPC code. Next is the "number system character" digit, which is encoded in leftside code (see later). This digit tells the computer what type of merchandise is being purchased. The digits and their meanings are: 0 - Ordinary grocery items. Bread, magazines, soup, etc. 2 - Variable-weight items. Meats, fruits & veggies, etc. 3 - Health items. Aspirin, bandaids, tampons, etc. 5 - Cents-off coupon. (Not sure how this works). The next cluster of digits is the manufacturer number, again stored in leftside code. THere are five digits here all the time. Some numbers include 51000 for Campbell's Soup, 14024 for Ziff-Davis publishing (Creative Computing, A...), and 51051 for Infocom. The next five digits (after the frame bars) are the product/size id number. The number for "The Hitchhiker's Guide to the Galaxy" from Infocom is 01191. These digits are stored in rightside code. Finally there is the checksum, in rightside, which will be discussed later. Now, why are there two types of codes, leftside and rightside? That's so the person at the checkout counter can slide the thing by the scanner any way she pleases. By having different codings for either side the computer can tell the right value no matter how the digits are read in. Here are the codes for the digits 0 through 9: Digit Leftside code Rightside code ----- ------------- -------------- 0 0001101 1110010 1 0011001 1100110 2 0010011 1101100 3 0111101 1000010 4 0100011 1011100 5 0110001 1001110 6 0101111 1010000 7 0111011 1000100 8 0110111 1001000 9 0001011 1110100 The more observant among you may have noticed that Rightside code is nothing more than logical-NOTed Leftside code, i.e., a 0 in Leftside is a 1 in Right- side, and vice versa. Later on we will discuss another type called Reversed Rightside, in which the binary values in Rightside are reversed, meaning that 1110100 (9) in Rightside would be 0010111 in Reversed Rightside. RR is used only when there is an extra set of codes off to the right of the main code bars, as with books and magazines. Now we see the hard part: how the checksum digit is encoded. Let's try working out the checksum for "Hitchhiker's Guide". First, notice the Number System Character. Software is considered a Grocery Item by UPC, so the NSC is 0 (zero). Next, Infocom's Manufacturer's Number is 51051, and the game's id number is 01191. Good enough. Set together, these numbers look like this: 0 51051 01191 Now, take the digits of the code and write them on alternate lines, odd on one line, even below, giving this: 0 1 5 0 1 1 5 0 1 1 9 Now add each set of numbers: 0+1+5+0+1+1 = 8 5+0+1+1+9 = 16 Multiply the first number (the ones created by adding the first, third, etc digits) by three: 8x3 = 24 And add that to the result of the other number (second, fourth, etc digits added together): 24+16=40 Subtract this from the next higher or equal multiple of 10 (40 in this case) 40-40=0 And the remainder, here 0 (zero), is the checksum digit. Now, what if there's a set of other bars off to the side? For books, the sequence is as follows: Five digits Starts with 1011 If (first digit is even) then sequence is L-RR-L-L-RR else sequence is RR-L-L-RR-L each digit is separated with 01 Therefore, the sequence for 29656 is: 1011 0010011 01 0010111 01 0101111 01 0110001 01 0000101 2L 9RR 6L 5L 6RR and the sequence for 14032 is: 1011 0110011 01 0100011 01 0001101 01 0100001 01 0010011 1RR 4L 0L 3RR 2L Naturally, all these bars are run together. There is no checksum. For magazines, the sequence is even more complex. There are two digits in each bar, and the numbers usually run from 1-12, signifying the month. The first digits are encoded thusly: L if the digit is 1,4,5,8 or 9 and RR if the digit is 2,3,6,7 or 0. The second digit is coded in L if it is even, and RR if it is odd. Therefore, 06 codes as: 1011 0100111 01 0101111 and 11 codes as: 1011 0110011 01 0110011 No checksum here, either, and the fields are again separated by 01. Well, that about does it for this explanation of how to crack the UPC codes. Use this information as you will, and forward any question to THE SPACE BAR, 505-265-5178, pw:BANZAI. Enjoy! - Count Nibble -