DEC RADIX 50

RADIX 50[1][2] or RAD50[2] (also referred to as RADIX50,[3] RADIX-50[4] or RAD-50), is an uppercase-only character encoding created by Digital Equipment Corporation (DEC) for use on their DECsystem, PDP, and VAX computers.

RADIX 50's 40-character repertoire (050 in octal) can encode six characters plus four additional bits into one 36-bit machine word (PDP-6, PDP-10/DECsystem-10, DECSYSTEM-20), three characters plus two additional bits into one 18-bit word (PDP-9,[1] PDP-15),[5] or three characters into one 16-bit word (PDP-11, VAX).[2]

The actual encoding differs between the 36-bit and 16-bit systems.

36-bit systems

In 36-bit DEC systems RADIX 50 was commonly used in symbol tables for assemblers or compilers which supported six-character symbol names from a 40-character alphabet. This left four bits to encode properties of the symbol.

For its similarities to the SQUOZE encoding scheme used in IBM's SHARE Operating System for representing object code symbols, DEC's variant was also sometimes called DEC Squoze,[6] however, IBM SQUOZE packed six characters of a 50-character alphabet plus two additional flag bits into one 36-bit word.[5]

RADIX 50 was not normally used in 36-bit systems for encoding ordinary character strings; file names were normally encoded as six six-bit characters, and full ASCII strings as five seven-bit characters and one unused bit per 36-bit word.

PDP-6, PDP-10/DECsystem-10, DECSYSTEM-20[3]
Most
significant
bits
Least significant bits
000 001 010 011 100 101 110 111
000 space0123456
001 789ABCDE
010 FGHIJKLM
011 NOPQRSTU
100 VWXYZ.$%

18-bit systems

RADIX 50 (also called Radix 508 format[1]) was used in Digital's 18-bit PDP-9 and PDP-15 computers to store symbols in symbol tables, leaving two extra bits per 18-bit word ("symbol classification bits").[1]

16-bit systems

Some strings in DEC's 16-bit systems were encoded as 8-bit bytes, while others used RADIX 50 (then also called MOD40).[2][7]

In RADIX 50, strings were encoded in successive words as needed, with the first character within each word located in the most significant position.

For example, using the PDP-11 encoding, the string "ABCDEF", with character values 1, 2, 3, 4, 5, and 6, would be encoded as a word containing the value 1×402 + 2×401 + 3×400 = 1683, followed by a second word containing the value 4×402 + 5×401 + 6×400 = 6606. Thus, 16-bit words encoded values ranging from 0 (three spaces) to 63999 ("999"). When there were fewer than three characters in a word, the last word for the string was padded with trailing spaces.[2]

There were several minor variations of this encoding. For example, the RT-11 operating system considered the character corresponding to value 011101 (%) to be undefined,[2] and some utility programs used that value to represent the * character instead.

The use of RADIX 50 was the source of the filename size conventions used by Digital Equipment Corporation PDP-11 operating systems. Using RADIX 50 encoding, six characters of a filename could be stored in two 16-bit words, while three more extension (file type) characters could be stored in a third 16-bit word. The period that separated the filename and its extension was implied (i.e., was not stored and always assumed to be present). RADIX 50 was also commonly used in the symbol tables of the various PDP-11 programming languages.

PDP-11, VAX[2][4]
Most
significant
bits
Least significant bits
000 001 010 011 100 101 110 111
000 spaceABCDEFG
001 HIJKLMNO
010 PQRSTUVW
011 XYZ$.%01
100 23456789
gollark: Ah.
gollark: Is there not some extremely small one you can use?
gollark: Run a CGI server of some kind?
gollark: Eßentially, finding *any match* is easy via bitsets, but finding the *best* match requires actually looking at each document to find the term frequency (and maybe term positions), which sounds slow.
gollark: Hmm. I'm not actually sure how to fastly™ do ranking of search results for this.

See also

References

  1. "Appendix 1". PDP-9 Utility Programs--Advanced Software System--Programmer's Reference Manual (PDF). Maynard, Massachusetts, USA: Digital Equipment Corporation. 1968. Order No. DEC-9A-GUAB-D. Archived (PDF) from the original on 2020-06-04. Retrieved 2020-06-04.
  2. "8.10 .RAD50". PAL-11R Assembler - Programmer's Manual - Program Assembly Language and Relocatable Assembler for the Disk Operating System (2nd revised printing ed.). Maynard, Massachusetts, USA: Digital Equipment Corporation. May 1971 [February 1971]. p. 8-8. DEC-11-ASDB-D. Retrieved 2020-06-18. […] PDP-11 systems programs often handle symbols in a specially coded form called RADIX 50 (this form is sometimes referred to as MOD40). This form allows 3 characters to be packed into 16 bits; therefore, any 6-character symbol can be held in two words. The single operand is of the form /CCC/ where the slash (the delimiter) can be any printable character except for = and : . The delimiters enclose the characters to be converted which may be A through Z, 0 through 9, dollar ($), dot (.) and space ( ). If there are fewer than 3 characters they are considered to be left justified and trailing spaces are assumed. […] The packing algorithm is as follows: […] A. Each character is translated into its RADIX 50 equivalent as indicated in the following table: Character - RADIX 50 Equivalent (octal): (space) - 0, A–Z - 1–32, $ - 33, . - 34, 0–9 - 36–47. Note that another character could be defined for code 35. […] B. The RADIX 50 equivalents for characters 1 through 3 (C1,C2,C3) are combined as follows: RESULT=((C1*50)+C2)*50+C3 […]
  3. Durda IV., Frank (2004). "RADIX50 Character Code Reference". Archived from the original on 2005-03-31. Retrieved 2005-03-31.
  4. "Appendix B.3: Radix-50 Constants and Character Set". Compaq Fortran 77 Language Reference Manual. Compaq Computer Corporation. 1999. Archived from the original on 2012-10-14. Retrieved 2012-10-14.
  5. Jones, Douglas W. (2018). "Lecture 7, Object Codes, Loaders and Linkers - Final steps on the road to machine code". Operating Systems, Spring 2018. Part of the CS:3620 Operating Systems Collection. Department of Computer Science, The University of Iowa. Archived from the original on 2020-06-06. Retrieved 2020-06-06.
  6. Murrell, Stephen J. (2005). "DEC/PDP Character Codes". rabbit.eng.miami.edu. University of Miami. DEC Squoze Character Table. Archived from the original on 2020-06-19. Retrieved 2020-06-19.
  7. PDP-11 Getting DOS on the Air (1 ed.). Maynard, Massachusetts, USA: Digital Equipment Corporation. August 1971. DEC-11-SYDC-D. Retrieved 2020-06-18.

Further reading

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.