root@aecom.UUCP (05/04/84)
I would appreciate it if someone could supply me with either frequency tables, digraph and tri-graph charts for English and other languages, or programs to compute them, or references to easily obtainable books that contain such information. Sincerely, -- Alex S. Fuss {philabs, esquire, cucard}!aecom!alex
gwyn@brl-vgr.ARPA (Doug Gwyn ) (05/12/84)
There is a book entitled something like "Elementary Cryptanalysis" by Helen F. Gaines that contains lots of frequency tables and so forth. It is nice however to have a program around for running lots of files through to build such tables, since different plaintext domains have different statistics. For cryptanalysis work you should have table- driven analysis programs that among other things derive their model of plaintext characteristics from such tables, much as the Writer's Workbench derives its style standards from specified model documents. It is almost trivial to write such a program, especially in a decent character-handling language like C. All you have to do is: initialize_array_to_all_zero_counts; initialize_total_objects_processed_counter_to_zero; for each input character, digram, trigram, etc.: increment_corresponding_array_location; increment_total_objects_processed_counter; Then the final array elements, divided by the total_objects counter, are the relative (character, digram, trigram, etc.) frequencies.