[net.crypt] need frequency tables, or reference

root@aecom.UUCP (05/04/84)

	I would appreciate it if someone could supply me with either
frequency tables, digraph and tri-graph charts for English and other
languages, or programs to compute them, or references to easily obtainable
books that contain such information.

				Sincerely,
-- 

				Alex S. Fuss
			{philabs, esquire, cucard}!aecom!alex

gwyn@brl-vgr.ARPA (Doug Gwyn ) (05/12/84)

There is a book entitled something like "Elementary Cryptanalysis" by
Helen F. Gaines that contains lots of frequency tables and so forth.

It is nice however to have a program around for running lots of files
through to build such tables, since different plaintext domains have
different statistics.  For cryptanalysis work you should have table-
driven analysis programs that among other things derive their model
of plaintext characteristics from such tables, much as the Writer's
Workbench derives its style standards from specified model documents.
It is almost trivial to write such a program, especially in a decent
character-handling language like C.  All you have to do is:
	initialize_array_to_all_zero_counts;
	initialize_total_objects_processed_counter_to_zero;
	for each input character, digram, trigram, etc.:
		increment_corresponding_array_location;
		increment_total_objects_processed_counter;
Then the final array elements, divided by the total_objects counter,
are the relative (character, digram, trigram, etc.) frequencies.