[net.news.group] More stats as of Nov. 10, 1985

mangoe@umcp-cs.UUCP (Charley Wingate) (11/11/85)

Here are the average article sizes and top 25 groups minus top 25 users for
Nov. 10:

     Orig. Avg. Art. New      %
Rank Rank    Size   Kbytes   Chg.   Group 
  1    1      1.4    563.8    4.1%  net.news.group
  2    2      1.7    319.3   23.0%  net.politics
  3    6      1.5    308.4    7.5%  net.flame
  4   10      1.5    244.0    3.7%  net.news
  5   13      1.0    234.0    0.0%  net.movies
  6   14      1.3    233.6    0.0%  net.women
  7    9      1.3    224.9   20.5%  net.micro.mac
  8    7      2.4    208.8   32.7%  net.religion
  9    3      4.0    208.7   46.7%  net.sources
 10    4      1.1    197.1   48.1%  net.music
 11    5      2.3    189.8   50.1%  net.philosophy
 12   12      2.0    188.0   21.1%  net.religion.christian
 13   17      0.6    185.8    0.0%  net.sf-lovers
 14   19      1.1    181.9    0.0%  net.audio
 15   20      0.8    174.9    1.9%  net.unix-wizards
 16   15      2.8    158.3   28.8%  net.politics.theory
 17   23      0.8    151.0    0.0%  net.cooks
 18   21      1.0    150.5   10.4%  net.lang.c
 19   25      0.6    142.2    2.2%  net.jokes
 20   11      3.4    141.1   44.1%  net.origins
 21   22      0.8    139.9    8.8%  net.unix
 22   16      1.1    138.6   27.9%  net.micro.amiga
 23   24      1.2    136.0    7.2%  net.arch
 24   18      1.5     89.3   51.2%  net.micro.pc
 25    8      8.9     55.0   81.8%  net.sources.mac

Again, note the number of large drops.

For comparison, Rich Rosen (the top user) would have been 3rd on this list.

Charley Wingate

ems@amdahl.UUCP (ems) (11/27/85)

It would be interesting to see what percentage of total
volume and what percentage of each group volume was made up of
headers and footers.  By what percent would total net volume
drop if cute .signatures were eliminated and a standard
disclaimer were appended to all articles.  (Rather than having
each person come up with the obligatory disclaimer...)

This should save at least the couple of percent that the major
groups consume.

-- 

E. Michael Smith  ...!{hplabs,ihnp4,amd,nsc}!amdahl!ems

'If you can dream it, you can do it'  Walt Disney

This is the obligatory disclaimer of everything. (Including but
not limited to: typos, spelling, diction, logic, and nuclear war)

adams@calma.UUCP (Robert Adams) (11/27/85)

> E. Michael Smith  ...!{hplabs,ihnp4,amd,nsc}!amdahl!ems
> It would be interesting to see what percentage of total
> volume and what percentage of each group volume was made up of
> headers and footers.  By what percent would total net volume
> drop if cute .signatures were eliminated and a standard
> disclaimer were appended to all articles.  (Rather than having
> each person come up with the obligatory disclaimer...)
> 
> This should save at least the couple of percent that the major
> groups consume.

I wrote a program to scan all news on our system and gather
such statistics.  What follows is the output of same.
A "signature" is the lines after a line of "-- " which
doesn't get them all but...  "Included" lines are ones
beginning with ">".  Average article length is skewed because
of a few large files -- maps and sources.

Notice that 1/4 of the characters are in headers and that
4% of the total characters stored are in the Path: line.

	adams@calma.UUCP		-- Robert Adams
	...!ucbvax!calma!adams

------------------ cut here ----------------
files = 9328, lines = 523997, characters = 21082117
average lines per file = 56, average chars per file = 2260
header lines = 119323, characters = 5099675, percent = 24%
signature lines = 28212, characters = 957140, percent =  5%
inserted lines = 47111, characters = 2401935, percent = 11%
                                               percent    percent
            Header   occurances  total chars  of headers  of total
       Relay-Version     9327      522336       10.2%       2.5%
     Posting-Version     9322      585931       11.5%       2.8%
                Path     9327      803948       15.8%       3.8%
                From     9327      344330        6.8%       1.6%
          Newsgroups     9327      268699        5.3%       1.3%
             Subject     9327      362454        7.1%       1.7%
          Message-ID     9327      285049        5.6%       1.4%
                Date     9327      260814        5.1%       1.2%
        Article-I.D.        0           0        0.0%       0.0%
              Posted        0           0        0.0%       0.0%
       Date-Received     9327      345084        6.8%       1.6%
          References     5279      286359        5.6%       1.4%
        Distribution     2743       46750        0.9%       0.2%
        Organization     8531      393317        7.7%       1.9%
               Lines     9327       82888        1.6%       0.4%
                Xref     2381      126036        2.5%       0.6%
            Approved      394       12167        0.2%       0.1%
               Nf-ID      603       27965        0.5%       0.1%
             Nf-From      605       30231        0.6%       0.1%
             Control      275        8581        0.2%       0.0%
            Reply-To     2515      109362        2.1%       0.5%
              Sender     1150       34935        0.7%       0.2%
               Xpath       61        1379        0.0%       0.0%
            Keywords      560       17544        0.3%       0.1%
             Summary      727       17643        0.3%       0.1%
         Followup-To      132        3405        0.1%       0.0%
             Expires       67        2054        0.0%       0.0%
                  Cc        7          65        0.0%       0.0%
       Apparently-To        5         160        0.0%       0.0%
         In-reply-to        1          63        0.0%       0.0%
        This-Account        1          26        0.0%       0.0%
            Reply-tp        1          27        0.0%       0.0%
    Original-Subject        3         179        0.0%       0.0%
        Followups-to        2          50        0.0%       0.0%
               other        2         117        0.0%       0.0%