## Burrow the Burrows ##

__Statistics_on_the_subgraph_of_type-1_selectors__
       (last update: 20180827 17:00:00 GMT)

### N.B.: THE CRAWL IS NOT COMPLETE, SO THESE 
    RESULTS ARE PARTIAL AND NOT CONCLUSIVE.

 The graph of type-1 selectors is obtained by 
 associating a node to each menu that is traversed
 by `burrow`, and by drawing a directed edge from
 menu A to menu B if there exists a selector to 
 menu B in menu A. As a result, this graph contains
 only menus in servers that have been found active
 at the time of the crawl.

 The subgraph of type-1 selectors containing other
 type-1 selectors to active servers consists of:

 *  7372325   menus linked through
 *  8843199   live selectors

 Of those menus, only:

 *  1178121   have valid type-1 selectors to visited
              active servers, and each of them 
              points on average to 
 *   7.5062   other menus

 The remaining type-1 selectors contain no 
 link to the active servers visited so far (but 
 they are linked from at least one of them, otherwise
 they would not be part of the graph).

 You find below a plot (axes in log-scale) of the
 number of menus having a given out-degree (i.e.,
 number of type-1 selectors to other menus):

   1e+06 ++++----+-++--+----+-++--+----+-++--+----+-++--+----++++   
         O+.        +          +          +          +         ++   
  100000 +-+O.O.                  # menus by out-degree +..O..+-+   
         ++     OOO                                            ++   
         ++       OO                                           ++   
   10000 +++       OOO        O                               +++   
         ++           OOO   O :O                               ++   
    1000 +++          OOOOOOOOOO                              +++   
         ++              OOOOOOO                               ++   
     100 +-+                OOO:   O                          +-+   
         ++                  OOOOO ::                          ++   
         ++                    OOOOOOOOO                       ++   
      10 +++                    OOOOOOOOOO                    +++   
         ++         +          + OOOOOOOOOOOOOO  OO  +  ..O    ++   
       1 +-++----+-++--+----+-++-OOOOOOOOOOOOOOOOOO-OOO-+----++-+   
         1          10        100        1000      10000      100000
                             outgoing edges                         
                                                                    
 This means that the vast majority of menus in 
 Goperspace have less than 10 type-1 selectors
 to other live menus.

 Of those 7372325 active menus:

 *  7072600   are referred by 1 other menu
 *   150873   are referred by 2 other menus
 *    77778   are referred by 3 other menus
 *    24063   are referred by 4 other menus
 ...and only:
 *    47011   are referred by 5 or more menus.

 As a result, each menu in an active server is
 referred on average by:

 *   1.1995   other menus. 

 But 97.9% of all the menus in the Gopherspace
 are referred by less than 3 other menus. 

 You find below a plot (axes in log-scale) of the
 number of menus having a given in-degree (i.e.,
 number of other menus pointing to them):

   1e+07 O+++----+-++--+----+-++--+----+-++--+----+-++--+----++++   
         ++         +          +          +          +         ++   
   1e+06 +++                       # menus by in-degree +..O..+++   
         ++:                                                   ++   
  100000 +++O.O.                                              +++   
         ++     O                                              ++   
   10000 +++     O                                            +++   
         ++       OO O                                         ++   
    1000 +++        OOO                                       +++   
         ++           OOOOO                                    ++   
     100 +++             OOOOO                                +++   
         ++                OOOOOO     O                        ++   
      10 +++                  OOOOOOOOOO                      +++   
         ++         +          +OOOOOOOOOOO OO  O    +         ++   
       1 +-++----+-++--+----+-++--OOOOOOOOOOOOOOOO+-++O-+----O+-+   
         1          10        100        1000      10000      100000
                             incoming edges                         

__________Strongly-connected_Components__________

 A strongly-connected component of a directed 
 graph is a set of nodes such that any node in
 that set can reach all other nodes in the set 
 through a directed path *and* can be reached by
 all the other nodes in that set. 
 
 The 10 largest strongly-connected components of
 the graph of type-1 selectors among active 
 servers contain:

@@@@@@

 *   247302  nodes
 *   133193  nodes
 *     6221  nodes
 *     6081  nodes
 *     3768  nodes
 *     3398  nodes
 *     3333  nodes
 *     2020  nodes
 *     1987  nodes
 *     1914  nodes

 You find below a plot (axes in log-scale) of the
 size of each strongly-connected component against
 its rank:

   1e+06 +++--+-++-+--+-++-+--+-++-+--+-+--+--+-+--+--+++--+--+++   
         ++      +       +       +      +       +       +      ++   
  100000 O-+                     component size by rank +..O..+-+   
         ++                                                    ++   
         ++                                                    ++   
   10000 ++O.O                                                +++   
         ++   OOOOO                                            ++   
    1000 +++      OOO                                         +++   
         ++         OOO                                        ++   
     100 +-+          OOOO                                    +-+   
         ++              OOO                                   ++   
         ++                OO                                  ++   
      10 +++                OO                                +++   
         ++      +       +   OOOOO      +       +       +      ++   
       1 +-+--+-++-+--+-++-+--+-+OOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO+   
         1       10     100     1000  10000   100000  1e+06   1e+07 
                             component rank                         
                                                                    

___________About_Gopher_Archives__________________

 It is worth noting that a large fraction of all 
 the menus found by `burrow` are inside large file
 and news archives/mirrors. Typical examples 
 include ftp.icm.edu.pl, taz.de, bkandbc.com, and 
 hngopher.com. Those four gopherholes alone 
 account for about 90% of all the links traversed
 by `burrow`. ftp.icm.edu.pl alone (an archive 
 containing software, operating systems, packages, 
 and documentation) accounts for about 79% of all 
 the links in the Gopherspace.
 I personally think that this is totally fine, 
 and actually expected, since Gopher is first and
 foremost a distributed file system. The fact 
 that the "social" aspect of Gopher, with phlogs
 and personal gopherholes, is not that central in
 terms of bare size does not mean anything: that 
 part can still be the most important one to 
 most of us, the humans who make and shape the 
 Gopherspace ;-)