Skip to content

rlugojr/ia-bin-tools

 
 

Repository files navigation

Readme for ia-bin-tools: Internet Archive Command-line Utilities

Tools included:

   sha1sum, md5sum: forked from gnu coreutils, but with base32 support added

   gzip-chunks: identifies valid gzip chunks in the input and writes them
                verbatim to the output

   bin-search: performs binary search of sorted text file(s)

   whoiz: performs whois queries (RFC 3912)

Download:

   Download latest stable source distribution:
   https://builds.archive.org:1443/job/ia-bin-tools/lastStableBuild/

Build:

   * from tarball: ./configure && make && sudo make install
   * from git: ./autogen.sh && make && sudo make install

   See also the file INSTALL.

Development:

   Source is hosted at https://github.com/internetarchive/ia-bin-tools

Why the fork of sha1sum/md5sum?

   There is an open bug with patch, but the sentiment seems to be that it
   shouldn't be included. See http://debbugs.gnu.org/cgi/bugreport.cgi?bug=6056
   
Why another whois client?

   1. Apache license
   2. Recursive lookup starting with whois.iana.org
   3. Support for whois:// URLs

   When I started working on the whois fetcher for Heritrix I went looking for
   some existing code to base it on. But the existing clients were all GPL, and
   Heritrix is now mostly Apache licensed. So I started from scratch.

   Existing whois clients mostly have lists of servers for different TLDs and
   such. But it turns out that today only a small handful of servers need to be
   hardcoded, because you can lookup TLDs at whois.iana.org, and it will refer
   you to a more specific server. 

   In addition, this whois client supports whois://{server}/{query} URLs, a
   scheme I invented for Heritrix.

   See also: 
      * http://builds.archive.org:8080/javadoc/heritrix-3.x-snapshot/org/archive/modules/fetcher/FetchWhois.html
      * http://webarchive.jira.com/browse/HER-1645

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C 97.9%
  • Makefile 1.1%
  • Other 1.0%