IUBIO ARCHIVE FOR BIOLOGY Welcome to the IUBio Archive of biology software and data. This is an anonymous ftp archive that you will find on the computer with the Internet name of ftp.bio.indiana.edu. Molecular biology is the area of concentration, and it is also a home for Drosophila research data. It will include software for Macintosh, VAX-VMS, Unix, MS-DOS and any other important computer operating systems. Access to the archive is via anonymous FTP (file transfer protocol) programs that connect to computers on the Internet. ACCESS TO IUBIO ARCHIVE ----------------------- This IuBio Archive is on the Internet network of computers with the name FTP.Bio.Indiana.Edu The actual host computer and Internet number for this archive may change. The Internet address of the current Iubio archive computer is 129.79.224.25. If your computer system is linked to the Internet, it probably has an FTP program. Each FTP program has it's own peculiarities, but most follow a general syntax: ftp ftp.bio.indiana.edu -- connect to archive computer user: anonymous -- log on to archive computer password: your e-mail address ? or help -- general help for ftp cd subdirectory -- change to subdirectory cd .. -- change to superdirectory binary -- use full binary transfer ascii -- use text transfer get any.file -- fetch a file from the archive put my.file -- put a file to the archive (only for Incoming/ directory) bye -- close the connection This archive uses Unix conventions and software. *PLEASE NOTE*, all commands and file names are CASE-Sensitive. Mostly these are all lowercase. You will need to use your shift key some places, like "Incoming" and "Readme". See below for a detailed example session. CURRENT CONTENTS OF THE ARCHIVE ------------------------------- An abbreviated directory of the archive Archive.doc About this archive (this document) Files Full list of archive files Files.new Most recent archive files biology/ General biology chemistry/ Chemistry fly/ Drosophila stocks and data help/ Help documents molbio/ Molecular biology science/ General sciences usenet/ Archive of biology news from Usenet util/ Computer and archive utilities Incoming/ Contributions go here ./molbio: align/ Sequence alignment codon/ Codon tables data/ Molecular data evolve/ Evolution and phylogeny ibmpc/ MSDOS software mac/ Macintosh software rnafold/ RNA secondary structure search/ Databank searching The folders bin/, dev/, etc/, private/ and usr/ are for archive housekeeping; please ignore them. Most of the software is as received from the authors or contributor. In the case of software source, it may or may not be ready to compile and run on a given operating system. Even those programs that do run may require adjustments to read the data formats that you use. I have edited and recompiled many of the programs in source form for different platforms to one extent or another, and can say that most will run but may require the talents of a programmer to install them. The archive of Bionet newsgroups, Sci.Bio and Info-GCG from the Usenet electronic news media was added starting with news from about 1 Dec 91. See the usenet/Readme file for details. If you have suggestions, questions or comments, please let me know. Addresses are listed below. HISTORY ------- This archive was first started in October 1989 on the computer called Iubio.Bio.Indiana.Edu. At that time, the archive was my personal reference collection of public molecular biology software and data. I made the archive available to others because the only similar archive available at the time, at BioNet, closed it's public software operation. It seemed little extra effort for me to make my collection available to others (my mistake...). During the summer of 1991, the archive moved to Cricket.Bio. Indiana.Edu, my desktop Macintosh running Apple Unix. This computer has served well, handling the traffic of 300 callers/week and 1000 files/week transferred to people around the world. Somewhere around this time, the California Education and Research Federation Network selected this archive as it's 1991 Biological Sciences winner for its CERFnet award for excellence in networked applications. As of Nov 1991, the archive moved to a computer called Fly.Bio.Indiana.Edu. Through the Bloomington Drosophila stock center, NSF's Division of Instrumentation and Resources has provided the funds for the purchase of a Sun Sparcstation 2 to function as the fly community's database server, and also to host the public IUBio archive. ACKNOWLEDGEMENTS ---------------- I would like to thank all of the hard working and often under-acknowledged authors who have contributed their software and data collections to this and other public archives. It is they who deserve credit for what this and other public archives mean to you and your ability to do your work. Please keep in mind that frequently the authors of these works receive little or no credit from their peers, the authors represented in this archive for the most part receive no money for their efforts, which are commonly done on weekends, evenings, vacations. If you would, please remember to mention the authors who make their software and databases available to you as it aids your work. Any program or database that is available publicly is published. If the author does not have a paper publication that you can cite, I recommend this form: Doe, John, 1991*. (Insert title of software or database). Published electronically on the Internet, available via anonymous ftp from ftp.bio.indiana.edu.% * If no date is given explicitly, use the file dates. % Substitute the name of the archive you obtained it from. You need not cite this archive for files you obtain here, however you may consider it equivalent to a paper journal or book in some ways. It is certainly an information resource. The proper citation for this public archive is: Gilbert, D.G., 1989. IUBio archive of molecular and general biology software and data. An Internet resource available via anonymous ftp to ftp.bio.indiana.edu. -------------- Don Gilbert CONTRIBUTING TO THE ARCHIVE --------------------------- Contributions of broad interest in any area of biology, and related areas of chemistry and other sciences, are welcome. These may be software or data. Contributions of interest over several computer platforms should either be plain text files or .ZIP or .ARC archives. You may put your contribution in the "Incoming" directory, using your FTP put command. You may also send e-mail compatible files (usually .UUE or .HQX encoded files or plain text) to Archive@Bio.Indiana.Edu (preferred Internet address) Don Gilbert, BioComputing Office (land mail) Biology Department, Indiana University Bloomington, IN 47405 USA Any general mail about the archive should be addressed here also. USING THE ARCHIVE ----------------- To change from the main directory to a subdiretory, use the command: ftp> cd subname To move up one level, ftp> cd .. To move over to another subdirectory at the same level, ftp> cd ../anothersub Some examples, ftp> cd molbio/align ftp> cd /molbio/evolve Some of the programs are in source form, or include data or documentation that is useful on many platforms. Since most of the programs require many files and to save space and transmission time, we store many of these programs in some archived format. The archiver previously common to all of the above operating systems is the ARC format popularized by System Enhancement Assoc. for MS-DOS machines. Many of the programs, especially those useful only on one operating system, are in the "native" archive format, whatever is most popular on that OS. Over the past several years, the ARC format has lost favor. There is now a multi-platform archive format called ZIP (first popular on MS-DOS). I am evaluating this, and may change to ZIP as a standard for this archive. Right now, there is quite a range of formats. You pretty much need to refer to the list of file suffixes below to determine which is which. Look in the /util subdirectory for programs which will decode these archived files. In some cases, the archive files and programs are stored here only as BINARY files. This means you must set your transmission software to transfer a full 8-bit byte, with the ftp command "Binary". See below list of file suffixes to tell which are in binary format. Most or all Macintosh files have been converted to .HQX format, or BinHex. This is an ascii (text) encoded form that you can fetch with the normal FTP setting of text (or ascii) transfer. You need Stuffit, BinHex 4.0 or other programs to decode this format. Some Macintosh programs may be stored in "MacBinary" format, which is binary and adds file system information at the start of the files. Your transfer software needs to know about macbinary for you to use these (NCSA Telnet, VersaTerm Pro, Red Ryder and most mac communications software does support MacBinary). Many of the archives and any of the .DOC, .TXT or .README files are in Ascii (plain text) format, suitable for transfer with a default Ascii method, or via e-mail programs. The encoded programs (.HQX or .UUE suffixes) require that you have a decoder on your computer. These are for Macintosh .HQX [Archive.Util.Mac]BinHex.* MSDOS .UUE [Archive.Util.IBMPC]UUDecode.Bas, *.C VMS .UUE [Archive.Util.VMS.UUEncode]*.* Unix .UUE use MS-DOS or VMS source code File name suffixes ------------------ TEXT (ascii) File formats ---- .DOC Plain text documentation .TXT Plain text documentation .README Plain text documentation .HQX Macintosh BinHex encoded file .UUE MS-DOS, VMS, Unix UUEncode encoded file .C C source text .F ForTran source text .FOR ForTran source text .P Pascal source text .PAS Pascal source text .COM VMS command text .MAKE Unix command text .DAT general data file (usually text) .SEQ sequence data, usually nucleic acids (text) .PEP amino acid sequence data (text) .AA amino acid sequence data (text) BINARY File formats (requires ftp binary command before transfer) ------ .ARC Archived files (MS-Dos, Macintosh, Unix, VMS) .ZIP Archived (MS_Dos, Unix, VMS, others) .ZOO Archived files (MS_Dos, Unix, VMS, others) .TAR Unix archived files .Z Unix compressed file .TAR.Z Unix archived + compressed file .BCK VMS Backup archive .SIT Macintosh archive .EXE VMS or MSDOS program Here is an example session with the IUBio archive (Unix host) from a Unix computer (^^^^^ indicates lines that you complete). MyHost% ftp ftp.bio.indiana.edu ^^^^^^^^^^^^^^^^^^^^^^^ Connected to fly.bio.indiana.edu. 220 fly FTP server (Version 6.9 Wed Oct 30 12:13:34 EST 1991) ready. Name (ftp.bio.indiana.edu:gilbertd): anonymous ^^^^^^^^^ 331 Guest login ok, send e-mail address as password. Password: joe ^^^ 230-Next time please use your e-mail address as your password 230- for example: joe@sunflower.bio.indiana.edu 230-Please read the file Readme 230- it was last modified on Wed Jun 5 20:00:12 1991 - 148 days ago 230 Guest login ok, access restrictions apply. ftp> dir ^^^ 200 PORT command successful. 150 Opening ASCII mode data connection for /bin/ls. total 70 -rw-r--r-- 1 gilbert archive 13425 Jun 6 01:00 Archive.doc -rw-r--r-- 1 gilbert archive 36939 Sep 29 17:41 Files -rw-r--r-- 1 gilbert archive 3397 Sep 29 18:14 Files.new drwxrwxrwx 3 ftp archive 512 Oct 31 13:32 Incoming lrwxrwxrwx 1 root daemon 11 Oct 30 17:46 Readme -> Archive.doc dr-x-----x 2 root wheel 512 Oct 30 17:23 bin drwxr-xr-x 4 gilbert archive 512 Oct 29 23:07 biology drwxr-xr-x 6 gilbert archive 512 Aug 16 18:41 chemistry dr-xr-xr-x 2 root wheel 512 Oct 29 22:30 dev dr-xr-xr-x 2 root wheel 512 Oct 30 17:27 etc drwxrwxr-x 4 matthewk archive 1024 Oct 24 21:02 fly drwxr-xr-x 7 gilbert archive 1536 Sep 29 16:49 help drwxr-xr-x 19 gilbert archive 512 Aug 15 23:06 molbio drwx------ 2 gilbert archive 512 Oct 22 15:18 private drwxr-xr-x 5 gilbert archive 512 Aug 17 00:51 science dr-xr-xr-x 3 root wheel 512 Oct 29 22:29 usr drwxr-xr-x 6 gilbert archive 512 Aug 15 23:05 util 226 Transfer complete. 1075 bytes received in 0.013 seconds (78 Kbytes/s) ftp> get Readme iubio-archive.readme ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 200 PORT command successful. 150 Opening ASCII mode data connection for Readme (13425 bytes). 226 Transfer complete. local: iubio-archive.readme remote: Readme 13804 bytes received in 0.068 seconds (2e+02 Kbytes/s) ftp> cd molbio ^^^^^^^^^ 250 CWD command successful. ftp> dir ^^^ 200 PORT command successful. 150 Opening ASCII mode data connection for /bin/ls. total 19 drwxr-xr-x 2 gilbert archive 512 Sep 29 16:52 align drwxr-xr-x 2 gilbert archive 512 Sep 29 17:03 atarist drwxr-xr-x 2 gilbert archive 1024 Aug 1 14:33 codon drwxr-xr-x 2 gilbert archive 1024 Sep 29 16:47 data drwxr-xr-x 4 gilbert archive 512 Aug 15 22:46 evolve drwxr-xr-x 2 gilbert archive 512 Jun 4 21:27 gcg drwxr-xr-x 2 gilbert archive 512 May 14 15:30 genbank drwxr-xr-x 3 gilbert archive 1024 Aug 15 23:48 ibmpc drwxr-xr-x 5 gilbert archive 2560 Sep 29 17:00 mac drwxr-xr-x 2 gilbert archive 512 May 14 16:01 ncbi drwxr-xr-x 2 gilbert archive 512 Aug 7 22:42 primer drwxr-xr-x 4 gilbert archive 512 Oct 21 22:23 readseq drwxr-xr-x 2 gilbert archive 512 May 14 16:18 renzyme drwxr-xr-x 2 gilbert archive 512 May 14 16:18 rnafold drwxr-xr-x 3 gilbert archive 512 Sep 29 17:05 search drwxr-xr-x 3 gilbert archive 512 Oct 21 21:56 unix drwxr-xr-x 3 gilbert archive 512 Sep 29 17:02 vax 226 Transfer complete. 1051 bytes received in 0.11 seconds (9.5 Kbytes/s) ftp> cd readseq ^^^^^^^^^^ 250 CWD command successful. ftp> dir ^^^ 200 PORT command successful. 150 Opening ASCII mode data connection for /bin/ls. total 206 -rw-r--r-- 1 gilbert archive 56832 May 14 16:16 readseq.arc -rw-r--r-- 1 gilbert archive 11381 Oct 21 22:22 readseq.c -rw-r--r-- 1 gilbert archive 2595 May 14 16:16 readseq.doc -rw-r--r-- 1 gilbert archive 5061 May 14 16:16 readseq.p -rw-r--r-- 1 gilbert archive 63559 May 14 16:16 readseq_sun.uue drwxr-xr-x 2 gilbert archive 512 Oct 21 22:23 sparc-bin -rw-r--r-- 1 gilbert archive 42507 Oct 21 22:22 ureadseq.c -rw-r--r-- 1 gilbert archive 1365 May 14 16:17 ureadseq.h -rw-r--r-- 1 gilbert archive 21010 May 14 16:17 ureadseq.p drwxr-xr-x 2 gilbert archive 512 Oct 21 22:25 vms-vax-bin 226 Transfer complete. 676 bytes received in 0.13 seconds (5.2 Kbytes/s) ftp> get readseq.doc ^^^^^^^^^^^^^^^ 200 PORT command successful. 150 Opening ASCII mode data connection for readseq.doc (2595 bytes). 226 Transfer complete. local: readseq.doc remote: readseq.doc 2670 bytes received in 0.015 seconds (1.7e+02 Kbytes/s) ftp> binary ^^^^^^ 200 Type set to I. ftp> get readseq.arc ^^^^^^^^^^^^^^^ 200 PORT command successful. 150 Opening BINARY mode data connection for readseq.arc (56832 bytes). 226 Transfer complete. local: readseq.arc remote: readseq.arc 56832 bytes received in 0.096 seconds (5.8e+02 Kbytes/s) ftp> bye ^^^ 221 Goodbye.