| Example for the structure of a Gene Directory-file | Example for the structure of a uniqID-reference file |
| # Query Results # Connection: MySQL server # Host: localhost # Saved: 2003-09-07 20:31:25 # # Query: # SELECT DISTINCT GeneSymbol, GOAcc, GeneIdentifier FROM 'flymar03' # 'GeneSymbol','GOAcc','GeneIdentifier', '&agr;-Adaptin','5886','FBgn0015567', '&agr;-Adaptin','7269','FBgn0015567', '&agr;-Adaptin','16192','FBgn0015567', '&agr;-Adaptin','30122','FBgn0015567', '&agr;-Adaptin','6901','FBgn0015567', '&agr;-Adaptin','8021','FBgn0015567', '&agr;-Adaptin','16183','FBgn0015567', '&agr;-Adaptin','30135','FBgn0015567', '&agr;-Cat','3779','FBgn0010215', '&agr;-Cat','7016','FBgn0010215', '&agr;-Cat','8092','FBgn0010215', and so on... | # Query Results # Connection: MySQL server # Host: localhost # Saved: 2003-09-17 10:38:31 # # Query: # SELECT uniqID, GeneIdentifier FROM `probe2gene` # 'uniqID','GeneIdentifier', '145501_at','FBgn0031208', '153565_at','FBgn0002121', '145502_at','FBgn0031209', '154367_at','FBgn0028472', '145507_at','FBgn0031213', '145508_at','FBgn0031214', '153716_at','FBgn0031216', '145510_at','FBgn0031217', '141332_at','FBgn0026787', '154909_at','FBgn0005278', and so on... |
| Please note that the comment lines (starting with #) are not necessarily required. However, the first lines containing the column names (i.e. 'GeneSymbol','GOAcc','GeneIdentifier',) are important. The GOAcc is the GO-specific identifier for each termand in GO-Cluster it is used without the "GO:"-prefix and the leading zeros. It is also important that each item is quoted in " ' " and thatthe field values are separated with " , ". Note that no blanks are between the last " , " and and the linebreaks. Data lines of other structure than stated above will be ignored by GO-Cluster! If you don't know how to setup the files, please talk to your favorite computer person in your department. | |
| UniqID | wild-type | condition 1 | condition 2 |
| 142126_at | 32432,3 | 5644,4 | 6546,4 |
| 142127_at | 342,5 | 345,5 | 865,4 |
| 142128_at | 45366,7 | 74544,5 | 45355,6 |
Note for experienced users: Also "maximal" Cluster&Treeview-files can be read by GO-Cluster. However, the fields Name, GWeight, GOrderEOrder will be ignored, since GO-Cluster only offers a very simple clustering approach.B) Bibliography
Further important differences between Cluster&Treeview and GO-Cluster in the interpretation of the input file:
- Your data has to be normalized and scaled. There are no functions in GO-Cluster to do so.
- The UniqID in the first column has to be really unique! Otherwise, the program might crash when looking for a particular gene.
- Label fields have to appear. GO-Tree determines the number of data columns by interpretation of the first row.
- The decimal separator ("," or ".") is used according to your system's settings.
- Missing numerical data is replaced by "1.0" and negative values will be read as absolute numbers.
- Internal calculations are performed with the complete precision ofyour input. Display however is limited to one decimal, which is okay formost applications. If you want to see all of your data, simply multiply yourinput data by 10, 100 or 1000 before loading it into GO-Cluster.
- The maximal number of experiments is 16.