Abstract: Distribution of character frequency in sample of 51,000 IDs from 51 biomedical databases: The three columns show the ASCII code of the character, the printable form and the frequency of occurence. We hide the ASCII codes 128 - 255, because they never showed up in the sample IDs. Overall we see 182 characters do not show up in any ID (54 in the range of frequently used 7-bit ASCII subset)
License: GNU GENERAL PUBLIC LICENSE Version 2, June 1991
DOI: 10.5447/ipk/2012/8
Content: 0 Directories 1 Files (17.8 KB)
| CONTRIBUTOR: |
Matthias Lange,
Hendrik Mehlhorn,
Uwe Scholz,
Falk Schreiber
[Show full information]
|
| CREATOR: |
Matthias Lange,
Hendrik Mehlhorn,
Uwe Scholz,
Falk Schreiber
[Show full information]
|
| PUBLISHER: | e!DAL - Plant Genomics and Phenomics Research Data Repository (PGP), IPK Gatersleben, Seeland OT Gatersleben, Corrensstraße 3, D-06466, Germany |
| SIZE: | 17.8 KB |
| SUBJECT: | bioinformatics, information retrieval |