The HIV Brain Sequence Database contains three categories of annotations: publication references, patient and sampling information, and sequence properties (see table below).
The publication annotations include bibliographic information identifying the study that generated the sequences. Patient sampling annotations contain information describing the individual patients, as well as clinical information at the time of sampling.
This information was obtained by manual curation of the original publications and in some cases direct communications with the study authors. In cases where multiple studies examined tissue samples from the same patient, the resulting sequences are linked to the same patient code to increase statistical power.
Sample timepoint annotations describe the patient’s clinical health status, neuropathological, neurocognitive status, CD4 counts, viral load, and anti-retroviral treatment history at the time of sampling. Clone and sequence annotations describe the individual sequences and the tissue from which they were cloned.
This includes the sequence start and end locations numbered based on alignment to the HXB2 reference genome, and tissue source coded using terms from a formal anatomical ontology. Alignment to HXB2 was performed using the HIV Sequence Locator tool located at the LANL HIV Sequence Database.
|HIV risk factor
|Tissue bank distributing samples
|Patient year of death
|patient year of death
|patient geo-region at time of sampling
|patient country at time of sampling
|patient city at time of sampling
|patient age at sampling
|patient health status at sampling
|predominant subtype at time of sampling
|has patient had ART
|patient ART history
|Viral load plasma (copies/mL)
|plasma viral load
|Viral load brain (copies/million cells)
|brain viral load
|Viral load lymphoid (copies/million cells)
|lymphoid viral load
|CD4 count (cells/uL)
|were giant cells present in the brain
|Genbank accession number
|Genbank GI number
|Pubmed ID for original publicaiton
|publication assigned clone name
|Sample tissue class
|global tissue class (Brain, Blood & Lymphoid, etc…)
|Sample tissue name
|Sample tissue FMA code
|tissue FMA code
|Nucleic acid type
|was proviral DNA or viral RNA sequenced
|Start and end coordinates
|sequence start and end referenced to HXB2