T2T-MFA8, the complete sequence of a crab-eating macaque.
We derived a parthenogenetic cell line, MFA582-1, from a crab-eating macaque and used it to produce 53× PacBio HiFi, 77× ONT, 32× Illumina WGS, and 156× Hi-C data. We then constructed a telomere-to-telomere assembly of the crab-eating macaque genome (T2T-MFA8). All resources are available on this site.
See also our sister reference genome: T2T-MMU8.
- T2T-MFA8v1.1: GCF_037993035.2 / GCA_037993035.2 (NCBI RefSeq)
- T2T-MFA8v1.0: GCF_037993035.1 / GCA_037993035.1 (NCBI RefSeq, suppressed)
Track hubs were generated using bigtrack.
- T2T-MFA8v1.0 (suppressed)
- T2T-MFA8v1.1: unmasked, 20 autosomes + chrX + chrY + chrMT
- MFA582-1 (autosomes, chrX and chrMT)
- PacBio WGS: SRX22719293
- Oxford Nanopore Technologies WGS: SRX22719294
- Illumina WGS: SRX22702646
- Hi-C: SRX22702647
- MFA0214 (chrY)
- PacBio WGS: SRX22874566 and SRX25542261
- Oxford Nanopore Technologies WGS: SRX22874568
- Illumina WGS: SRX22874666 and SRX25542262
- NCBI RefSeq GCF_037993035.2-RS_2025_03
- Curated gene annotation: LiftOff result from Mmul_10 RefSeq and Iso-Seq transcripts annotated by GeneMarkS-T, with manual curation
- LiftOff from Mmul_10 RefSeq
- 21-mer with no error: produced by GenMap
- 31-mer with no error: produced by GenMap
- Centromere
- CenSat: the 500-kbp extended regions of centromeres
- Segmental duplications: native bed format or merged bed format
- Tandem repeats: bed format or merged bed format
- RepeatMasker: native out format or bed format or merged bed format
- WindowMasker (with SDust)
- Centromere suprachromosomal family annotation
- rDNA models
- chrY annotations
- CpG islands
- CpG methylation from ONT: autosomes + chrX or chrY, identified by Nanopolish v0.14.0
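
Several of the annotations above are offered in both a native and a "merged" bed format. How the merged files were produced is not stated here. The sketch below assumes the usual convention, in which overlapping or directly adjacent intervals on the same chromosome are collapsed into one (the default behaviour of tools such as `bedtools merge`). The function name and the example coordinates are hypothetical.

```python
from collections import defaultdict


def merge_bed_intervals(records):
    """Collapse overlapping or book-ended intervals per chromosome.

    `records` is an iterable of (chrom, start, end) tuples using BED
    coordinates (0-based start, end-exclusive). This is an illustrative
    assumption about what "merged bed" means, not the authors' pipeline.
    """
    by_chrom = defaultdict(list)
    for chrom, start, end in records:
        by_chrom[chrom].append((start, end))

    merged = []
    for chrom in sorted(by_chrom):
        current_start, current_end = None, None
        for start, end in sorted(by_chrom[chrom]):
            if current_end is None:
                # First interval on this chromosome.
                current_start, current_end = start, end
            elif start > current_end:
                # Gap before this interval: emit the finished one.
                merged.append((chrom, current_start, current_end))
                current_start, current_end = start, end
            else:
                # Overlapping or touching: extend the current interval.
                current_end = max(current_end, end)
        merged.append((chrom, current_start, current_end))
    return merged


if __name__ == "__main__":
    # Hypothetical coordinates, for illustration only.
    example = [("chr1", 0, 10), ("chr1", 5, 20), ("chr1", 30, 40), ("chr2", 1, 2)]
    for chrom, start, end in merge_bed_intervals(example):
        print(chrom, start, end, sep="\t")
```

A merged file built this way is convenient for computing total covered bases, because no position is counted twice.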
Important
To verify file integrity, simply append .md5 to the download URL.
To avoid confusion, previous versions are not displayed on this page, but you can still access them through the links.
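
The integrity check described above can be scripted. The following is a minimal sketch: it builds the checksum URL by appending `.md5`, computes the MD5 digest of a downloaded file, and compares the two. It assumes the `.md5` file contains either a bare hex digest or the `md5sum`-style `<digest>  <filename>` layout, which is not confirmed on this page. All function names and the example URL are hypothetical.

```python
import hashlib
import os
import tempfile


def md5_url(download_url: str) -> str:
    """Return the checksum URL: the download URL with '.md5' appended."""
    return download_url + ".md5"


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_md5_text(text: str) -> str:
    """Extract the digest from the contents of a .md5 file.

    Assumption: the first whitespace-separated token is the hex digest.
    """
    return text.split()[0].lower()


def verify(path: str, md5_text: str) -> bool:
    """Return True if the local file matches the published digest."""
    return md5_of_file(path) == parse_md5_text(md5_text)


if __name__ == "__main__":
    # Self-contained demo on a temporary file; no network access needed.
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(b"ACGT\n")
    published = hashlib.md5(b"ACGT\n").hexdigest() + "  demo.fa"
    print(md5_url("https://example.org/demo.fa"))
    print("match" if verify(tmp.name, published) else "MISMATCH")
    os.remove(tmp.name)
```

In practice, you would download both the file and its `.md5` companion with a tool of your choice, then pass the local path and the text of the `.md5` file to `verify`.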
If you use these resources, we would appreciate it if you acknowledged and cited our paper:
Zhang, S., Xu, N., Fu, L. et al. Integrated analysis of the complete sequence of a macaque genome. Nature (2025). https://doi.org/10.1038/s41586-025-08596-w
All data is released to the public domain (CC0).