Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Mar 17;434(7031):325-37.
doi: 10.1038/nature03440.

The DNA sequence of the human X chromosome

Mark T Ross  1 Darren V GrafhamAlison J CoffeySteven SchererKirsten McLayDonna MuznyMatthias PlatzerGareth R HowellChristine BurrowsChristine P BirdAdam FrankishFrances L LovellKevin L HoweJennifer L AshurstRobert S FultonRalf SudbrakGaiping WenMatthew C JonesMatthew E HurlesT Daniel AndrewsCarol E ScottStephen SearleJuliane RamserAdam WhittakerRebecca DeadmanNigel P CarterSarah E HuntRui ChenAndrew CreePreethi GunaratnePaul HavlakAnne HodgsonMichael L MetzkerStephen RichardsGraham ScottDavid SteffenErica SodergrenDavid A WheelerKim C WorleyRachael AinscoughKerrie D AmbroseM Ali Ansari-LariSwaroop AradhyaRobert I S AshwellAnne K BabbageClaire L BagguleyAndrea BallabioRuby BanerjeeGary E BarkerKaren F BarlowIan P BarrettKaren N BatesDavid M BeareHelen BeasleyOliver BeasleyAlfred BeckGraeme BethelKarin BlechschmidtNicola BradySarah Bray-AllenAnne M BridgemanAndrew J BrownMary J BrownDavid BonninElspeth A BrufordChristian BuhayPaula BurchDeborah BurfordJoanne BurgessWayne BurrillJohn BurtonJackie M ByeCarol CarderLaura CarrelJoseph ChakoJoanne C ChapmanDean ChavezEllson ChenGuan ChenYuan ChenZhijian ChenCraig ChinaultAlfredo CiccodicolaSue Y ClarkGraham ClarkeChris M CleeSheila CleggKerstin Clerc-BlankenburgKaren CliffordVicky CobleyCharlotte G ColeJen S ConquerNicole CorbyRichard E ConnorRobert DavidJoy DaviesClay DavisJohn DavisOliver DelgadoDenise DeshazoPawandeep DhamiYan DingHuyen DinhSteve DodsworthHeather DraperShannon Dugan-RochaAndrew DunhamMatthew DunnK James DurbinIreena DuttaTamsin EadesMatthew EllwoodAlexandra Emery-CohenHelen ErringtonKathryn L EvansLouisa FaulknerFiona FrancisJohn FranklandAudrey E FraserPetra GalgoczyJames GilbertRachel GillGernot GlöcknerSimon G GregorySusan GribbleColine GriffithsRussell GrocockYanghong GuRhian GwilliamCerissa HamiltonElizabeth A HartAlicia HawesPaul D HeathKatja HeitmannSteffen HennigJudith HernandezBernd HinzmannSarah HoMichael HoffsPhillip J HowdenElizabeth J HuckleJennifer HumePaul J HuntAdrienne R HuntJudith IsherwoodLeni JacobDavid JohnsonSally JonesPieter J de JongShirin S JosephStephen KeenanSusan KellyJoanne K KershawZiad KhanPetra KioschisSven KlagesAndrew J KnightsAnna KosiuraChristie Kovar-SmithGavin K LairdCordelia LangfordStephanie LawlorMargaret LevershaLora LewisWen LiuChristine LloydDavid M LloydHermela LoulsegedJane E LovelandJamieson D LovellRyan LozadoJing LuRachael LyneJie MaManjula MaheshwariLucy H MatthewsJennifer McDowallStuart McLarenAmanda McMurrayPatrick MeidlThomas MeitingerSarah MilneGeorge MinerShailesh L MistryMargaret MorganSidney MorrisInes MüllerJames C MullikinNgoc NguyenGabriele NordsiekGerald NyakaturaChristopher N O'DellGeoffery OkwuonuSophie PalmerRichard PandianDavid ParkerJulia ParrishShiran PasternakDina PatelAlex V PearceDanita M PearsonSarah E PelanLesette PerezKeith M PorterYvonne RamseyKathrin ReichwaldSusan RhodesKerry A RidlerDavid SchlessingerMary G SchuelerHarminder K SehraCharles Shaw-SmithHua ShenElizabeth M SheridanRatna ShownkeenCarl D SkuceMichelle L SmithElizabeth C SotheranHelen E SteingruberCharles A StewardRoy StoreyR Mark SwannDavid SwarbreckPaul E TaborStefan TaudienTineace TaylorBrian TeagueKaren ThomasAndrea ThorpeKirsten TimmsAlan TraceySteve TrevanionAnthony C TromansMichele d'UrsoDaniel VerduzcoDonna VillasanaLenee WaldronMelanie WallQiaoyan WangJames WarrenGeorgina L WarryXuehong WeiAnthony WestSiobhan L WhiteheadMathew N WhiteleyJane E WilkinsonDavid L WilleyGabrielle WilliamsLeanne WilliamsAngela WilliamsonHelen WilliamsonLaurens WilmingRebecca L WoodmanseyPaul W WrayJennifer YenJingkun ZhangJianling ZhouHuda ZoghbiSara ZorillaDavid BuckRichard ReinhardtAnnemarie PoustkaAndré RosenthalHans LehrachAlfons MeindlPatrick J MinxLadeana W HillierHuntington F WillardRichard K WilsonRobert H WaterstonCatherine M RiceMark VaudinAlan CoulsonDavid L NelsonGeorge WeinstockJohn E SulstonRichard DurbinTim HubbardRichard A GibbsStephan BeckJane RogersDavid R Bentley
Affiliations

The DNA sequence of the human X chromosome

Mark T Ross et al. Nature. .

Abstract

The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence.

PubMed Disclaimer

Figures

Figure 2
Figure 2
Xp and Xq pericentromeric contigs extend into the X-chromosome-specific higher-order alpha satellite, DXZ1. The pericentromeric region of the X chromosome is shown as a truncated ideogram. Self-self alignments of proximal sequences from each arm are illustrated by dotter plots below the ideogram. On each plot, the junction between the arm sequence and the arm-specific satellite region is marked by a red arrow, and the junction between the arm-specific satellite region and the X-chromosome-specific alpha satellite array (DXZ1) is marked with a blue arrow. Approximately 594 kb of sequence were analysed from Xp, including ~21 kb of DXZ1 sequence. The ~454 kb of sequence analysed from Xq included ~44 kb of DXZ1 sequence. In each case, ~100 kb of arm sequence were included. The highly repetitive structure of pericentromeric satellites is in stark contrast to the near absence of repetitive structure in the arm sequences, despite an unusually high density of LINE repeats in these regions. Gaps in the dark satellite regions occur where interspersed elements (LINEs, SINEs and LTRs) interrupt the satellite sequences. In the Array Sequences dotter plot, the most proximal ~21 kb of the Xp sequence is joined to the most proximal ~44 kb of the Xq sequence. The periodic nature of the centromeric, higher-order alpha satellite array is evident. Black horizontal lines on the plot reveal near identity of sequences spaced at ~2 kb intervals. This DXZ1 sample represents ~65 kb of the 3 (±0.4) Mb alpha satellite array. The regions outlined in blue are self-self alignments (‘Xp DXZ1’ and ‘Xq DXZ1’), and the remaining rectangular region of the plot is an alignment of Xp versus Xq DXZ1, which reveals the close relationship between DXZ1 sequences from each arm.
Figure 3
Figure 3
Homologies between the human X chromosome and chicken autosomes. a, Plot of BLASTZ sequence alignments between the X chromosome and chicken chromosomes 1 (red) and 4 (blue). Grey bar centred at approximately 60 Mb shows the position of the X centromere. Only the relevant section of each chicken chromosome is shown (see Mb scale at left for chromosome 1 and at right for chromosome 4). A schematic interpretation of the homologies shows the XAR and XCR as red and blue bars, respectively (see Fig. 1). Homologies at the ends of the XAR are indicated with arrows and are expanded in b. b, (Top) Genes at the ends of the human XAR. Genes from distal Xp (magenta arrow in a) are in magenta and genes from Xp11.3 (black arrow in a) are shown in black. (Bottom) Arrangement of the orthologous genes on chicken chromosome 1. A hypothetical ring chromosome, with the equivalent gene order to that observed in the chicken, is indicated by the curved, dotted red line. Recombination between one end of the established X chromosome and the ring chromosome at the arrowed position could, in a single step, have added the XAR and created the gene order observed on the human X chromosome.
Figure 4
Figure 4
Conservation of the X chromosome in eutherian mammals. Plot of BLASTZ sequence alignments between the human X chromosome and the mouse (red) and rat (blue) X chromosomes. The rodent chromosomes are oriented with their centromeres pointing downwards. Regions indicated with arrows are long, highly similar repeats in the mouse sequence that are absent from the human and rat sequences. These repeats were apparently collapsed in an earlier analysed version of the mouse sequence, which also had a large inversion with respect to the mouse assembly used here (NCBI32). The NCBI32 assembly has a gap from 0-3 Mb, which explains the absence of homology to the human X sequence in this part of the plot. The open horizontal bar shows the terminal section of human Xp, which is not conserved on the rodent X chromosomes.
Figure 5
Figure 5
Limited homology between the human sex chromosomes illustrates the extent of Y chromosome erosion in non-recombining regions. a, BLASTN alignments (length ≥80 bp, sequence identity ≥70%) between the finished sequences of the X and Y chromosomes. The centromere positions are represented by grey bars. The analysed Y chromosome sequence ends at the large, heterochromatic segment on Yq, which is indicated by the black bar on the truncated Y chromosome ideogram. b, Major blocks of homology remaining between the XAR and the YAR. Expansion of the BLASTN plot from 0-12 Mb on the X chromosome and 0-20 Mb on the Y chromosome. On the X chromosome, the major homologies lie in the terminal 8.5 Mb of Xp: PAR1 (magenta line) and numbered blocks 1-10. Lesser homologies 11 and 12 contain the TBL1X/TBL1Y and AMELX/AMELY genes, respectively. c, The XTR region in detail (88-93 Mb on X and 2.8-6.8 Mb on Y). Black arrows show large segments deleted from the Y chromosome copy of the XTR. The magenta arrow indicates the short segment that is separated from the rest of the XTR by a paracentric inversion on the Y chromosome. An independent inversion polymorphism on Yp in human populations encompasses this small segment. The position and orientation of the segment shows that the Y chromosome reference sequence is of the less common, derived Y chromosome.
Figure 6
Figure 6
Schematic representation of major homologies between the human sex chromosomes. The entire X and Y chromosomes are shown using the same scale on the left and right sides of the figure, respectively. The major heterochromatic region on Yq is indicated by the pale grey box proximal to PAR2. Expanded sections of X and Y are shown in the centre of the figure. Homologies coloured in the figure are either part of the XAR (PAR1 and blocks 1-12), or were duplicated from the X chromosome to the Y chromosome since the divergence of human and chimpanzee lineages (XTR and PAR2). The numbering of XAR-YAR blocks follows that in Fig. 5b. Blocks inverted on the Y chromosome relative to the X chromosome are assigned red, negative numbers.
Figure 7
Figure 7
Evidence for a fifth evolutionary stratum on the X chromosome. a, Sequence identity between the X and Y homology blocks 1-12 (see Figs 5b and 6) plotted in 5-kb windows. The scale shows the total amount of sequence aligned, excluding insertions and deletions (see Methods). A 10-kb spacer is placed between each consecutive block of homology. Segments of the plot are coloured according to the system used in Figs 6 and 7b. On the basis of this plot, a new evolutionary stratum S5 is defined, which includes homology blocks 1 and 2. b, A most parsimonious series of inversion events from the arrangement of homology blocks 1-12 on the X chromosome (top) to the Y chromosome (bottom), calculated using GRIMM. The grey boxes show the suggested extents of former pseudoautosomal regions within the distal part of the XAR, and the magenta box (bottom row) shows the position of the current pseudoautosomal region. This inversion sequence provides independent support for the proposed pseudoautosomal boundary movements and evolutionary strata. It was previously suggested that AMELX (in block 12) is in S4 (ref. 46), or possibly at the boundary between S3 and S4 (ref. 67). However, the more distal location of block 11, which contains TBL1X (an S3 gene46), is not consistent with these suggestions. The two regions of increased sequence identity within block 10 contain the VCX2 and VCX3B genes on the X chromosome and the VCY1B and VCY genes on the Y chromosome. This gene family might have arisen de novo in the simian lineage, which could account for the unusual characteristics of this part of the alignment.
Figure 8
Figure 8
Sequence compositional changes in the distal evolutionary strata of the X chromosome. Shown are the positions of SINE and LINE repeats and (G+C) content within PAR1, S5 and the distal half of S4. The percentage of Alu, L1 and (G+C) are shown for each region (including the whole of S4). There is an abrupt increase in Alu repeat levels and (G+C) content from S4 to S5. The five euchromatic gaps in PAR1 are shown as light brown bars. Pale blue bars represent clones for which the sequences were unfinished at the time of the sequence assembly.

Comment in

Similar articles

Cited by

References

    1. Ohno S. Sex Chromosomes and Sex-linked Genes. Springer; Berlin: 1967.
    1. McKusick-Nathans Institute for Genetic Medicine. Johns Hopkins University and National Center for Biotechnology Information. National Library of Medicine OMIM: Online Mendelian Inheritance in Man. 2000. < http://www.ncbi.nlm.nih.gov/omim/> .
    1. Royer-Pokora B, et al. Cloning the gene for an inherited human disorder—chronic granulomatous disease—on the basis of its chromosomal location. Nature. 1986;322:32–38. - PubMed
    1. Monaco AP, et al. Isolation of candidate cDNAs for portions of the Duchenne muscular dystrophy gene. Nature. 1986;323:646–650. - PubMed
    1. Bentley DR, et al. The physical maps for sequencing human chromosomes 1, 6, 9, 10, 13, 20 and X. Nature. 2001;409:942–943. - PubMed

Publication types

MeSH terms