
EMDB-49517: Structure of R2 retrotransposon protein from Platysternon megacep...
Basic information
Entry | EMDB-49517 |
---|---|
Title | Structure of R2 retrotransposon protein from Platysternon megacephalum after second strand nicking |
Keywords | Retrotransposon / Reverse transcriptase / RNA BINDING PROTEIN-RNA-DNA complex |
Method | single particle reconstruction / cryo EM / Resolution: 4.6 Å |
Authors | Thawani A / Collins K / Nogales E |
Citation | Title: Structures of vertebrate R2 retrotransposon complexes during target-primed reverse transcription and after second-strand nicking. Authors: Akanksha Thawani / Anthony Rodríguez-Vargas / Briana Van Treeck / Nozhat T Hassan / David L Adelson / Eva Nogales / Kathleen Collins. Abstract: R2 retrotransposons are site-specific eukaryotic non-long terminal repeat retrotransposons that copy and paste into gene loci encoding ribosomal RNAs. Recently, we demonstrated that avian A-clade R2 proteins achieve efficient and precise insertion of transgenes into their native safe-harbor loci in human cells. The features of A-clade R2 proteins that support gene insertion are not well characterized. Here, we report high-resolution cryo-electron microscopy structures of two vertebrate A-clade R2 proteins at the initiation of target-primed reverse transcription and after cDNA synthesis and second-strand nicking. Using biochemical and cellular assays, we illuminate the basis for high selectivity of template use and unique roles for each of the three zinc-finger domains in nucleic acid recognition. Reverse transcriptase active site architecture is reinforced by an unanticipated insertion motif specific to vertebrate A-clade R2 proteins. Our work provides the first insights into A-clade R2 protein structure during gene insertion and may enable future improvement and adaptation of R2-based systems for precise transgene insertion. |
Downloads & links
-EMDB archive
Map data | 3.4 MB |
---|---|
Header (meta data) | 22.8 KB / 22.8 KB |
FSC (resolution estimation) | 8 KB |
Images | 78.7 KB |
Filedesc metadata | 7.5 KB |
Others | 33.1 MB / 33 MB |
Archive directory | HTTPS / FTP |
-Validation report
Summary document | 737.3 KB |
---|---|
Full document | 736.9 KB |
Data in XML | 13.5 KB |
Data in CIF | 19.2 KB |
Archive directory | HTTPS / FTP |
-Related structure data
Related structure data | 9nl4 (M, C) / 9nl2 (C) / 9nl3 (C). M: atomic model generated by this map. C: citing same article. |
---|
Map
Voxel size | X=Y=Z: 1.14 Å |
---|---|
Symmetry | Space group: 1 |
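The voxel size bounds the finest detail the map can represent. As a minimal sketch of that arithmetic, the snippet below applies the standard sampling rule (two voxels per period) to the 1.14 Å voxel size and compares the result with the 4.6 Å resolution reported for this entry. The function and variable names are illustrative assumptions, not part of any EMDB tool.

```python
# Values copied from this entry; names are hypothetical, chosen for illustration.
VOXEL_SIZE_A = 1.14           # Å per voxel (X=Y=Z)
REPORTED_RESOLUTION_A = 4.6   # Å, from the Method row


def nyquist_limit(voxel_size_angstrom: float) -> float:
    """Return the sampling (Nyquist) limit in Å: two voxels per period."""
    if voxel_size_angstrom <= 0:
        raise ValueError("voxel size must be positive")
    return 2.0 * voxel_size_angstrom


nyquist_a = nyquist_limit(VOXEL_SIZE_A)

# A ratio above 1 means the reported resolution is coarser than the
# sampling limit, i.e. the map is not limited by its voxel size.
ratio = REPORTED_RESOLUTION_A / nyquist_a

print(f"Nyquist limit: {nyquist_a:.2f} Å")
print(f"Reported resolution / Nyquist limit: {ratio:.2f}")
```

With these values the reported resolution sits well above the sampling limit, so the voxel size is not the limiting factor for this map.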
-Supplemental data
-Half map: #2
File | emd_49517_half_map_1.map |
---|---|
-Half map: #1
File | emd_49517_half_map_2.map |
---|---|
Sample components
-Entire : R2 retrotransposon protein after second strand nicking
Entire | Name: R2 retrotransposon protein after second strand nicking |
---|---|
-Supramolecule #1: R2 retrotransposon protein after second strand nicking
Supramolecule | Name: R2 retrotransposon protein after second strand nicking / type: complex / ID: 1 / Parent: 0 / Macromolecule list: #1-#5 |
---|---|
Molecular weight | Theoretical: 200 kDa |
-Macromolecule #1: R2 retrotransposon protein
Macromolecule | Name: R2 retrotransposon protein / type: protein_or_peptide / ID: 1 / Number of copies: 1 / Enantiomer: LEVO |
---|---|
Molecular weight | Theoretical: 127.729367 kDa |
Sequence | String: QKTIIQLPND NPACPFCGDH VGKPSALNVH LKRNHGGREV EFQCSMCNKA DPKAHSILCH IPKCKGKVTE EPTGDWACET CNKQFNTKS GLSQHKRIAH PAIRNQERIA ASQPKPNSQR GKHNSCWTVE EEQLLAAFNN MFWGKKNINI LISDHIHMKT A KQISEKRR LLGLNKNATV TTTNPLPVSS TCHLKIRTDS PNTTTGLKDT YMCKINENIV NQGQIKFDSE VISAWMAGDS NI RSLVEST SLDILSTFLM ETPKPRKKGN NKITNKKSGK KKKWMEKRAV KKGFYKRYQH LFETDRCKLA SIILDGTERL QCQ IPLTEI LETYKSKWET LTPFEGLGQF KSHAVADNTA FEILLSAKEI MKNIKEMNKN SAPGPDKVSL RDLLLADPEC NALE KLFNT WLITGIIPNS IKECRSLLIP KTADPEALKE LGNWRPLTIG SIVLRLFSRI ITNRLAKACP INARQRGFIA TPGCS ENLK ILHTIVKQAK TSKKSLGVVF VDIAKAFDSV SHDHIMWVLQ ERGLDQHIVN IIEDSYKKIH TRMEVGTERT PPIEIK VGV KQGDPMSPLL FNLAIDPLIT ALEKANTGFS YGKNKITSLA FADDLVMLSD TWEGMNKNIQ ILETFCNLSG LKVQAKK CY GFFLSPTHDS YTINKCDAWK IDKDSLNMIQ PGESEKYLGL KVDPWIGFSK PVLAEKLTIW LKRLTEAPLK PSQKLTML N IYTIPRIIYL ADHTDTKKTL LSSLDDNIRT VVKGWLHLPP DTCNGFIYTK TRDGGLGVTR LASLIPSIQA RRLHRIATS EDETIRNIAM ANNIEEEFQN LWVTAGGKKE EIPRITDPVS IDYRLPRRIL ELLNEWEKPA PKKMYPIPCN WREAEMAHWK NLPCQGSGI EHFDNDTISN DWLQFHRGFS ERQFLMGLKI RANVYPTREY QGRGRTNKNV NCRNCTASYE SLSHILGQCP A VQGARIRR HNKLCSMLKR EAKELKWVVY EEPHLHTTEK ELRKPDLIFV KEEMALVVDV TVRFEYKEKV FEDAAAEKVR HY KDLTSQI KELTGAKEIE YFGFPLGARG KWPEINEKVL TALGMPDYQQ KRTAKRFSKR TLLYSIDVIN TFENIGKNNK NNV P |
-Macromolecule #2: Bottom strand of target rDNA
Macromolecule | Name: Bottom strand of target rDNA / type: dna / ID: 2 / Number of copies: 1 / Classification: DNA |
---|---|
Source (natural) | Organism: synthetic construct (others) |
Molecular weight | Theoretical: 21.507758 kDa |
Sequence | String: (DT)(DT)(DA)(DG)(DA)(DT)(DG)(DA)(DC)(DG) (DA)(DG)(DG)(DC)(DA)(DT)(DT)(DT)(DG)(DG) (DC)(DT)(DA)(DC)(DC)(DT)(DT)(DA)(DA) (DG)(DA)(DG)(DA)(DG)(DT)(DC)(DA)(DT)(DA) (DG) (DT)(DT)(DA)(DC)(DT)(DC)(DC)(DC) (DG)(DC)(DC)(DG)(DT)(DT)(DT)(DA)(DC)(DC) (DC)(DG) (DC)(DG)(DC)(DT)(DT)(DC)(DA) (DC)(DA)(DG) |
-Macromolecule #3: complementary DNA
Macromolecule | Name: complementary DNA / type: dna / ID: 3 / Number of copies: 1 / Classification: DNA |
---|---|
Source (natural) | Organism: synthetic construct (others) |
Molecular weight | Theoretical: 22.722555 kDa |
Sequence | String: (DG)(DG)(DC)(DT)(DA)(DT)(DT)(DT)(DT)(DC) (DC)(DG)(DA)(DA)(DC)(DA)(DC)(DA)(DT)(DA) (DT)(DA)(DA)(DT)(DT)(DA)(DA)(DT)(DA) (DT)(DA)(DT)(DG)(DT)(DT)(DC)(DC)(DT)(DT) (DT) (DT)(DC)(DC)(DG)(DG)(DG)(DT)(DT) (DA)(DA)(DG)(DT)(DA)(DA)(DA)(DG)(DG)(DT) (DG)(DG) (DC)(DC)(DC)(DG)(DT)(DC)(DC) (DA)(DC)(DC)(DT)(DT)(DG)(DC) |
-Macromolecule #5: Top strand for target rDNA
Macromolecule | Name: Top strand for target rDNA / type: dna / ID: 5 / Number of copies: 1 / Classification: DNA |
---|---|
Source (natural) | Organism: synthetic construct (others) |
Molecular weight | Theoretical: 21.654875 kDa |
Sequence | String: (DC)(DT)(DG)(DT)(DG)(DA)(DA)(DG)(DC)(DG) (DC)(DG)(DG)(DG)(DT)(DA)(DA)(DA)(DC)(DG) (DG)(DC)(DG)(DG)(DG)(DA)(DG)(DT)(DA) (DA)(DC)(DT)(DA)(DT)(DG)(DA)(DC)(DT)(DC) (DT) (DC)(DT)(DT)(DA)(DA)(DG)(DG)(DT) (DA)(DG)(DC)(DC)(DA)(DA)(DA)(DT)(DG)(DC) (DC)(DT) (DC)(DG)(DT)(DC)(DA)(DT)(DC) (DT)(DA)(DA) |
-Macromolecule #4: 3'UTR RNA
Macromolecule | Name: 3'UTR RNA / type: rna / ID: 4 / Number of copies: 1 |
---|---|
Source (natural) | Organism: synthetic construct (others) |
Molecular weight | Theoretical: 23.837215 kDa |
Sequence | String: GCAAGGUGGA CGGGCCACCU UUACUUAACC CGGAAAAGGA ACAUAUAUUA AUUAUAUGUG UUCGGAAAAU AGCC |
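
The DNA entries above use a parenthesized residue notation, e.g. `(DG)(DG)(DC)`, while the RNA is given as plain letters. As a minimal sketch, the snippet below collapses the parenthesized notation into a plain string and tests whether the complementary DNA (Macromolecule #3) is the full-length reverse complement of the 3'UTR RNA (Macromolecule #4). Both sequences are copied from this entry. The function names are illustrative assumptions, not part of any EMDB tool.

```python
import re

# Macromolecule #3 (complementary DNA), copied from this entry.
CDNA_PAREN = (
    "(DG)(DG)(DC)(DT)(DA)(DT)(DT)(DT)(DT)(DC) "
    "(DC)(DG)(DA)(DA)(DC)(DA)(DC)(DA)(DT)(DA) "
    "(DT)(DA)(DA)(DT)(DT)(DA)(DA)(DT)(DA) "
    "(DT)(DA)(DT)(DG)(DT)(DT)(DC)(DC)(DT)(DT) (DT) "
    "(DT)(DC)(DC)(DG)(DG)(DG)(DT)(DT) "
    "(DA)(DA)(DG)(DT)(DA)(DA)(DA)(DG)(DG)(DT) (DG)(DG) "
    "(DC)(DC)(DC)(DG)(DT)(DC)(DC) (DA)(DC)(DC)(DT)(DT)(DG)(DC)"
)

# Macromolecule #4 (3'UTR RNA), copied from this entry.
UTR_RNA = (
    "GCAAGGUGGA CGGGCCACCU UUACUUAACC CGGAAAAGGA "
    "ACAUAUAUUA AUUAUAUGUG UUCGGAAAAU AGCC"
)


def parse_paren_dna(text: str) -> str:
    """Collapse '(DA)(DC)...' residue notation into a plain 'AC...' string."""
    return "".join(re.findall(r"\(D([ACGT])\)", text))


def reverse_complement_as_dna(rna: str) -> str:
    """Return the DNA strand that would base-pair with the given RNA."""
    pairs = {"A": "T", "C": "G", "G": "C", "U": "A"}
    return "".join(pairs[base] for base in reversed(rna.replace(" ", "")))


cdna = parse_paren_dna(CDNA_PAREN)
rna = UTR_RNA.replace(" ", "")
is_full_copy = cdna == reverse_complement_as_dna(rna)

print(len(cdna), len(rna), is_full_copy)
```

The same `parse_paren_dna` helper can be applied to the top and bottom target rDNA strands if plain-letter sequences are needed for alignment tools.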
-Macromolecule #6: ZINC ION
Macromolecule | Name: ZINC ION / type: ligand / ID: 6 / Number of copies: 4 / Formula: ZN |
---|---|
Molecular weight | Theoretical: 65.409 Da |
-Experimental details
-Structure determination
Method | cryo EM |
---|---|
Processing | single particle reconstruction |
Aggregation state | particle |
Sample preparation
Concentration | 0.1 mg/mL |
---|---|
Buffer | pH: 7.5 |
Vitrification | Cryogen name: ETHANE |
Electron microscopy
Microscope | TFS TALOS |
---|---|
Image recording | Film or detector model: GATAN K3 BIOQUANTUM (6k x 4k) / Digitization - Dimensions - Width: 11520 pixel / Digitization - Dimensions - Height: 8184 pixel / Number grids imaged: 1 / Number real images: 9195 / Average exposure time: 1.0 sec. / Average electron dose: 50.0 e/Å² |
Electron beam | Acceleration voltage: 200 kV |
Electron optics | Illumination mode: OTHER / Imaging mode: BRIGHT FIELD / Nominal defocus max: 2.5 µm / Nominal defocus min: 1.0 µm / Nominal magnification: 36000 |
Sample stage | Specimen holder model: FEI TITAN KRIOS AUTOGRID HOLDER / Cooling holder cryogen: NITROGEN |