Yorodumi
- EMDB-19398: Structure of the C. elegans Intron Lariat Spliceosome double-prim... -

Basic information

Entry
Database: EMDB / ID: EMD-19398
Title: Structure of the C. elegans Intron Lariat Spliceosome double-primed for disassembly (ILS'')
Map data
Sample
  • Complex: Intron lariat spliceosome (ILS'')
    • RNA: x 4 types
    • Protein or peptide: x 35 types
  • Ligand: x 4 types
Keywords: pre-mRNA splicing / intron lariat spliceosome / gene expression / spliceosome / splicing
Function / homology
Function and homology information


Function: RNA lariat debranching enzyme activator activity / feminization of hermaphroditic germ-line / molting cycle / regulation of primary miRNA processing / SLBP independent Processing of Histone Pre-mRNAs / Formation of TC-NER Pre-Incision Complex / Dual incision in TC-NER / Gap-filling DNA repair synthesis and ligation in TC-NER / SLBP Dependent Processing of Replication-Dependent Histone Pre-mRNAs / Transport of Mature mRNA derived from an Intron-Containing Transcript / snRNP Assembly / Downregulation of SMAD2/3:SMAD4 transcriptional activity / mRNA Splicing - Minor Pathway / germline cell cycle switching, mitotic to meiotic cell cycle / mRNA Splicing - Major Pathway / mRNA 3'-end processing / RNA Polymerase II Transcription Termination / vulval development / nematode larval development / egg-laying behavior / post-spliceosomal complex / U2-type post-mRNA release spliceosomal complex / spliceosomal complex disassembly / post-mRNA release spliceosomal complex / apoptotic DNA fragmentation / nuclear mRNA surveillance / generation of catalytic spliceosome for first transesterification step / nuclease activity / U12-type spliceosomal complex / Prp19 complex / snRNP binding / pICln-Sm protein complex / pre-mRNA binding / U2-type catalytic step 1 spliceosome / SMN-Sm protein complex / spliceosomal tri-snRNP complex / P granule / locomotion / mRNA cis splicing, via spliceosome / U2-type spliceosomal complex / commitment complex / U2-type catalytic step 2 spliceosome / embryo development ending in birth or egg hatching / U4 snRNP / U2 snRNP / U1 snRNP / U2-type prespliceosome / cyclosporin A binding / precatalytic spliceosome / generation of catalytic spliceosome for second transesterification step / spliceosomal complex assembly / germ cell development / uterus development / protein K63-linked ubiquitination / mRNA 3'-splice site recognition / spliceosomal tri-snRNP complex assembly / U5 snRNA binding / U5 snRNP / U2 snRNA binding / U6 snRNA binding / spliceosomal snRNP assembly / pre-mRNA intronic binding / U1 snRNA binding / U4/U6 x U5 tri-snRNP complex / catalytic step 2 spliceosome / RNA splicing / peptidylprolyl isomerase / helicase activity / peptidyl-prolyl cis-trans isomerase activity / RNA polymerase II transcription regulatory region sequence-specific DNA binding / spliceosomal complex / RING-type E3 ubiquitin transferase / mRNA splicing, via spliceosome / mRNA processing / rRNA processing / ubiquitin-protein transferase activity / metallopeptidase activity / ubiquitin protein ligase activity / protein folding / ribosome biogenesis / regulation of gene expression / DNA replication / nucleic acid binding / cell differentiation / RNA helicase activity / DNA-binding transcription factor activity, RNA polymerase II-specific / RNA helicase / cell division / intracellular membrane-bounded organelle / DNA repair / GTPase activity / mRNA binding / regulation of transcription by RNA polymerase II / positive regulation of DNA-templated transcription / GTP binding / apoptotic process / ATP hydrolysis activity / DNA binding / RNA binding / nucleoplasm
Domain/homology: Intron Large complex component GCFC2-like / Tuftelin interacting protein, N-terminal domain / GCF, C-terminal / Septin and tuftelin interacting protein / TFP11/STIP/Ntr1 / GC-rich sequence DNA-binding factor-like protein / Tuftelin interacting protein N terminal / mRNA splicing factor Cwf18-like / cwf18 pre-mRNA splicing factor / Nineteen complex-related protein 2 / Sde2, N-terminal ubiquitin domain / Yeast Splicing regulator sde2, N-terminal ubiquitin domain / Cwf19-like protein, C-terminal domain-2 / Cwf19-like, C-terminal domain-1 / Cwf19-like protein / Protein similar to CwfJ C-terminus 2 / Protein similar to CwfJ C-terminus 1 / DHX15, DEXH-box helicase domain / Pre-mRNA-splicing factor Isy1 / Pre-mRNA-splicing factor Isy1 superfamily / Isy1-like splicing family / Intron-binding protein aquarius, beta-barrel / Intron-binding protein aquarius insert domain / Pre-mRNA-splicing factor SPF27 / Torus domain / Breast carcinoma amplified sequence 2 (BCAS2) / Torus domain / mRNA splicing factor SYF2 / SYF2 splicing factor / CWF11 family / Intron-binding protein aquarius, N-terminal / Intron-binding protein aquarius N-terminal / Peptidyl-prolyl cis-trans isomerase E / Peptidyl-prolyl cis-trans isomerase E, RNA recognition motif / Myb-like domain profile. / Helix hairpin bin domain superfamily / G-patch domain / Cyclophilin-type peptidyl-prolyl cis-trans isomerase, cyclophilin A-like / Replication stress response SDE2 C-terminal / G-patch domain profile. / Pre-mRNA-processing factor 17 / G-patch domain / glycine rich nucleic binding domain / Pre-mRNA-splicing factor 19 / Pre-mRNA-processing factor 19 / Prp19/Pso4-like / STL11, N-terminal / U-box domain / DNA2/NAM7-like helicase / DNA2/NAM7 helicase, helicase domain / AAA domain / WD repeat Prp46/PLRG1-like / BUD31/G10-related, conserved site / G10 protein signature 1. / G10 protein signature 2. / SKI-interacting protein SKIP, SNW domain / SKI-interacting protein, SKIP / SKIP/SNW domain / Myb-like DNA-binding domain / Pre-mRNA-splicing factor Cwf15/Cwc15 / HAT (Half-A-TPR) repeat / Cwf15/Cwc15 cell cycle control protein / Pre-mRNA-splicing factor Cwc2/Slt11 / G10 protein / Pre-mRNA-splicing factor BUD31 / Pre-mRNA splicing factor component Cdc5p/Cef1, C-terminal / pre-mRNA splicing factor component / Small nuclear ribonucleoprotein D1 / U-box domain profile. / Modified RING finger domain / U-box domain / Zinc finger, CCCH-type superfamily / HIT-like superfamily / Leucine-rich repeat / Helicase associated domain (HA2), ratchet-like / zinc finger / Pre-mRNA-splicing factor Syf1-like / Snu114, GTP-binding domain / DEAD-box helicase, OB fold / Oligonucleotide/oligosaccharide-binding (OB)-fold / Helicase-associated domain / Helicase associated domain (HA2), winged-helix / Helicase associated domain (HA2) / 116kDa U5 small nuclear ribonucleoprotein component, N-terminal / 116kDa U5 small nuclear ribonucleoprotein component, C-terminal / 116 kDa U5 small nuclear ribonucleoprotein component N-terminus / Small nuclear ribonucleoprotein Sm D3 / Small nuclear ribonucleoprotein Sm D2
Component: Small nuclear ribonucleoprotein Sm D1 / Replication stress response regulator SDE2 / GCF C-terminal domain-containing protein / WD_REPEATS_REGION domain-containing protein / Cell division cycle 5-like protein / CWF19-like protein 1 homolog / TPR_REGION domain-containing protein / WD_REPEATS_REGION domain-containing protein / Spliceosome-associated protein CWC15 homolog / Protein BUD31 homolog / Pre-mRNA-splicing factor 8 homolog / Probable small nuclear ribonucleoprotein F / Pre-mRNA-splicing factor SYF1 / Probable small nuclear ribonucleoprotein-associated protein B / Pre-mRNA-splicing factor syf-2 / Pre-mRNA-processing factor 19 / CWF19-like protein 2 homolog / Small nuclear ribonucleoprotein Sm D3 / Septin and tuftelin-interacting protein 1 homolog / Peptidyl-prolyl cis-trans isomerase / Probable small nuclear ribonucleoprotein Sm D2 / WD_REPEATS_REGION domain-containing protein / Protein isy-1 / Pre-mRNA-splicing factor ATP-dependent RNA helicase ddx-15 / RRM domain-containing protein / Pre-mRNA-splicing factor RBM22 / Pre-mRNA-splicing factor SPF27 / Uncharacterized protein T27F2.1 / Tr-type G domain-containing protein / Coiled-coil domain-containing protein 12 / Probable U2 small nuclear ribonucleoprotein A' / Probable small nuclear ribonucleoprotein G / Intron-binding protein aquarius / Peptidyl-prolyl cis-trans isomerase E / Probable small nuclear ribonucleoprotein E
Biological species: Caenorhabditis elegans (invertebrata)
Method: single particle reconstruction / cryo EM / Resolution: 3.0 Å
Authors: Vorlaender MK / Rothe P / Plaschka C
Funding support: European Union, 1 item
Organization: European Research Council (ERC) / Country: European Union
Citation
Journal: Nature / Year: 2024
Title: Mechanism for the initiation of spliceosome disassembly.
Authors: Matthias K Vorländer / Patricia Rothe / Justus Kleifeld / Eric D Cormack / Lalitha Veleti / Daria Riabov-Bassat / Laura Fin / Alex W Phillips / Luisa Cochella / Clemens Plaschka
Abstract: Precursor-mRNA (pre-mRNA) splicing requires the assembly, remodelling and disassembly of the multi-megadalton ribonucleoprotein complex called the spliceosome. Recent studies have shed light on spliceosome assembly and remodelling for catalysis, but the mechanism of disassembly remains unclear. Here we report cryo-electron microscopy structures of nematode and human terminal intron lariat spliceosomes along with biochemical and genetic data. Our results uncover how four disassembly factors and the conserved RNA helicase DHX15 initiate spliceosome disassembly. The disassembly factors probe large inner and outer spliceosome surfaces to detect the release of ligated mRNA. Two of these factors, TFIP11 and C19L1, and three general spliceosome subunits, SYF1, SYF2 and SDE2, then dock and activate DHX15 on the catalytic U6 snRNA to initiate disassembly. U6 therefore controls both the start and end of pre-mRNA splicing. Taken together, our results explain the molecular basis of the initiation of canonical spliceosome disassembly and provide a framework to understand general spliceosomal RNA helicase control and the discard of aberrant spliceosomes.
History
Deposition: Jan 11, 2024
Header (metadata) release: Aug 7, 2024
Map release: Aug 7, 2024
Update: Aug 21, 2024
Current status: Aug 21, 2024 / Processing site: PDBe / Status: Released

Map

File: emd_19398.map.gz / Format: CCP4 / Size: 41.4 MB / Type: IMAGE STORED AS FLOATING POINT NUMBER (4 BYTES)
Voxel size: X=Y=Z: 1.3013 Å
Density
Contour level: By AUTHOR: 0.5
Minimum - Maximum: 0.0 - 10.125123
Average (Standard dev.): 0.069810666 (±0.2813843)
Symmetry: Space group: 1
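
A contour level is often easier to compare between maps when it is expressed as a number of standard deviations above the mean density. The following is a minimal Python sketch that uses only the density statistics listed for this entry; the function name `contour_in_sigma` is illustrative and is not part of any EMDB or Yorodumi tool.

```python
def contour_in_sigma(contour: float, mean: float, std: float) -> float:
    """Return how many standard deviations the contour lies above the map mean."""
    if std <= 0:
        raise ValueError("standard deviation must be positive")
    return (contour - mean) / std


# Values copied from the density statistics of this entry.
sigma = contour_in_sigma(contour=0.5, mean=0.069810666, std=0.2813843)
print(f"author contour = {sigma:.2f} sigma above the mean")
```

With these numbers the author-recommended contour sits roughly 1.5 standard deviations above the mean.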

Map geometry
Axis order: X / Y / Z
Origin: 104 / 135 / 104
Dimensions: 231 / 199 / 236
Spacing: 199 / 231 / 236
Cell: A: 258.9587 Å / B: 300.6003 Å / C: 307.1068 Å
α=β=γ: 90.0 °
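
The geometry values can be cross-checked against each other: each cell edge should equal the number of sampling intervals along that edge times the voxel size, and the voxel count times 4 bytes gives the size of the raw density data. The sketch below is an illustration under stated assumptions: the run-together digits in the Origin, Dimensions and Spacing rows are read as three three-digit integers each, and the helper names are hypothetical.

```python
import math

VOXEL_SIZE_A = 1.3013                     # Å per voxel, from the "Voxel size" row
SPACING = (199, 231, 236)                 # assumed split of "199231236"
DIMENSIONS = (231, 199, 236)              # assumed split of "231199236"
CELL_A = (258.9587, 300.6003, 307.1068)   # cell edges A, B, C in Å


def cell_from_sampling(spacing, voxel_size):
    """Cell edge lengths implied by the sampling intervals and the voxel size."""
    return tuple(n * voxel_size for n in spacing)


def map_data_bytes(dimensions, bytes_per_voxel=4):
    """Size of the density array alone (no file header), in bytes."""
    return math.prod(dimensions) * bytes_per_voxel


computed = cell_from_sampling(SPACING, VOXEL_SIZE_A)
consistent = all(
    math.isclose(c, ref, abs_tol=1e-3) for c, ref in zip(computed, CELL_A)
)
print("cell matches spacing x voxel size:", consistent)
print("density data size (MiB):", round(map_data_bytes(DIMENSIONS) / 2**20, 1))
```

Under this reading the three cell edges are reproduced to within rounding, and the density array comes to about 41.4 MiB, which is close to the listed file size and suggests that figure refers to the uncompressed map.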

Sample components

Entire: Intron lariat spliceosome (ILS'')

Entire name: Intron lariat spliceosome (ILS'')
Components
  • Complex: Intron lariat spliceosome (ILS'')
    • RNA: U2 snRNA
    • RNA: U5 snRNA
    • RNA: U6 snRNA
    • Protein or peptide: Pre-mRNA-splicing factor 8 homolog
    • Protein or peptide: Tr-type G domain-containing protein
    • Protein or peptide: Protein isy-1
    • Protein or peptide: Pre-mRNA-splicing factor ATP-dependent RNA helicase ddx-15
    • Protein or peptide: WD_REPEATS_REGION domain-containing protein
    • Protein or peptide: Pre-mRNA-splicing factor SYF1
    • RNA: Intron lariat RNA
    • Protein or peptide: TPR_REGION domain-containing protein
    • Protein or peptide: Pre-mRNA-splicing factor SPF27
    • Protein or peptide: Cell division cycle 5-like protein
    • Protein or peptide: CWF19-like protein 1 homolog
    • Protein or peptide: CWF19-like protein 2 homolog
    • Protein or peptide: Pre-mRNA-splicing factor syf-2
    • Protein or peptide: Protein BUD31 homolog
    • Protein or peptide: Pre-mRNA-splicing factor RBM22
    • Protein or peptide: Spliceosome-associated protein CWC15 homolog
    • Protein or peptide: GCF C-terminal domain-containing protein
    • Protein or peptide: Intron-binding protein aquarius
    • Protein or peptide: Uncharacterized protein T27F2.1
    • Protein or peptide: Peptidyl-prolyl cis-trans isomerase
    • Protein or peptide: WD_REPEATS_REGION domain-containing protein
    • Protein or peptide: Septin and tuftelin-interacting protein 1 homolog
    • Protein or peptide: WD_REPEATS_REGION domain-containing protein
    • Protein or peptide: Replication stress response regulator SDE2
    • Protein or peptide: Coiled-coil domain-containing protein 12
    • Protein or peptide: Small nuclear ribonucleoprotein Sm D3
    • Protein or peptide: Probable small nuclear ribonucleoprotein-associated protein B
    • Protein or peptide: Small nuclear ribonucleoprotein Sm D1
    • Protein or peptide: Probable small nuclear ribonucleoprotein Sm D2
    • Protein or peptide: Probable small nuclear ribonucleoprotein E
    • Protein or peptide: Probable small nuclear ribonucleoprotein F
    • Protein or peptide: Probable small nuclear ribonucleoprotein G
    • Protein or peptide: Probable U2 small nuclear ribonucleoprotein A'
    • Protein or peptide: RRM domain-containing protein
    • Protein or peptide: Pre-mRNA-processing factor 19
    • Protein or peptide: Peptidyl-prolyl cis-trans isomerase E
  • Ligand: MAGNESIUM ION
  • Ligand: INOSITOL HEXAKISPHOSPHATE
  • Ligand: GUANOSINE-5'-TRIPHOSPHATE
  • Ligand: ZINC ION

Supramolecule #1: Intron lariat spliceosome (ILS'')

Supramolecule name: Intron lariat spliceosome (ILS'') / type: complex / ID: 1 / Parent: 0 / Macromolecule list: #1-#39
Source (natural) organism: Caenorhabditis elegans (invertebrata)

Macromolecule #1: U2 snRNA

Macromolecule name: U2 snRNA / type: rna / ID: 1
Details: Full sequence: AUCGCUUCUUCGGCUUAUUAGCUAAGAUCAAAGUGUAGUAUCUGUUCUUAUCGUAUUAACCUACGGUAUACACUCGAAUGAGUGUAAUAAAGGUUAUAUGAUUUUUGGAACCUAGGGAAGACUCGGGGCUUGCUCCGACUUCCCAAGGGUCGUCCUGGCGUUGCACUGCUGCCGGGCUCGGCCCAGUCCCC
Number of copies: 1
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 68.422445 kDa
Sequence string:
AUCGCUUCUU CGGCUUAUUA GCUAAGAUCA AAGUGUAGUA UCUGUUCUUA UCGUAUUAAC CUACGGUAUA CACUCGAAUG AGUGUAAUA AAGGUUAUAU GAUUUUUGGA ACCUAGGGAA GACUCGGGGC UUGCUCCGAC UUCCCAAGGG UCGUCCUGGC G UUGCACUG CUGCCGGGCU CGGCCCAGUC CCC(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)(N)(N) (N)

Macromolecule #2: U5 snRNA

Macromolecule name: U5 snRNA / type: rna / ID: 2 / Number of copies: 1
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 35.826145 kDa
Sequence string:
ACUCUGGUUC CUCUGCAUUU AACCGUGAAA AUCUUUCGCC UUUUACUAAA GAUUUCCGUG CAAAGGAGCA UACAUUGAGU ACAAUUUUU GGAGUCCCCU CGAGAGAGCG GGA

Macromolecule #3: U6 snRNA

Macromolecule name: U6 snRNA / type: rna / ID: 3 / Number of copies: 1
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 32.483355 kDa
Sequence string:
GUUCUUCCGA GAACAUAUAC UAAAAUUGGA ACAAUACAGA GAAGAUUAGC AUGGCCCCUG CGCAAGGAUG ACACGCAAAU UCGUGAAGC GUUCCAAAUU UU
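
Sequences in this entry are printed in whitespace-separated groups, so a residue count has to ignore the spaces. The sketch below counts the nucleotides of the U6 snRNA string shown above and derives the mean mass per nucleotide from the listed theoretical molecular weight; the function name `residue_count` is illustrative only.

```python
def residue_count(grouped: str) -> int:
    """Count residues in a sequence printed in whitespace-separated groups."""
    return len("".join(grouped.split()))


# U6 snRNA sequence string, copied as displayed in this entry.
U6_SNRNA = (
    "GUUCUUCCGA GAACAUAUAC UAAAAUUGGA ACAAUACAGA GAAGAUUAGC AUGGCCCCUG "
    "CGCAAGGAUG ACACGCAAAU UCGUGAAGC GUUCCAAAUU UU"
)
U6_WEIGHT_KDA = 32.483355  # theoretical molecular weight listed for U6 snRNA

n_nt = residue_count(U6_SNRNA)
mean_mass_da = U6_WEIGHT_KDA * 1000 / n_nt
print(f"{n_nt} nt, mean mass per nucleotide = {mean_mass_da:.1f} Da")
```

A mean mass in the low 300s of daltons per nucleotide is what one would expect for RNA, which serves as a quick sanity check that the displayed string and the listed weight describe the same molecule.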

Macromolecule #10: Intron lariat RNA

Macromolecule name: Intron lariat RNA / type: rna / ID: 10 / Number of copies: 1
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 10.348173 kDa
Sequence string:
GU(N)(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N) (N) (N)(N)(N)(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)A(N) (N)(N)
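
Sequence strings such as the one above mix plain letters with parenthesised `(N)` entries. A minimal sketch for splitting such a string into residues is shown below; it assumes that a parenthesised letter marks a position whose identity is not specified in the model, which is an interpretation and is not stated on this page. The function names are hypothetical.

```python
import re

# One token is either a parenthesised letter such as "(N)" or a bare letter.
TOKEN = re.compile(r"\([A-Za-z]\)|[A-Za-z]")


def tokenize(sequence: str) -> list:
    """Split a displayed sequence string into residue tokens, ignoring spaces."""
    return TOKEN.findall(sequence)


def count_placeholders(sequence: str) -> int:
    """Count tokens written in parentheses (assumed to be unspecified residues)."""
    return sum(1 for token in tokenize(sequence) if token.startswith("("))


example = "GU(N)(N) A(N)"
print(tokenize(example))
print(count_placeholders(example), "placeholder positions")
```

Keeping the parentheses on placeholder tokens avoids confusing them with a literal `N`, which matters for protein sequences where `N` is asparagine.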

Macromolecule #4: Pre-mRNA-splicing factor 8 homolog

Macromolecule name: Pre-mRNA-splicing factor 8 homolog / type: protein_or_peptide / ID: 4 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 272.396156 kDa
Sequence string:
MANYGGHPQT EPHAIPDSIL EEKSRKWKQL QGKRYSEKKK FGMSDTQKEE MPPEHVRKVI RDHGDMTSRK YRHDKRVYLG ALKYMPHAV LKLLENMPMP WEQIRDVKVL YHITGAITFV NDIPRVIEPV YMAQWGTMWI MMRREKRDRR HFKRMRFPPF D DEEPPLDY ADNILDVEPL EPIQMELDPE EDGAVAEWFY DHKPLATTRF VNGPTYRKWA FSIPQMSTLY RLANQLLTDL VD DNYFYLF DMKSFFTAKA LNVAIPGGPK FEPLVKDLHT DEDWNEFNDI NKVIIRAPIR TEYRIAFPFM YNNLISSLPV QVS WYHTPS VVFIKTEDPD LPAFYYDPLI NPIVLSNLKA TEENLPEGEE EDEWELPEDV RPIFEDVPLY TDNTANGLAL LWAP RPFNL RSGRTRRAVD VPLVKSWYRE HCPAGMPVKV RVSYQKLLKV FVLNALKHRP PKPQKRRYLF RSFKATKFFQ TTTLD WVEA GLQVLRQGYN MLNLLIHRKN LNYLHLDYNF NLKPVKTLTT KERKKSRFGN AFHLCREILR LTKLVVDAHV QYRLNN VDA YQLADGLQYI FAHVGQLTGM YRYKYKLMRQ VRMCKDLKHL IYYRFNTGPV GKGPGCGFWA PGWRVWLFFL RGITPLL ER WLGNLLSRQF EGRHSKGVAK TVTKQRVESH FDLELRAAVM HDILDMMPDG IKQNKARVIL QHLSEAWRCW KANIPWKV P GLPTPVENMI LRYVKAKADW WTNSAHYNRE RVRRGATVDK TVCKKNLGRL TRLYLKSEQE RQHNYLKDGP YISAEEAVA IYTTTVHWLE SRRFSPIPFP PLSYKHDTKL LILALERLKE SYSVKNRLNQ SQREELALIE QAYDNPHEAL SRIKRHMLTQ RAFKEVGIE FMDLYTHLIP VYDIEPLEKV TDAYLDQYLW YEADKRRLFP AWVKPGDTEP PPLLTYKWCQ GLNNLQDVWE T SEGECNVI METKLEKIAE KMDLTLLNRL LRLIVDHNIA DYMTSKNNVL INYKDMNHTN SFGIIRGLQF ASFIVQFYGL VL DLLVLGL RRASEIAGPP QCPNEFLQFQ DVATEIGHPI RLYCRYIDRV WIMFRFSADE ARDLIQRYLT EHPDPNNENI VGY NNKKCW PRDARMRLMK HDVNLGRAVF WDIKNRLPRS ITTVEWENSF VSVYSKDNPN MLFDMSGFEC RILPKCRTAN EEFV HRDGV WNLQNEVTKE RTAQCFLKVD EESLSKFHNR IRQILMSSGS TTFTKIVNKW NTALIGLMTY FREAVVNTQE LLDLL VKCE NKIQTRIKIG LNSKMPSRFP PVVFYTPKEI GGLGMLSMGH VLIPQSDLRW MQQTEAGGVT HFRSGMSHDE DQLIPN LYR YIQPWEAEFV DSVRVWAEYA LKRQEANAQN RRLTLEDLDD SWDRGIPRIN TLFQKDRHTL AYDKGWRVRT EFKAYQI LK QNPFWWTHQR HDGKLWNLNN YRTDMIQALG GVEGILEHTL FRGTYFPTWE GLFWERASGF EESMKFKKLT NAQRSGLN Q IPNRRFTLWW SPTINRANVY VGFQVQLDLT GIFMHGKIPT LKISLIQIFR AHLWQKIHES VVMDLCQVFD QELDALEIQ TVQKETIHPR KSYKMNSSCA DVLLFAQYKW NVSRPSLMAD SKDVMDNTTT QKYWLDVQLR WGDYDSHDVE RYARAKFLDY TTDNMSIYP SPTGVLIAID LAYNLYSAYG NWFPGMKPLI RQAMAKIIKA NPAFYVLRER IRKGLQLYSS EPTEPYLTSQ N YGELFSNQ IIWFVDDTNV YRVTIHKTFE GNLTTKPING 
AIFIFNPRTG QLFLKIIHTS VWAGQKRLSQ LAKWKTAEEV AA LIRSLPV EEQPRQIIVT RKAMLDPLEV HLLDFPNIVI KGSELMLPFQ AIMKVEKFGD LILKATEPQM VLFNLYDDWL KTI SSYTAF SRVVLIMRGM HINPDKTKVI LKPDKTTITE PHHIWPTLSD DDWIKVELAL KDMILADYGK KNNVNVASLT QSEV RDIIL GMEISAPSQQ RQQIADIEKQ TKEQSQVTAT TTRTVNKHGD EIITATTSNY ETASFASRTE WRVRAISSTN LHLRT QHIY VNSDDVKDTG YTYILPKNIL KKFITISDLR TQIAGFMYGV SPPDNPQVKE IRCIVLVPQT GSHQQVNLPT QLPDHE LLR DFEPLGWMHT QPNELPQLSP QDVTTHAKLL TDNISWDGEK TVMITCSFTP GSVSLTAYKL TPSGYEWGKA NTDKGNN PK GYMPTHYEKV QMLLSDRFLG YFMVPSNGVW NYNFQGQRWS PAMKFDVCLS NPKEYYHEDH RPVHFHNFKA FDDPLGTG S ADREDAFA

UniProtKB: Pre-mRNA-splicing factor 8 homolog

Macromolecule #5: Tr-type G domain-containing protein

Macromolecule name: Tr-type G domain-containing protein / type: protein_or_peptide / ID: 5 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 110.612859 kDa
Sequence string:
MDSDLYDEFG NYIGPELDSD DDAGDIDDNG DDEDRSDVDE DDEPDRMEED DAEEIPQNQV VLHEDKKYYA TALEVYGEGV ETLVQEEDA QPLTEPIVKP VSKKKFQAAE RFLPETVYKK EYLADLMDCP HIMRNVAIAG HLHHGKTTFL DCLMEQTHPE F YRAEDADA RFTDILFIEK QRGCSIKSQP VSIVAQDSRS KSYLLNIIDT PGHVNFSDEM TASYRLADGV VVMVDAHEGV MM NTERAIR HAIQERLAVT LCISKIDRLL LELKLPPADA YFKLRLIIDQ VNNILSTFAE EDVPVLSPLN GNVIFSSGRY NVC FSLLSF SNIYAKQHGD SFNSKEFARR LWGDIYFEKK TRKFVKKSPS HDAPRTFVQF ILEPMYKIFS QVVGDVDTCL PDVM AELGI RLSKEEQKMN VRPLIALICK RFFGDFSAFV DLVVQNIKSP LENAKTKIEQ TYLGPADSQL AQEMQKCNAE GPLMV HTTK NYPVDDATQF HVFGRVMSGT LEANTDVRVL GENYSIQDEE DCRRMTVGRL FVRVASYQIE VSRVPAGCWV LIEGID QPI VKTATIAELG YEEDVYIFRP LKFNTRSCVK LAVEPINPSE LPKMLDGLRK VNKSYPLLTT RVEESGEHVL LGTGEFY MD CVMHDMRKVF SEIDIKVADP VVTFNETVIE TSTLKCFAET PNKKNKITMM AEPLEKQLDE DIENEVVQIG WNRRRLGE F FQTKYNWDLL AARSIWAFGP DTTGPNILLD DTLPSEVDKH LLSTVRESLV QGFQWATREG PLCEEPIRQV KFKLLDAAI ATEPLYRGGG QMIPTARRCA YSAFLMATPR LMEPYYTVEV VAPADCVAAV YTVLAKRRGH VTTDAPMPGS PMYTISAYIP VMDSFGFET DLRIHTQGQA FCMSAFHHWQ LVPGDPLDKS IVIKTLDVQP TPHLAREFMI KTRRRKGLSE DVSVNKFFDD P MLLELAKQ QDYTGF

UniProtKB: Tr-type G domain-containing protein

Macromolecule #6: Protein isy-1

Macromolecule name: Protein isy-1 / type: protein_or_peptide / ID: 6 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 31.326172 kDa
Sequence string:
MARNAEKAMT ALARWRRMKE EEERGPIARR PHDVKDCRNL SDAERFRREI VRDASKKITA IQNPGLGEFK LRDLNDEVNR LIKLKHAWE QRIRELGGTD YRKYAQKELD AIGRETGNSR GYKYFGAAKD LPGVRELFEK STEGEEQRRH RADLLRNIDA H YFGYLDDE DGRLIPLEKL IEEKNIERIN KEFAEKQAQK QQTASDAAPE NIYKVEEDDD DDLETQESTV IGEDGRPMTI RH VLLPTQQ DIEEMLLEQK KQELMAKYLD

UniProtKB: Protein isy-1

Macromolecule #7: Pre-mRNA-splicing factor ATP-dependent RNA helicase ddx-15

Macromolecule name: Pre-mRNA-splicing factor ATP-dependent RNA helicase ddx-15 / type: protein_or_peptide / ID: 7 / Number of copies: 1 / Enantiomer: LEVO / EC number: RNA helicase
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 84.495477 kDa
Sequence string:
MSSRHRLDLD GSGRGDRRRS PNRRSRSRSR SPHRRSSPDR KRQIGAVGNM KIQINPYNNQ PFSNRYWAIW EKRSQLPVWE YKEKFMELL RNNQCITLVG ETGSGKTTQI PQWAVEFMKQ QQQGQPPGQA RLVACTQPRR VAAMSVATRV AEEMDVVLGQ E VGYSIRFE DCISERTVLK YCTDGMLLRE AMNSPLLDKY KVLILDEAHE RTLATDILMG LIKEIVRNRA DIKVVIMSAT LD AGKFQRY FEDCPLLSVP GRTFPVEIFF TPNAEKDYLE AAIRTVIQIH MVEEVEGDIL LFLTGQEEIE EACKRIDREI QAL GADAGA LSCIPLYSTL PPAAQQRIFE PAPPNRPNGA ISRKCVISTN IAETSLTIDG VVFVIDPGFS KQKVYNPRIR VESL LVCPI SKASAMQRAG RAGRTKPGKC FRLYTETAYG SEMQDQTYPE ILRSNLGSVV LQLKKLGTED LVHFDFMDPP APETL MRAL ELLNYLQAIN DDGELTELGS LMAEFPLDPQ LAKMLITSTE LNCSNEILSI TAMLSVPQCW VRPNEMRTEA DEAKAR FAH IDGDHLTLLN VYHSFKQNQE DPQWCYDNFI NYRTMKTADT VRTQLSRVMD KYNLRRVSTD FKSRDYYLNI RKALVAG FF MQVAHLERSG HYVTVKDNQL VNLHPSTVLD HKPEWALYNE FVLTTKNFIR TVTDVRPEWL LQIAPQYYDL DNFPDGDT K RKLTTVMQTL QRNAGRGY

UniProtKB: Pre-mRNA-splicing factor ATP-dependent RNA helicase ddx-15

Macromolecule #8: WD_REPEATS_REGION domain-containing protein

Macromolecule name: WD_REPEATS_REGION domain-containing protein / type: protein_or_peptide / ID: 8 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 36.865559 kDa
Sequence string:
MALVTSSGQQ LVSSGFPQQT AQRFSNLMAP TMVLLGHEGE IYTGAFSPDG TCLATSGYDQ KIFFWNVYGE CENFSTIKGH SGAVMDLKF TTDSSSLVSC GTDKSVRVWD METGTCARRF RTHTDFVNAV HPSRRGVTLV ASASDDGTCR VHDMRTKEPV K TYTNRYQQ TAVTFNDSSD QVISGGIDNV LKVWDMRRDE ITYTLTGHRD TITGISLSPS GKFIISNSMD CTVRQWDIRP FV PGQRSVG VFAGHNHNFE KNLLKCSWSP CERFITAGSS DRFLYVWETL SKKIVYKLPG HMGSVNCTDF HPKEPIMLSC GSD KRVFLG EIDMS

UniProtKB: WD_REPEATS_REGION domain-containing protein

Macromolecule #9: Pre-mRNA-splicing factor SYF1

Macromolecule name: Pre-mRNA-splicing factor SYF1 / type: protein_or_peptide / ID: 9 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 99.675094 kDa
Sequence string:
MADKENATKI EKMPNSETMK GISSEDVPFE EDIIRNPTSV NCWQRYIDHK LQNKSPAKQM FLIYERALAV FERSYKLWYH YLKYRESTI VNKCPTDNSW RALCDTYERC LMRLHKMPRI WICYCEVMIK RGLITETRRV FDRALRSLPV TQHMRIWTLY I GFLTSHDL PETTIRVYRR YLKMNPKARE DYVEYLIERD QIDEAAKELT TLVNQDQNVS EKGRTAHQLW TQLCDLISKN PV KIFSLNV DAIIRQGIYR YTDQVGFLWC SLADYYIRSA EFERARDVYE EAIAKVSTVR DFAQVYDAYA AFEEREVSIM MQE VEQSGD PEEEVDLEWM FQRYQHLMER KNELMNSVLL RQNPHNVGEW LNRVNIYEGN YNKQIETFKE AVKSVNPKIQ VGKV RDLWI GLAKLYEDNG DLDAARKTFE TAVISQFGGV SELANVWCAY AEMEMKHKRA KAALTVMQRA CVVPKPGDYE NMQSV QARV HRSPILWAMY ADYEECCGTV ESCRKVYDKM IELRVASPQM IMNYAMFLEE NEYFELAFQA YEKGIALFKW PGVFDI WNT YLVKFIKRYG GKKLERARDL FEQCLENCPP THAKYIFLLY AKLEEEHGLA RHALSIYNRA CSGVDRADMH SMYNIYI KK VQEMYGIAQC RPIFERAISE LPEDKSRAMS LRYAQLETTV GEIDRARAIY AHAAEISDPK VHVKFWDTWK NFEVAHGN E ATVRDMLRVR RSVEASYNVN VTLTSVQMRV DAERKAQETT TSSNPMDSLD QQQQQPSDGA GSITQVSMNK GNISFVRGA GKTVQQNTTE NPDEIDLDED DDDEEDDGGD ADISVKVVPA QIFGNLKLAE EEEEA

UniProtKB: Pre-mRNA-splicing factor SYF1

Macromolecule #11: TPR_REGION domain-containing protein

Macromolecule name: TPR_REGION domain-containing protein / type: protein_or_peptide / ID: 11 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 88.116 kDa
Sequence string:
MSDDEAAVPG NKPIRLPKKA AKVKNKAPAQ LQITAEQLLR EAKERELELI PPAPKTKITD PDELKEYQRK KRKEFEDGIR KNRMQLANW IKYGKWEESI GEIQRARSVF ERALDVDHRS ISIWLQYAEM EMRCKQINHA RNVFDRAITI MPRAMQFWLK Y SYMEEVIE NIPGARQIFE RWIEWEPPEQ AWQTYINFEL RYKEIDRARS VYQRFLHVHG INVQNWIKYA KFEERNGYIG NA RAAYEKA MEYFGEEDIN ETVLVAFALF EERQKEHERA RGIFKYGLDN LPSNRTEEIF KHYTQHEKKF GERVGIEDVI ISK RKTQYE KMVEENGYNY DAWFDYLRLL ENEETDREEV EDVYERAIAN IPPHSEKRYW RRYIYLWINY ALYEELVAKD FDRA RQVYK ACIDIIPHKT FTFAKVWIMF AHFEIRQLDL NAARKIMGVA IGKCPKDKLF RAYIDLELQL REFDRCRKLY EKFLE SSPE SSQTWIKFAE LETLLGDTDR SRAVFTIAVQ QPALDMPELL WKAYIDFEIA CEEHEKARDL YETLLQRTNH IKVWIS MAE FEQTIGNFEG ARKAFERANQ SLENAEKEER LMLLEAWKEC ETKSGDQEAL KRVETMMPRR VKKRRQIQTE DGVDAGW EE YFDYIFPQDQ AAKGSFKLLE AAARWKRERE EAAARAAQEL DAPIPEGDDD EEKEEAGKDA EEKVREGDSD TDLSESSS S SDSESSSSSS SDSSDSSDDD EDK

UniProtKB: TPR_REGION domain-containing protein

Macromolecule #12: Pre-mRNA-splicing factor SPF27

Macromolecule name: Pre-mRNA-splicing factor SPF27 / type: protein_or_peptide / ID: 12 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 27.679885 kDa
Sequence string:
MSSKPLALTG GSGSSQLQDD QVLVDALPYL DTEYNEADRQ LAMKLVEHEC KTFRPTKNYL THLPVPDYDA FLTKCMLKEM DRMKKKEEM GKLDMSRCEL PAPSAVKGVD RKLWAKVLRN AKAQNEHLLM RQINLELMDE YAAESYLQRN KVMEDLLTHA E KELRKTKE AVMEVHANRK MAQLKAGEKV KQLEQSWVSM VTNNYRMEME NRQIDSDNRK QIKALKLDPT KLDDKEDQEN

UniProtKB: Pre-mRNA-splicing factor SPF27

Macromolecule #13: Cell division cycle 5-like protein

Macromolecule name: Cell division cycle 5-like protein / type: protein_or_peptide / ID: 13 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 85.843469 kDa
Sequence string:
MVRVIIKGGV WKNTEDEILK AAIMKYGKNQ WSRIASLLHR KSAKQCKARW FEWLDPGIKK TEWSREEDEK LLHLAKLMPT QWRTIAPIV GRTSAQCLER YEHLLDEAQR KAEGLDEEAT ETRKLKPGEI DPTPETKPAR PDPIDMDDDE LEMLSEARAR L ANTQGKKA KRKARERQLS DARRLASLQK RREMRAAGLA FARKFKPKRN QIDYSEEIPF EKHVPAGFHN PSEDRYVVED AN QKAIEDH QKPRGREIEM EMRREDREKL KKRKEQGEAD AVFNIKEKKR SKLVLPEPQI SDRELEQIVK IGHASDSVRQ YID GTATSG LLTDYTESAR ANAVAARTMR TPMLKDTVQL ELENLMALQN TESALKGGLN TPLHESELGK GVLPTPKVAA TPNT VLHAI AATPGTQSQF PGSTPGGFAT PAGSVAATPF RDQMRINEEI AGSALEQKAS LKRALASLPT PKNDFEVVGP DDDEV EGAV EDESNQDEDG WIEDASERAE NKAKRNAENR VRNMKMRSQV IQRSLPKPTK VNEQATRATN SSADDMVKAE MSKLLA WDV DNKPPSVIYS REELDAAADL IKQEAESGPE LNSLMWKVVE QCTSEIILSK DKFTRIAILP REEQMKALND EFQMYRG WM NQRAKRAAKV EKKLRVKLGG YQAIHDKLCK KYQEVTTEIE MANIEKKTFE RLGEHELKAI NKRVGRLQQE VTTQETRE K DLQKMYSKLS NKQWKLSQIE IHDAASTTSA PITY

UniProtKB: Cell division cycle 5-like protein

Macromolecule #14: CWF19-like protein 1 homolog

Macromolecule name: CWF19-like protein 1 homolog / type: protein_or_peptide / ID: 14 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 59.034898 kDa
Sequence string:
MATQQAKILC CGDVNGNFVE LIKKISTTEK KNGPFDSLFC VGEFFGDDDD SNEKVINGNI EFPIPTYILG PANPRYSYLY PEESIEFSS NLTYLGKKGL LNTASGLQIA YLSGVEGSSK DLSCFDKADV EELLIPLGTQ VGFSGTDILL TSVWPADIAR H SHNQPSKP QPGSVLLSKL AAHLKPRYHF AGLGVHYERQ PYRNHRVLLE PARHTTRFIG LAAIGNPEKQ KWLYACNVKP MR KMEKEEL TAQPPNASEF PYRELLEEIA AKETLSRMNG NGQRPEGSQY RFEMGGAEDG AGNGRKRHND GGNDGPRNKQ PVG PCWFCL SNVDAEKHLV VAIGNKCYAA MPKGPLTEDH VMVLSVGHIQ SQVSAPVEVR DEIEKFKSAF TLMANKQGKA LVTF ERNFR TQHLQVQMVM IDKSSSKALK SSFTTAAACA GFELVTMGPD ESLLDMVNEG CPYFVAELPD GSKLFTRSMK GFPLH FGRE VLASTPILDC EDKVDWKACV LAKEKEVELV NKLKSDFKPF DFTAEDDSD

UniProtKB: CWF19-like protein 1 homolog

Macromolecule #15: CWF19-like protein 2 homolog

Macromolecule name: CWF19-like protein 2 homolog / type: protein_or_peptide / ID: 15 / Number of copies: 1 / Enantiomer: LEVO
Source (natural) organism: Caenorhabditis elegans (invertebrata)
Molecular weight (theoretical): 53.269211 kDa
Sequence string:
MFRRPDDDDD DYSGRKVIKP KPIAEKYAKK LGSEFSTGKT FVTGSDKSQK DFYGSQQVRN DVMKSSDSGG APLTEDEKNK LSAKILKAE MKGDTDLVKK LKRKLESGIS GDDEPPKSKS KEVTMMRRDR EGNILPASSR RSDSDRHGEG SSRMRREYEK S QDLDSMVR EEKTGTAGDQ LRLFERSLIK SSKIRRHDDE SVDDIAEMQK GKKKSDEKDK KRKEKESIKE HKRIERSFDD CS RCIDSSR LKKHNIIAVG INTYLAVVEW DGLDDEHLII VPTQHCSSTI QLDENVWDEM RLWRKGLVAV WKSQNRDCIF FEM SRHVDS NPHVFIECVP VEQEIGDMAS IYFKKAINEC EGEYMDNKKL IETKDLRRQI PKGFSYFAVD FGLSNGFAHV IESH DHFPS TFATEIIAGM LDLPPKKWRK RETDEMSKQK SRAENFKKLW EPVDWTKRLK NDSTK

UniProtKB: CWF19-like protein 2 homolog

Macromolecule #16: Pre-mRNA-splicing factor syf-2

MacromoleculeName: Pre-mRNA-splicing factor syf-2 / type: protein_or_peptide / ID: 16 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 27.719021 kDa
SequenceString:
MSSESQSSSS GPSSSGSKMK DFNQRFRDLH KLRQRARKEN HEQVVEEDRR SKLPKNHEAK KERDQWQVKE LQDRKAAEDK GLDYERVRS LEMSADVTEK LEQKRKRKKN PDQGFTSYED MTLRQHTRLT AALDPDLDSY KKMRECVGGE QFYPTADTLI H GNHYPTTA AMDKLTKDVH GQVKRREQYH RRRLYDPDAP IDYINEKNKK FNKKLDKYYG KYTEDIKDDL ERGTAI

UniProtKB: Pre-mRNA-splicing factor syf-2

Macromolecule #17: Protein BUD31 homolog

MacromoleculeName: Protein BUD31 homolog / type: protein_or_peptide / ID: 17 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 17.153879 kDa
SequenceString:
MSLATKLRRV RKSPPEGWDL IEPTLEQFEA KMREAETEPH EGKRKTEINW PIFRIHHQRS RYVYDMYYKK AEISRELYEF CLTAKFADA ALIAKWKKQG YENLCCVKCV NTRDSNFGTA CICRVPKSKL DAERVIECVH CGCHGCSG

UniProtKB: Protein BUD31 homolog

Macromolecule #18: Pre-mRNA-splicing factor RBM22

MacromoleculeName: Pre-mRNA-splicing factor RBM22 / type: protein_or_peptide / ID: 18 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 45.90275 kDa
SequenceString:
MSMSKSSYSQ YNRKNWEDSD FPILCETCLG NNPYMRMMKD KYGRECKICE RPFTTFRWQP GKGARYKNTE LCQTCAKVKN VCQTCMFDL EYGLPVQVRD HELQIADNIP KQGANRDFFL QNVERTLGQG DGTQPIAQIA NNMDQAAHDR LRRMGRTQPY Y KRNAPHIC SFFVKGECKR GEECPYRHEK PTDPDDPLSR QNIRDRYYGT NDPVAEKILN RAAAAPTLSP PADTTITTLY IG NLGPSGA QQVTEKDLND FFYQYGDIRC LRVLTEKGCA FIEFTTREAA ERAAERSFNK TFIKGKRLTI RWGEPQAKRA ADN SNYVTP VPSVPILPVP DGLAPSTSSQ QRFTGSMPRP PAPPTFAAPR SLVVPNVRPV KAGESSGASS SSSIYYPSQD PTRL GAKGD VIE

UniProtKB: Pre-mRNA-splicing factor RBM22

Macromolecule #19: Spliceosome-associated protein CWC15 homolog

MacromoleculeName: Spliceosome-associated protein CWC15 homolog / type: protein_or_peptide / ID: 19 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 26.154846 kDa
SequenceString:
MTTAHRPTFH PARGGTARGE GDLSKLSNQY SSKDMPSHTK MKYRQTGQET EADLRKKDLR RELEDKERNA IREKRARDSA SSSSSHSKR QRMDQIAAES AASVDADEAV DELNSSDDDD SDEDDTAALM AELEKIKKER AEEKAARDEE IKEKEEKQRM E NILAGNPL LNDTPAGSST SGGDFTVKRR WDDDVVFKNC AKGVEERKKE VTFINDAIRS EFHKKFMDKY IK

UniProtKB: Spliceosome-associated protein CWC15 homolog

Macromolecule #20: GCF C-terminal domain-containing protein

MacromoleculeName: GCF C-terminal domain-containing protein / type: protein_or_peptide / ID: 20 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 94.244891 kDa
SequenceString:
MFRKPKAKGA IRQRKSDGWD EPDAENQQVV SAIEVKQPAV PRPAMSFDAD EGADSTFKLK KDKKKVEELK RQHKLEEEAE KLYKEEKIR KEALDKIVKK EKLSKEDKKT KNERHKYLDK YRDKSAKHIS NSESYEYEEN LDIDAEAISS VSNKFNSAFE G IPDSRAVF EAKKRRERAR REGNQDGYIP LDDTQKLKSK SERNRLIRED ENDDSDEECT NKFYSARELL RTEEDRRREE QE GFLEREN GDIDEAERIK GDDDSENEEW EKQQIRKAVS RREIGQLRTE KRNTSKLFGH TVPVEDDTAM DMDIDLDMDV QVI GKPEFT GPSNTGGVVK IEDILAKLKL RIQERDEALN FRKEEKRKLE QNIEENKSMI AKIEMELPNQ STKYTMYQEL RVYS RSLLE CLNEKVGEIN SIIDKKRDCG KSRTSRLSVR RRQDMRDQHA ECMQGRNARM GEAAGRAAER DARRGRRRRE REFTL ARIN HEEGLSTDDE EPTPQSMNDQ KICDEVEAVA SVLFADALDE YSDLRKVFGR MTDWLAVDPK SFQDAYVYLC IPKLSS PYV RLQILRADFL RKETILTSMQ WFHIAMLAGS ENAEIDQSHE ILVELAPAIV EKVVIPFLID TVKEEWDPMS LRQTRHL TT FCSLFEKLPN LTEKSKQFNA FLNAIRERIC DCISEDLFMP IFMPNALEQP ICRQFHDRQF WTCIKLIKSI NALSPLIS I AARFELVVEK CVNSQCVMAL RTGSKNDVTA ERKVRGLLAE LDDSLLKMGG RTSFRQLIGT LELIAEEQSK AGRSFHKEI RKFLEKLER

UniProtKB: GCF C-terminal domain-containing protein

Macromolecule #21: Intron-binding protein aquarius

MacromoleculeName: Intron-binding protein aquarius / type: protein_or_peptide / ID: 21 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 170.397688 kDa
SequenceString:
MVTKRHQEAV VTRGAIENDT ISAVAAKFWA PFTAETHENF DAKLIDTIYD NEMLKTSFNS RKIMMLEFSQ YLEAYLWPNY VPEKASKAW NMSIVVMINE KFRERNLDSW NCFTKKSEHF PHFFKSILQL SLQEEGLASS EHCALLTFLV NAFGSVETPI V HKETRKLV SIEIWAGLLD SQREDLFKKQ KKLKKIWENV RQKMTAAAAD NNEFERTYLW NLIEKFKRVL NSLEPNEAQE SE EGEVRDP IDSIKYCERF IELLIDLESI LQTRRFFNSV LHSSHILTHC LLSSLISTDA GSLFFQLVQL LKFYARFEID DLS GRQLTH KEVSEQHYQS VTRLQKAAFR LFNETMKEFY VLNVSGVDTR RALQKQFGDM NHAEVYRFAE YLHLVPAFGE DPNH QTSLL HLYPHQHLVE TITLHCERRP NQLTQLNEKP LFPTEKVIWD ENIIPYENYT GDGVLALDKL NLQFLTLHDY LLRNF NLFQ LESTYEIRQD LEDVLFRMKP FQHESRNETV FSGWARMALQ IDHFQISEVA KPLVGEKSPA VVRGVVTVNI GRRQDI RQE WENLRKHDVC FLVACRSRKS ASGLKFDVRR PFSEQIEVLS VRGCDVEGML DQDGHLLEEF TAWEKKAKIP GDLRKFR LL LDPNQYRIDM EQGTKDDIYD TFNLIVRRDS KTNNFKAVLQ TIRDLLNTEC VVPDWLTDVI LGYGEPDSAH YSKLSSAV P ELDFNDTFLS FAHVKESFPG YKIELADGFD EKEAVPPFKL EFKELERRQD VEIKPGELRT ILVTPLTRKK VTPYSYDPR KNQVKFTPSQ VEAIKSGMQP GLTMVVGPPG TGKTDVAVQI ISNIYHNWPN QRTLIVTHSN QALNQLFEKI IALDVDERHL LRMGHGEEA LETEKDFSRY GRVNYVLKER LQLLNCVEKL AKALKIVGDV AYTCENAGYF FRFSVCRVWE EFLAKVTSKG C NKLAEGII SEIFPFTGFF KDIPDLFSGN NSADLKVAHS CWRHIEQIFE KLDEFRAFEL LRNGRDRTEY LLVKEAKIIA MT CTHAALR RNELVKLGFR YDNIVMEEAA QILEVETFIP LLLQNPQDGH NRLKRWIMIG DHHQLPPVVQ NQAFQKYSNM EQS LFARLV RLSVPNVQLD RQGRARAQIA ELYQWRYNGL GNLPHVDGLP QFQNANAGFA FPFQFIDIPD FNGHGETQPS PHFY QNLGE AEYACALYTY MRILGYPAEK ISILTTYNGQ AQLIRDVFQR RCDTNPLIGM PAKVSTVDKY QGQQNDFIIL SLVKT RNIG HIRDVRRLVV ALSRARLGLY VLGRSKVFMD CLELTPAMRI FAKYPRKLVI LPFEAHPTIR KWNERSKDGE PMEIQD TLH MTHFVHEFYM SNLPAMRDAY EQAMNEYMES QRLLNPPIDE TQMDVETEHE KKHREAMERK KKQEMDDKKE ADIHFED MD HEMQEPAATA APAPGAPAVE EPPPK

UniProtKB: Intron-binding protein aquarius

Macromolecule #22: Uncharacterized protein T27F2.1

MacromoleculeName: Uncharacterized protein T27F2.1 / type: protein_or_peptide / ID: 22 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 60.303516 kDa
SequenceString:
MSMKLRDILP APVAADEAAS QIRRDPWFGG RDNEPSAALV SKEPPPYGKR TSFRPRGPED FGDGGAFPEI HVAQFPLGLG LGDMRGKPE NTLALQYGTD GKLQHDAIAR IGHVKDKVVY SKLNDMKAKT WNEDDDDIQK PDDDAVIDAT EKTRMALEKI V NSKVASAL PVRHADKLAP AQYIRYTPSQ QNGAAGSQQR IIRMVEEQKD PMEPPKFKIN QKIPRAPPSP PAPVMHSPPR KM TAKDQND WKIPPCISNW KNPKGFTVGL DKRLAADGRG LQQTHINENF AKLADALYIA DRKAREEVET RAQLERRVAQ NKK SEQEAK MAEAAAKARQ ERSAMRRKDD EDDEQVKVRE EIRRDRLDDI RKERNIARSR PDKADKLRKE RERDISEKIV LGLP DTNQK RTGEPQFDQR LFDKTQGLDS GAMDDDTYNP YDAAWRGGDS VQQHVYRPSK NLDNDVYGGD LDKIIEQKNR FVADK GFSG AEGSSRGSGP VQFEKDQDVF GLSSLFEHTK EKKRGGDGGD SRGESKRSRR D

UniProtKB: Uncharacterized protein T27F2.1

Macromolecule #23: Peptidyl-prolyl cis-trans isomerase

MacromoleculeName: Peptidyl-prolyl cis-trans isomerase / type: protein_or_peptide / ID: 23 / Number of copies: 1 / Enantiomer: LEVO / EC number: peptidylprolyl isomerase
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 18.547002 kDa
SequenceString:
MPAPINDQAP YVILDTTMGK IALELYWNHA PRTCQNFSQL AKRNYYNGTI FHRIIADFMI QGGDPTGTGR GGASIYGDKF SDEIDERLK HTGAGILSMA NAGPNTNGSQ FFITLAPTQH LDGKHTIFGR VAAGMKVIAN MGRVDTDNHD RPKIEIRILK A YPSESSVL S

UniProtKB: Peptidyl-prolyl cis-trans isomerase

Macromolecule #24: WD_REPEATS_REGION domain-containing protein

MacromoleculeName: WD_REPEATS_REGION domain-containing protein / type: protein_or_peptide / ID: 24 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 54.766215 kDa
SequenceString:
MSASVSDPYE QMPAAPTDDD LEDKPEADKK ALLNQVFKSL KRAQDLFYHD YAQPPPMPEE NDSLIRSMKR KHEYGNVIKK VEEMKVRRE NEMLALPTSQ PMHGTGSVIA SAGTPLAITD GSGKLVNQQQ GSAKSGTLLP LVPLGNSSKG EDNTTRSLLP S KAPMMMKP KWHAPWKLYR VASGHTGWVR AVDVEPGNQW FASGGADRII KIWDLASGQL KLSLTGHISS VRAVKVSPRH PF LFSGGED KQVKCWDLEY NKVIRHYHGH LSAVQALSVH PSLDVLVTCA RDSTARVWDM RTKAQVHCFA GHTNTVADVV CQS VDPQVI TASHDATVRL WDLAAGRSMC TLTHHKKSVR ALTIHPRLNM FASASPDNIK QWKLPKGEFM QNLSGHNAII NTLS SNDDG VVVSGADNGS LCFWDWRSGF CFQKIQTKPQ PGSIESEAGI YASCFDKTGL RLITAEADKT IKMYKEDDEA TEESH PIVW RPEIVKKKAY

UniProtKB: WD_REPEATS_REGION domain-containing protein

Macromolecule #25: Septin and tuftelin-interacting protein 1 homolog

MacromoleculeName: Septin and tuftelin-interacting protein 1 homolog / type: protein_or_peptide / ID: 25 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 94.421539 kDa
SequenceString:
MEDDDGRESF EINDMDLEYA MNPGGRRRFQ NKDQATYGVF APDSDDDDDE QGTSRGPYKK RSKISAPMSF VSGGIQQGNK IDKDDPASL NLNLGGEKKP KEDDEGSIQI DFDKRTKKAP KQNGAQVFAG MRSSANHGAA DINQFGSWMR GDGNSNKIMK M MQAMGYKP GEGLGAQGQG IVEPVQAQLR KGRGAVGAYG KESTATGPKF GESAADAQKR MAQEGTSSRP TNDDQEKSGL KI KGSWKKS QTVKTKYRTI EDVMEEGMSA SRPASHQQSQ QYSNIKVIDM TGKQQKIYSG YDSFSMKTRS EYDTVDDEER TVF DVPELI HNLNLLVDLT EEGIRRSNQQ LISLKDQTTA LEYDLQQVQK SLGTEEQEAQ HIKDVYELID GFSSNRSPSM EECQ ELFRR LRSEFPHEYE LYSLETVAIP TVLPLIQKYF VAWKPLEDKN YGCELISTWR DILDDSKNGR KMTFGHNKTK GDEIR AYDR IIWEGILPSI RRACLQWDPS TQMHEMIELV EQWIPLLSAW ITENILEQLV VPKIAERVNQ WDPMTDEIPI HEWLVP WLV LLGDRIQTVM PPIRQKLSKA LKLWDPMDRS ALETLRPWQN VWSAATFSAF IAQNIVPKLG VALDTMELNP TMNPEYP EW TACMEWLEFT HPDAIANIVT KYFFPRFYNC LCLWLDSPGV DYNEVKRWYG SWKARIPQVL VNYPTVNENL RRSMIAIG R SLQGEKVGGL QATPIAPMAP PPPMAPHFTQ AAPVQKLSLK EIIEYTAGKN GFTYHPQKDR YKDGRQVFWF GALSIYLDS EMVYVMDPIE FVWRPSGLNE LIQMAQGAQG

UniProtKB: Septin and tuftelin-interacting protein 1 homolog

Macromolecule #26: WD_REPEATS_REGION domain-containing protein

MacromoleculeName: WD_REPEATS_REGION domain-containing protein / type: protein_or_peptide / ID: 26 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 65.385664 kDa
SequenceString:
MDALQAYGGS DSEHSDDDAS MDQVAKGKSS TLLERAIVTA PDVESKSAIR QVAIVDPKTK EIKSNPKFDQ LFKPESGPVN HFKSEQQRS QKNTLTGFVE PAHLNEFHFN RQIRSFDTLG YAQNPTAESG TTHFVGDVKK AEAEKGVSLF ESKKTGGEKR K RVRNDDSA DIDGYTGPWS RFIDEKTVAK PTPELQKQMD EIVKKRQEKS RRFKKEKEDS EQMAEESSTL HLKEAEDYQG RS FLVPPSF TGVNLREDYV PERCFVPKKL VHTYRGHNKG VNFLQWFPKS AHLFLSCSMD TKIKLWEVYD RQRVVRTYAG HKL PVREVA FNNEGTEFLS ASFDRYVKLW DTETGQVKQR FHTGHVPYCL KYHPDDDKNH MFLVGMQNKK IIQWDSRSGE IVQE YDRHL QAVNSITFFD KNRRFASTSD DKSVRIWEWE IPVDTKLIQN VGLHAIPTMT KSPNDKWVVG QCMDNRIVLF QLVDD KLRF SKKKAFRGHN AAGYACNIDF SPDQSFLISG DADGKLFIWD WRTHKIVGKW KAHDSTCIAA LWHPHEKSRM ITAGWD GLI KMWN

UniProtKB: WD_REPEATS_REGION domain-containing protein

Macromolecule #27: Replication stress response regulator SDE2

MacromoleculeName: Replication stress response regulator SDE2 / type: protein_or_peptide / ID: 27 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 57.466109 kDa
SequenceString:
MSATKHNERV GRRNDRKRDA SQACLPERSP SLRSEPRQQR SHPYLQTRTN THRQRPMIRP SSVVSMSGRS IAPSDSSEND DRHFDVDEE MESMSLDSPL PIEVPARPFP DKPYYWMNDY KTLEFIRTNF PSDSFYVMHN GKIIENFNAF IDENMGQSLV K YSFHLRVR GGKGGFGSLL RSFRVNKSTN KLMMRDLNGR RMASVDEEAK LKRYLEKQAR KEQELKEKRK AKLEKLTAGP AK HMFEDQD YLSRREEIIN KTEDACEAGF ALMLEMKRNS RKSKEQENKN DEDEDDDVNA EDVTDLFNDR GGRKRKIAAP SIG EDGGNK RINDESDDSE DDSENEVDPE ELEAIRQYFE AKKQDEKDNG EGTSEAVPEV LVDEQEIEPA KRPRLDSASN IDDL PKIDV KTPCEYGPIS LDDFTSAEDL ELLGLEHLKS ALNDRGLKCG GSLVERAARL WCVKGKQPRE YPKNILTPEL KKKIA EEEE AERKAAKKAK KSKKNQ

UniProtKB: Replication stress response regulator SDE2

Macromolecule #28: Coiled-coil domain-containing protein 12

MacromoleculeName: Coiled-coil domain-containing protein 12 / type: protein_or_peptide / ID: 28 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 8.135504 kDa
SequenceString:
MDLDIVQREI TEHLKDVLHE KAIDSVDLAM LAPKKIDWDL KRDIESKLQK LERRTQKAVA TIIRQRLAE

UniProtKB: Coiled-coil domain-containing protein 12
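
The sequences in this entry are displayed in space-separated blocks, and some blocks are split irregularly (for example a lone residue followed by a nine-residue block). A minimal sketch of how such a string could be normalised before further use is given below; the function name `clean_sequence` is hypothetical and not part of any Yorodumi or EMDB tooling.

```python
def clean_sequence(raw: str) -> str:
    """Return the sequence as one contiguous upper-case string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so single spaces, irregular gaps and line breaks are all removed.
    return "".join(raw.split()).upper()


# Coiled-coil domain-containing protein 12 (Macromolecule #28), as listed above.
ccdc12 = (
    "MDLDIVQREI TEHLKDVLHE KAIDSVDLAM LAPKKIDWDL "
    "KRDIESKLQK LERRTQKAVA TIIRQRLAE"
)

sequence = clean_sequence(ccdc12)
print(sequence)
print(len(sequence), "residues")
```

The same call can be applied to any of the other sequence strings in this entry; it only removes whitespace and does not validate the residue alphabet.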

Macromolecule #29: Small nuclear ribonucleoprotein Sm D3

MacromoleculeName: Small nuclear ribonucleoprotein Sm D3 / type: protein_or_peptide / ID: 29 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 14.836212 kDa
SequenceString:
MTSVGVPIKI LHEAEGHMVT LETVTGEVYR GKLSEAEDNM NCQLAETVVT FRDGRSHQLD NVFIRGNKIR FMILPDMLKN APMFKNIGR AQKGAIGMGL GGLDQRGRGR GTAFRRPMGR GGPRGMSRPG GAPTFRG

UniProtKB: Small nuclear ribonucleoprotein Sm D3

Macromolecule #30: Probable small nuclear ribonucleoprotein-associated protein B

MacromoleculeName: Probable small nuclear ribonucleoprotein-associated protein B / type: protein_or_peptide / ID: 30 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 16.768627 kDa
SequenceString:
MTISKNNKMM AHLNYRMKII LQDGRTFIGF FKAFDKHMNI LLAECEEHRQ IKPKAGKKTD GEEKRILGLV LVRGEHIVSM TVDGPPPRD DDSVRLAKAG GAGGVGQAKP GGRGMPAMPG MPGMPPGGAP GGLSGAMRGH GGPGMAAMQP GYGGPPGGRP F

UniProtKB: Probable small nuclear ribonucleoprotein-associated protein B

Macromolecule #31: Small nuclear ribonucleoprotein Sm D1

MacromoleculeName: Small nuclear ribonucleoprotein Sm D1 / type: protein_or_peptide / ID: 31 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 13.72407 kDa
SequenceString:
MKLVRFLMKL SHETVNIELK NGTQVSGTIM GVDVAMNTHL RAVSMTVKNK EPVKLDTLSI RGNNIRYIIL PDPLALDTLL IDDEPRKKA RAARAGASRG RGGRGGMRGG RGGRGRGRGG PRGAGPRR

UniProtKB: Small nuclear ribonucleoprotein Sm D1

Macromolecule #32: Probable small nuclear ribonucleoprotein Sm D2

MacromoleculeName: Probable small nuclear ribonucleoprotein Sm D2 / type: protein_or_peptide / ID: 32
Details: MSAQAKPRSEMTAEELAAKEDEEFNVGPLSILTNSVKNNHQVLINCRNNKKLLGRVKAFDRHCNMVLENVKEMWTEVPKTGKGKKKAKSVAKDRFISKMFLRGDSVILVVKNPLAQAE
Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 13.291529 kDa
SequenceString:
MSAQAKPRSE MTAEELAAKE DEEFNVGPLS ILTNSVKNNH QVLINCRNNK KLLGRVKAFD RHCNMVLENV KEMWTEVPKT GKGKKKAKS VAKDRFISKM FLRGDSVILV VKNPLAQAE

UniProtKB: Probable small nuclear ribonucleoprotein Sm D2

Macromolecule #33: Probable small nuclear ribonucleoprotein E

MacromoleculeName: Probable small nuclear ribonucleoprotein E / type: protein_or_peptide / ID: 33 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 10.625318 kDa
SequenceString:
MSTRKLNKVM VQPVNLIFRY LQNRTRVQIW LYEDVTHRLE GYIIGFDEFM NVVFDEAEEV NMKTKGRNKI GRILLKGDNI TLIHAAQQE A

UniProtKB: Probable small nuclear ribonucleoprotein E

Macromolecule #34: Probable small nuclear ribonucleoprotein F

MacromoleculeName: Probable small nuclear ribonucleoprotein F / type: protein_or_peptide / ID: 34 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 9.256534 kDa
SequenceString:
MSAVQPVNPK PFLNSLTGKF VVCKLKWGME YKGVLVAVDS YMNLQLAHAE EYIDGNSQGN LGEILIRCNN VLYVGGVDGE NETSA

UniProtKB: Probable small nuclear ribonucleoprotein F

Macromolecule #35: Probable small nuclear ribonucleoprotein G

MacromoleculeName: Probable small nuclear ribonucleoprotein G / type: protein_or_peptide / ID: 35 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 8.756209 kDa
SequenceString:
MSKTHPPELK KYMDKEMDLK LNGNRRVSGI LRGFDPFMNM VIDEAVEYQK DGGSVNLGMT VIRGNSVVIM EPKERIS

UniProtKB: Probable small nuclear ribonucleoprotein G

Macromolecule #36: Probable U2 small nuclear ribonucleoprotein A'

MacromoleculeName: Probable U2 small nuclear ribonucleoprotein A' / type: protein_or_peptide / ID: 36 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 28.9059 kDa
SequenceString:
MVRLTTELFA ERPQFVNSVN MREINLRGQK IPVIENMGVT RDQFDVIDLT DNDIRKLDNF PTFSRLNTLY LHNNRINYIA PDIATKLPN LKTLALTNNN ICELGDIEPL AECKKLEYVT FIGNPITHKD NYRMYMIYKL PTVRVIDFNR VRLTEREAAK K MFKGKSGK KARDAIQKSV HTEDPSEIEP NENSSGGGAR LTDEDREKIK EAIKNAKSLS EVNYLQSILA SGKVPEKGWN RQ MDQNGAD GEAMES

UniProtKB: Probable U2 small nuclear ribonucleoprotein A'

Macromolecule #37: RRM domain-containing protein

MacromoleculeName: RRM domain-containing protein / type: protein_or_peptide / ID: 37 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 24.881344 kDa
SequenceString:
MADINPNHTI YVNNLNEKVK KDELKRSLHM VFTQFGEIIQ LMSFRKEKMR GQAHIVFKEV SSASNALRAL QGFPFYGKPM RIQYAREDS DVISRAKGTF VEKRQKSTKI AKKPYEKPAK NGKSAAEPTQ KEPQETDGPG LPNNILFCSN IPEGTEPEQI Q TIFSQFPG LREVRWMPNT KDFAFIEYES EDLSEPARQA LDNFRITPTQ QITVKFASK

UniProtKB: RRM domain-containing protein

Macromolecule #38: Pre-mRNA-processing factor 19

MacromoleculeName: Pre-mRNA-processing factor 19 / type: protein_or_peptide / ID: 38 / Number of copies: 4 / Enantiomer: LEVO / EC number: RING-type E3 ubiquitin transferase
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 53.272633 kDa
SequenceString:
MSFVCGISGE LTEDPVVSQV SGHIFDRRLI VKFIAENGTD PISHGELSED QLVSLKSGGT GSAPRNVSGT SIPSLLKMLQ DEWDTVMLN SFSLRQQLQI ARQELSHSLY QHDAACRVIS RLSKELTAAR EALSTLKPHT SAKVDDDVSI DESEDQQGLS E AILAKLEE KSKSLTAERK QRGKNLPEGL AKTEELAELK QTASHTGIHS TGTPGITALD IKGNLSLTGG IDKTVVLYDY EK EQVMQTF KGHNKKINAV VLHPDNITAI SASADSHIRV WSATDSSSKA IIDVHQAPVT DISLNASGDY ILSASDDSYW AFS DIRSGK SLCKVSVEPG SQIAVHSIEF HPDGLIFGTG AADAVVKIWD LKNQTVAAAF PGHTAAVRSI AFSENGYYLA TGSE DGEVK LWDLRKLKNL KTFANEEKQP INSLSFDMTG TFLGIGGQKV QVLHVKSWSE VVSLSDHSGP VTGVRFGENA RSLVT CSLD KSLRVFSF

UniProtKB: Pre-mRNA-processing factor 19

Macromolecule #39: Peptidyl-prolyl cis-trans isomerase E

MacromoleculeName: Peptidyl-prolyl cis-trans isomerase E / type: protein_or_peptide / ID: 39 / Number of copies: 1 / Enantiomer: LEVO / EC number: peptidylprolyl isomerase
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 8.825046 kDa
SequenceString:
KRTLYVGGFT EDVTEKVLMA AFIPFGDVVA ISIPMDYESG KHRGFGFVEF DMAEDAAMAI DNMNESELFG KTIRVNFAR

UniProtKB: Peptidyl-prolyl cis-trans isomerase E

Macromolecule #40: MAGNESIUM ION

MacromoleculeName: MAGNESIUM ION / type: ligand / ID: 40 / Number of copies: 7 / Formula: MG
Molecular weightTheoretical: 24.305 Da

Macromolecule #41: INOSITOL HEXAKISPHOSPHATE

MacromoleculeName: INOSITOL HEXAKISPHOSPHATE / type: ligand / ID: 41 / Number of copies: 2 / Formula: IHP
Molecular weightTheoretical: 660.035 Da
Chemical component information

ChemComp-IHP:
INOSITOL HEXAKISPHOSPHATE

Macromolecule #42: GUANOSINE-5'-TRIPHOSPHATE

MacromoleculeName: GUANOSINE-5'-TRIPHOSPHATE / type: ligand / ID: 42 / Number of copies: 1 / Formula: GTP
Molecular weightTheoretical: 523.18 Da
Chemical component information

ChemComp-GTP:
GUANOSINE-5'-TRIPHOSPHATE / GTP, energy-carrying molecule

Macromolecule #43: ZINC ION

MacromoleculeName: ZINC ION / type: ligand / ID: 43 / Number of copies: 7 / Formula: ZN
Molecular weightTheoretical: 65.409 Da

Experimental details

Structure determination

Method: cryo EM
Processing: single particle reconstruction
Aggregation state: particle

Sample preparation

Buffer: pH: 7.9
Vitrification: Cryogen name: ETHANE

Electron microscopy

Microscope: FEI TITAN KRIOS
Image recording: Film or detector model: GATAN K3 (6k x 4k) / Average electron dose: 60.0 e/Å²
Electron beam: Acceleration voltage: 300 kV / Electron source: FIELD EMISSION GUN
Electron optics: Illumination mode: FLOOD BEAM / Imaging mode: BRIGHT FIELD / Nominal defocus max: 2.0 µm / Nominal defocus min: 0.75 µm
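
For orientation, the electron wavelength implied by the 300 kV acceleration voltage can be estimated with the standard relativistic de Broglie relation. The sketch below is illustrative only and is not part of the deposited metadata; the function name `electron_wavelength_angstrom` is hypothetical, and the constants are the CODATA 2018 SI values.

```python
import math

# CODATA 2018 constants (SI units).
PLANCK = 6.62607015e-34              # J s
ELECTRON_MASS = 9.1093837015e-31     # kg
ELEMENTARY_CHARGE = 1.602176634e-19  # C
LIGHT_SPEED = 299792458.0            # m/s


def electron_wavelength_angstrom(voltage_kv: float) -> float:
    """Relativistic de Broglie wavelength in angstroms for a given voltage in kV."""
    energy = ELEMENTARY_CHARGE * voltage_kv * 1e3   # kinetic energy in joules
    rest_energy = ELECTRON_MASS * LIGHT_SPEED ** 2  # electron rest energy in joules
    # Relativistic momentum: p = sqrt(2 m E (1 + E / (2 m c^2)))
    momentum = math.sqrt(
        2 * ELECTRON_MASS * energy * (1 + energy / (2 * rest_energy))
    )
    return PLANCK / momentum * 1e10  # metres to angstroms


wavelength = electron_wavelength_angstrom(300.0)
print(f"{wavelength:.4f} Å")
```

At 300 kV this gives a wavelength of roughly 0.02 Å, far smaller than the 3.0 Å resolution reported for this map.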

Image processing

Startup model: Type of model: PDB ENTRY
PDB model - PDB ID:
Final reconstruction: Resolution.type: BY AUTHOR / Resolution: 3.0 Å / Resolution method: FSC 0.143 CUT-OFF / Number images used: 247908
Initial angle assignment: Type: MAXIMUM LIKELIHOOD
Final angle assignment: Type: MAXIMUM LIKELIHOOD
