[English] 日本語
Yorodumi
- EMDB-19397: Composite map of the C. elegans Intron Lariat Spliceosome primed ... -

+
Open data


ID or keywords:

Loading...

-
Basic information

Entry
Database: EMDB / ID: EMD-19397
TitleComposite map of the C. elegans Intron Lariat Spliceosome primed for disassembly (ILS')
Map dataComposite Map
Sample
  • Complex: Intron lariat spliceosome
    • RNA: x 3 types
    • Protein or peptide: x 32 types
  • DNA: x 1 types
  • Ligand: x 4 types
KeywordsmRNA / splicing / Intorn Lariat spliceosome / ILS / pre-mRNA
Function / homology
Function and homology information


feminization of hermaphroditic germ-line / molting cycle / regulation of primary miRNA processing / SLBP independent Processing of Histone Pre-mRNAs / Formation of TC-NER Pre-Incision Complex / Dual incision in TC-NER / Gap-filling DNA repair synthesis and ligation in TC-NER / SLBP Dependent Processing of Replication-Dependent Histone Pre-mRNAs / Transport of Mature mRNA derived from an Intron-Containing Transcript / snRNP Assembly ...feminization of hermaphroditic germ-line / molting cycle / regulation of primary miRNA processing / SLBP independent Processing of Histone Pre-mRNAs / Formation of TC-NER Pre-Incision Complex / Dual incision in TC-NER / Gap-filling DNA repair synthesis and ligation in TC-NER / SLBP Dependent Processing of Replication-Dependent Histone Pre-mRNAs / Transport of Mature mRNA derived from an Intron-Containing Transcript / snRNP Assembly / Downregulation of SMAD2/3:SMAD4 transcriptional activity / mRNA Splicing - Minor Pathway / germline cell cycle switching, mitotic to meiotic cell cycle / mRNA Splicing - Major Pathway / mRNA 3'-end processing / RNA Polymerase II Transcription Termination / vulval development / nematode larval development / egg-laying behavior / post-spliceosomal complex / U2-type post-mRNA release spliceosomal complex / spliceosomal complex disassembly / snRNP binding / post-mRNA release spliceosomal complex / apoptotic DNA fragmentation / nuclear mRNA surveillance / generation of catalytic spliceosome for first transesterification step / nuclease activity / spliceosome conformational change to release U4 (or U4atac) and U1 (or U11) / U12-type spliceosomal complex / Prp19 complex / pICln-Sm protein complex / pre-mRNA binding / U2-type catalytic step 1 spliceosome / SMN-Sm protein complex / locomotion / spliceosomal tri-snRNP complex / P granule / U2-type spliceosomal complex / mRNA cis splicing, via spliceosome / commitment complex / U2-type catalytic step 2 spliceosome / embryo development ending in birth or egg hatching / U4 snRNP / U2 snRNP / U1 snRNP / U2-type prespliceosome / cyclosporin A binding / precatalytic spliceosome / generation of catalytic spliceosome for second transesterification step / spliceosomal complex assembly / germ cell development / uterus development / protein K63-linked ubiquitination / mRNA 3'-splice site recognition / spliceosomal tri-snRNP complex assembly / U5 snRNA binding / U5 snRNP / U2 snRNA binding / U6 snRNA binding / spliceosomal snRNP assembly / pre-mRNA intronic binding / U1 snRNA binding / U4/U6 x U5 tri-snRNP complex / catalytic step 2 spliceosome / RNA splicing / helicase activity / peptidylprolyl isomerase / peptidyl-prolyl cis-trans isomerase activity / RNA polymerase II transcription regulatory region sequence-specific DNA binding / spliceosomal complex / RING-type E3 ubiquitin transferase / mRNA splicing, via spliceosome / mRNA processing / ubiquitin-protein transferase activity / metallopeptidase activity / ubiquitin protein ligase activity / protein folding / regulation of gene expression / nucleic acid binding / RNA helicase activity / cell differentiation / DNA-binding transcription factor activity, RNA polymerase II-specific / RNA helicase / cell division / intracellular membrane-bounded organelle / DNA repair / GTPase activity / mRNA binding / apoptotic process / regulation of transcription by RNA polymerase II / GTP binding / positive regulation of DNA-templated transcription / ATP hydrolysis activity / DNA binding / RNA binding / nucleoplasm / ATP binding / nucleus / metal ion binding
Similarity search - Function
Intron Large complex component GCFC2-like / Tuftelin interacting protein, N-terminal domain / GCF, C-terminal / Septin and tuftelin interacting protein / TFP11/STIP/Ntr1 / GC-rich sequence DNA-binding factor-like protein / Tuftelin interacting protein N terminal / mRNA splicing factor Cwf18-like / : / cwf18 pre-mRNA splicing factor ...Intron Large complex component GCFC2-like / Tuftelin interacting protein, N-terminal domain / GCF, C-terminal / Septin and tuftelin interacting protein / TFP11/STIP/Ntr1 / GC-rich sequence DNA-binding factor-like protein / Tuftelin interacting protein N terminal / mRNA splicing factor Cwf18-like / : / cwf18 pre-mRNA splicing factor / Nineteen complex-related protein 2 / : / Pre-mRNA-splicing factor Isy1 / Pre-mRNA-splicing factor Isy1 superfamily / Isy1-like splicing family / : / : / : / Intron-binding protein aquarius, beta-barrel / Intron-binding protein aquarius insert domain / Pre-mRNA-splicing factor SPF27 / Torus domain / Breast carcinoma amplified sequence 2 (BCAS2) / Torus domain / mRNA splicing factor SYF2 / SYF2 splicing factor / CWF11 family / Intron-binding protein aquarius, N-terminal / Intron-binding protein aquarius N-terminal / Peptidyl-prolyl cis-trans isomerase E / Peptidyl-prolyl cis-trans isomerase E, RNA recognition motif / Myb-like domain profile. / Helix hairpin bin domain superfamily / G-patch domain / Cyclophilin-type peptidyl-prolyl cis-trans isomerase, cyclophilin A-like / G-patch domain profile. / Pre-mRNA-processing factor 17 / G-patch domain / glycine rich nucleic binding domain / Pre-mRNA-splicing factor 19 / Pre-mRNA-processing factor 19 / Prp19/Pso4-like / : / STL11, N-terminal / U-box domain / : / DNA2/NAM7 helicase, helicase domain / DNA2/NAM7-like helicase / AAA domain / WD repeat Prp46/PLRG1-like / BUD31/G10-related, conserved site / : / : / : / G10 protein signature 1. / G10 protein signature 2. / SKI-interacting protein SKIP, SNW domain / SKI-interacting protein, SKIP / SKIP/SNW domain / Myb-like DNA-binding domain / Pre-mRNA-splicing factor Cwf15/Cwc15 / HAT (Half-A-TPR) repeat / Cwf15/Cwc15 cell cycle control protein / Pre-mRNA-splicing factor Cwc2/Slt11 / G10 protein / Pre-mRNA-splicing factor BUD31 / Pre-mRNA splicing factor component Cdc5p/Cef1, C-terminal / pre-mRNA splicing factor component / Small nuclear ribonucleoprotein D1 / U-box domain profile. / Modified RING finger domain / U-box domain / Zinc finger, CCCH-type superfamily / Leucine-rich repeat / Brr2, N-terminal helicase PWI domain / : / N-terminal helicase PWI domain / Pre-mRNA-splicing helicase BRR2 plug domain / zinc finger / Pre-mRNA-splicing factor Syf1-like / Sec63 Brl domain / Snu114, GTP-binding domain / 116kDa U5 small nuclear ribonucleoprotein component, N-terminal / 116kDa U5 small nuclear ribonucleoprotein component, C-terminal / 116 kDa U5 small nuclear ribonucleoprotein component N-terminus / Small nuclear ribonucleoprotein Sm D3 / Sec63 domain / Small nuclear ribonucleoprotein Sm D2 / Sec63 Brl domain / Small nuclear ribonucleoprotein E / Small nuclear ribonucleoprotein G / Small nuclear ribonucleoprotein F / Sm-like protein Lsm7/SmG / Myb-type HTH DNA-binding domain profile. / Like-Sm (LSM) domain containing protein, LSm4/SmD1/SmD3 / Zinc finger, CCCH-type / Zinc finger C3H1-type profile. / Sm-like protein Lsm6/SmF / LSM domain / LSM domain, eukaryotic/archaea-type
Similarity search - Domain/homology
Small nuclear ribonucleoprotein Sm D1 / GCF C-terminal domain-containing protein / WD_REPEATS_REGION domain-containing protein / Cell division cycle 5-like protein / TPR_REGION domain-containing protein / WD_REPEATS_REGION domain-containing protein / Spliceosome-associated protein CWC15 homolog / Protein BUD31 homolog / Pre-mRNA-splicing factor 8 homolog / Probable small nuclear ribonucleoprotein F ...Small nuclear ribonucleoprotein Sm D1 / GCF C-terminal domain-containing protein / WD_REPEATS_REGION domain-containing protein / Cell division cycle 5-like protein / TPR_REGION domain-containing protein / WD_REPEATS_REGION domain-containing protein / Spliceosome-associated protein CWC15 homolog / Protein BUD31 homolog / Pre-mRNA-splicing factor 8 homolog / Probable small nuclear ribonucleoprotein F / Pre-mRNA-splicing factor SYF1 / Probable small nuclear ribonucleoprotein-associated protein B / Pre-mRNA-splicing factor syf-2 / Pre-mRNA-processing factor 19 / Small nuclear ribonucleoprotein Sm D3 / Septin and tuftelin-interacting protein 1 homolog / Peptidyl-prolyl cis-trans isomerase / Probable small nuclear ribonucleoprotein Sm D2 / WD_REPEATS_REGION domain-containing protein / Protein isy-1 / RRM domain-containing protein / Pre-mRNA-splicing factor RBM22 / Pre-mRNA-splicing factor SPF27 / Uncharacterized protein T27F2.1 / Tr-type G domain-containing protein / Coiled-coil domain-containing protein 12 / Probable U2 small nuclear ribonucleoprotein A' / Probable small nuclear ribonucleoprotein G / Intron-binding protein aquarius / U5 small nuclear ribonucleoprotein 200 kDa helicase / Peptidyl-prolyl cis-trans isomerase E / Probable small nuclear ribonucleoprotein E
Similarity search - Component
Biological speciesCaenorhabditis elegans (invertebrata)
Methodsingle particle reconstruction / cryo EM / Resolution: 2.9 Å
AuthorsVorlaender MK / Rothe P / Plaschka C
Funding supportEuropean Union, 1 items
OrganizationGrant numberCountry
European Research Council (ERC)European Union
CitationJournal: Nature / Year: 2024
Title: Mechanism for the initiation of spliceosome disassembly.
Authors: Matthias K Vorländer / Patricia Rothe / Justus Kleifeld / Eric D Cormack / Lalitha Veleti / Daria Riabov-Bassat / Laura Fin / Alex W Phillips / Luisa Cochella / Clemens Plaschka /
Abstract: Precursor-mRNA (pre-mRNA) splicing requires the assembly, remodelling and disassembly of the multi-megadalton ribonucleoprotein complex called the spliceosome. Recent studies have shed light on ...Precursor-mRNA (pre-mRNA) splicing requires the assembly, remodelling and disassembly of the multi-megadalton ribonucleoprotein complex called the spliceosome. Recent studies have shed light on spliceosome assembly and remodelling for catalysis, but the mechanism of disassembly remains unclear. Here we report cryo-electron microscopy structures of nematode and human terminal intron lariat spliceosomes along with biochemical and genetic data. Our results uncover how four disassembly factors and the conserved RNA helicase DHX15 initiate spliceosome disassembly. The disassembly factors probe large inner and outer spliceosome surfaces to detect the release of ligated mRNA. Two of these factors, TFIP11 and C19L1, and three general spliceosome subunits, SYF1, SYF2 and SDE2, then dock and activate DHX15 on the catalytic U6 snRNA to initiate disassembly. U6 therefore controls both the start and end of pre-mRNA splicing. Taken together, our results explain the molecular basis of the initiation of canonical spliceosome disassembly and provide a framework to understand general spliceosomal RNA helicase control and the discard of aberrant spliceosomes.
History
DepositionJan 11, 2024-
Header (metadata) releaseAug 7, 2024-
Map releaseAug 7, 2024-
UpdateAug 7, 2024-
Current statusAug 7, 2024Processing site: PDBe / Status: Released

-
Structure visualization

Supplemental images

Downloads & links

-
Map

FileDownload / File: emd_19397.map.gz / Format: CCP4 / Size: 343 MB / Type: IMAGE STORED AS FLOATING POINT NUMBER (4 BYTES)
AnnotationComposite Map
Projections & slices

Image control

Size
Brightness
Contrast
Others
AxesZ (Sec.)Y (Row.)X (Col.)
1.3 Å/pix.
x 448 pix.
= 582.982 Å
1.3 Å/pix.
x 448 pix.
= 582.982 Å
1.3 Å/pix.
x 448 pix.
= 582.982 Å

Surface

Projections

Slices (1/3)

Slices (1/2)

Slices (2/3)

Images are generated by Spider.

Voxel sizeX=Y=Z: 1.3013 Å
Density
Contour LevelBy AUTHOR: 0.5
Minimum - Maximum0.0 - 8.985284
Average (Standard dev.)0.00885728 (±0.093956225)
SymmetrySpace group: 1
Details

EMDB XML:

Map geometry
Axis orderXYZ
Origin000
Dimensions448448448
Spacing448448448
CellA=B=C: 582.9824 Å
α=β=γ: 90.0 °

-
Supplemental data

-
Sample components

+
Entire : Intron lariat spliceosome

EntireName: Intron lariat spliceosome
Components
  • Complex: Intron lariat spliceosome
    • RNA: U2 snRNA
    • RNA: U5 snRNA
    • RNA: U6 snRNA
    • Protein or peptide: Pre-mRNA-splicing factor 8 homolog
    • Protein or peptide: U5 small nuclear ribonucleoprotein 200 kDa helicase
    • Protein or peptide: Tr-type G domain-containing protein
    • Protein or peptide: Protein isy-1
    • Protein or peptide: WD_REPEATS_REGION domain-containing protein
    • Protein or peptide: Pre-mRNA-splicing factor SYF1
    • Protein or peptide: TPR_REGION domain-containing protein
    • Protein or peptide: Pre-mRNA-splicing factor SPF27
    • Protein or peptide: Cell division cycle 5-like protein
    • Protein or peptide: Pre-mRNA-splicing factor syf-2
    • Protein or peptide: Protein BUD31 homolog
    • Protein or peptide: Pre-mRNA-splicing factor RBM22
    • Protein or peptide: Spliceosome-associated protein CWC15 homolog
    • Protein or peptide: GCF C-terminal domain-containing protein
    • Protein or peptide: Intron-binding protein aquarius
    • Protein or peptide: Uncharacterized protein T27F2.1
    • Protein or peptide: Peptidyl-prolyl cis-trans isomerase
    • Protein or peptide: WD_REPEATS_REGION domain-containing protein
    • Protein or peptide: Septin and tuftelin-interacting protein 1 homolog
    • Protein or peptide: WD_REPEATS_REGION domain-containing protein
    • Protein or peptide: Coiled-coil domain-containing protein 12
    • Protein or peptide: Small nuclear ribonucleoprotein Sm D3
    • Protein or peptide: Probable small nuclear ribonucleoprotein-associated protein B
    • Protein or peptide: Small nuclear ribonucleoprotein Sm D1
    • Protein or peptide: Probable small nuclear ribonucleoprotein Sm D2
    • Protein or peptide: Probable small nuclear ribonucleoprotein E
    • Protein or peptide: Probable small nuclear ribonucleoprotein F
    • Protein or peptide: Probable small nuclear ribonucleoprotein G
    • Protein or peptide: Probable U2 small nuclear ribonucleoprotein A'
    • Protein or peptide: RRM domain-containing protein
    • Protein or peptide: Pre-mRNA-processing factor 19
    • Protein or peptide: Peptidyl-prolyl cis-trans isomerase E
  • DNA: Intron lariat RNA
  • Ligand: MAGNESIUM ION
  • Ligand: INOSITOL HEXAKISPHOSPHATE
  • Ligand: GUANOSINE-5'-TRIPHOSPHATE
  • Ligand: ZINC ION

+
Supramolecule #1: Intron lariat spliceosome

SupramoleculeName: Intron lariat spliceosome / type: complex / ID: 1 / Parent: 0 / Macromolecule list: #1-#35
Source (natural)Organism: Caenorhabditis elegans (invertebrata)

+
Macromolecule #1: U2 snRNA

MacromoleculeName: U2 snRNA / type: rna / ID: 1
Details: Full sequence: AUCGCUUCUUCGGCUUAUUAGCUAAGAUCAAAGUGUAGUAUCUGUUCUUAUCGUAUUAAC CUACGGUAUACACUCGAAUGAGUGUAAUAAAGGUUAUAUGAUUUUUGGAACCUAGGGAAG ACUCGGGGCUUGCUCCGACUUCCCAAGGGUCGUCCUGGCGUUGCACUGCUGCCGGGCUCGGCCCAGUCCCC
Number of copies: 1
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 56.554492 KDa
SequenceString: AUCGCUUCUU CGGCUUAUUA GCUAAGAUCA AAGUGUAGUA UCUGUUCUUA UCGUAUUAAC CUACGGUAUA CACUCGAAUG AGUGUAAUA AAGGUUAUAU GAUUUUUGGA ACCUAGGGAA GACUCGGGGC UUGCUCCGAC UUCCCAAGGG (N)(N)(N) (N)(N)(N)(N)(N) ...String:
AUCGCUUCUU CGGCUUAUUA GCUAAGAUCA AAGUGUAGUA UCUGUUCUUA UCGUAUUAAC CUACGGUAUA CACUCGAAUG AGUGUAAUA AAGGUUAUAU GAUUUUUGGA ACCUAGGGAA GACUCGGGGC UUGCUCCGAC UUCCCAAGGG (N)(N)(N) (N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)U CCCC

GENBANK: GENBANK: X51372.1, GENBANK: X51372.1

+
Macromolecule #2: U5 snRNA

MacromoleculeName: U5 snRNA / type: rna / ID: 2 / Number of copies: 1
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 38.680836 KDa
SequenceString:
AAACUCUGGU UCCUCUGCAU UUAACCGUGA AAAUCUUUCG CCUUUUACUA AAGAUUUCCG UGCAAAGGAG CAUACAUUGA GUAUUAUAU ACAAUUUUUG GAGUCCCCUC GAAAGAGCGG GA

GENBANK: GENBANK: Z69659.1

+
Macromolecule #3: U6 snRNA

MacromoleculeName: U6 snRNA / type: rna / ID: 3 / Number of copies: 1
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 32.483355 KDa
SequenceString:
GUUCUUCCGA GAACAUAUAC UAAAAUUGGA ACAAUACAGA GAAGAUUAGC AUGGCCCCUG CGCAAGGAUG ACACGCAAAU UCGUGAAGC GUUCCAAAUU UU

GENBANK: GENBANK: X51387.1

+
Macromolecule #4: Pre-mRNA-splicing factor 8 homolog

MacromoleculeName: Pre-mRNA-splicing factor 8 homolog / type: protein_or_peptide / ID: 4 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 272.396156 KDa
SequenceString: MANYGGHPQT EPHAIPDSIL EEKSRKWKQL QGKRYSEKKK FGMSDTQKEE MPPEHVRKVI RDHGDMTSRK YRHDKRVYLG ALKYMPHAV LKLLENMPMP WEQIRDVKVL YHITGAITFV NDIPRVIEPV YMAQWGTMWI MMRREKRDRR HFKRMRFPPF D DEEPPLDY ...String:
MANYGGHPQT EPHAIPDSIL EEKSRKWKQL QGKRYSEKKK FGMSDTQKEE MPPEHVRKVI RDHGDMTSRK YRHDKRVYLG ALKYMPHAV LKLLENMPMP WEQIRDVKVL YHITGAITFV NDIPRVIEPV YMAQWGTMWI MMRREKRDRR HFKRMRFPPF D DEEPPLDY ADNILDVEPL EPIQMELDPE EDGAVAEWFY DHKPLATTRF VNGPTYRKWA FSIPQMSTLY RLANQLLTDL VD DNYFYLF DMKSFFTAKA LNVAIPGGPK FEPLVKDLHT DEDWNEFNDI NKVIIRAPIR TEYRIAFPFM YNNLISSLPV QVS WYHTPS VVFIKTEDPD LPAFYYDPLI NPIVLSNLKA TEENLPEGEE EDEWELPEDV RPIFEDVPLY TDNTANGLAL LWAP RPFNL RSGRTRRAVD VPLVKSWYRE HCPAGMPVKV RVSYQKLLKV FVLNALKHRP PKPQKRRYLF RSFKATKFFQ TTTLD WVEA GLQVLRQGYN MLNLLIHRKN LNYLHLDYNF NLKPVKTLTT KERKKSRFGN AFHLCREILR LTKLVVDAHV QYRLNN VDA YQLADGLQYI FAHVGQLTGM YRYKYKLMRQ VRMCKDLKHL IYYRFNTGPV GKGPGCGFWA PGWRVWLFFL RGITPLL ER WLGNLLSRQF EGRHSKGVAK TVTKQRVESH FDLELRAAVM HDILDMMPDG IKQNKARVIL QHLSEAWRCW KANIPWKV P GLPTPVENMI LRYVKAKADW WTNSAHYNRE RVRRGATVDK TVCKKNLGRL TRLYLKSEQE RQHNYLKDGP YISAEEAVA IYTTTVHWLE SRRFSPIPFP PLSYKHDTKL LILALERLKE SYSVKNRLNQ SQREELALIE QAYDNPHEAL SRIKRHMLTQ RAFKEVGIE FMDLYTHLIP VYDIEPLEKV TDAYLDQYLW YEADKRRLFP AWVKPGDTEP PPLLTYKWCQ GLNNLQDVWE T SEGECNVI METKLEKIAE KMDLTLLNRL LRLIVDHNIA DYMTSKNNVL INYKDMNHTN SFGIIRGLQF ASFIVQFYGL VL DLLVLGL RRASEIAGPP QCPNEFLQFQ DVATEIGHPI RLYCRYIDRV WIMFRFSADE ARDLIQRYLT EHPDPNNENI VGY NNKKCW PRDARMRLMK HDVNLGRAVF WDIKNRLPRS ITTVEWENSF VSVYSKDNPN MLFDMSGFEC RILPKCRTAN EEFV HRDGV WNLQNEVTKE RTAQCFLKVD EESLSKFHNR IRQILMSSGS TTFTKIVNKW NTALIGLMTY FREAVVNTQE LLDLL VKCE NKIQTRIKIG LNSKMPSRFP PVVFYTPKEI GGLGMLSMGH VLIPQSDLRW MQQTEAGGVT HFRSGMSHDE DQLIPN LYR YIQPWEAEFV DSVRVWAEYA LKRQEANAQN RRLTLEDLDD SWDRGIPRIN TLFQKDRHTL AYDKGWRVRT EFKAYQI LK QNPFWWTHQR HDGKLWNLNN YRTDMIQALG GVEGILEHTL FRGTYFPTWE GLFWERASGF EESMKFKKLT NAQRSGLN Q IPNRRFTLWW SPTINRANVY VGFQVQLDLT GIFMHGKIPT LKISLIQIFR AHLWQKIHES VVMDLCQVFD QELDALEIQ TVQKETIHPR KSYKMNSSCA DVLLFAQYKW NVSRPSLMAD SKDVMDNTTT QKYWLDVQLR WGDYDSHDVE RYARAKFLDY TTDNMSIYP SPTGVLIAID LAYNLYSAYG NWFPGMKPLI RQAMAKIIKA NPAFYVLRER IRKGLQLYSS EPTEPYLTSQ N YGELFSNQ IIWFVDDTNV YRVTIHKTFE GNLTTKPING AIFIFNPRTG QLFLKIIHTS VWAGQKRLSQ LAKWKTAEEV AA LIRSLPV EEQPRQIIVT RKAMLDPLEV HLLDFPNIVI KGSELMLPFQ AIMKVEKFGD LILKATEPQM VLFNLYDDWL KTI SSYTAF SRVVLIMRGM HINPDKTKVI LKPDKTTITE PHHIWPTLSD DDWIKVELAL KDMILADYGK KNNVNVASLT QSEV RDIIL GMEISAPSQQ RQQIADIEKQ TKEQSQVTAT TTRTVNKHGD EIITATTSNY ETASFASRTE WRVRAISSTN LHLRT QHIY VNSDDVKDTG YTYILPKNIL KKFITISDLR TQIAGFMYGV SPPDNPQVKE IRCIVLVPQT GSHQQVNLPT QLPDHE LLR DFEPLGWMHT QPNELPQLSP QDVTTHAKLL TDNISWDGEK TVMITCSFTP GSVSLTAYKL TPSGYEWGKA NTDKGNN PK GYMPTHYEKV QMLLSDRFLG YFMVPSNGVW NYNFQGQRWS PAMKFDVCLS NPKEYYHEDH RPVHFHNFKA FDDPLGTG S ADREDAFA

UniProtKB: Pre-mRNA-splicing factor 8 homolog

+
Macromolecule #5: U5 small nuclear ribonucleoprotein 200 kDa helicase

MacromoleculeName: U5 small nuclear ribonucleoprotein 200 kDa helicase / type: protein_or_peptide / ID: 5 / Number of copies: 1 / Enantiomer: LEVO / EC number: RNA helicase
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 244.151594 KDa
SequenceString: MADELARIQQ YEYRQNSNLV LSVDYNLTDR RGREEPTGEV LPITDKEMRK MKMGDRAIKG KAPVQDQKKK RKKKDDEKAQ QFGRNVLVD NNELMGAYKP RTQETKQTYE VILSFILDAL GDVPREVLCG AADEVLLTLK NDKFRDKEKK KEVEALLGPL T DDRIAVLI ...String:
MADELARIQQ YEYRQNSNLV LSVDYNLTDR RGREEPTGEV LPITDKEMRK MKMGDRAIKG KAPVQDQKKK RKKKDDEKAQ QFGRNVLVD NNELMGAYKP RTQETKQTYE VILSFILDAL GDVPREVLCG AADEVLLTLK NDKFRDKEKK KEVEALLGPL T DDRIAVLI NLSKKISDFS IEEENKPEGD GDIYENEGVN VQFDSDEEED DGGMVNEIKG DSEEESEEEE GVDTDYTATL KG DGHLTED EQKARGILHP RDIDAHWIQR SLAKYFKDPL IAQQKQTEVI GILKNAADDR DAENQLVLLL GFDQFEFIKC LRQ NRLMIL YCTLLRQANE KERLQIEDDM RSRPELHPIL ALLQETDEGS VVQVEKSKRD AEKSKKAATA ANEAISAGQW QAGR KMLDL NDLTFSQGSH LMSNKRCELP DGSYRRQKKS YEEIHVPALK PRPFAEGEKL VSVSELPKWA QPAFDGYKSL NRIQS RLCD SALRSKEHLL LCAPTGAGKT NVALLTMLQE IGNHLAEDGS VKLDEFKIVY IAPMKSLVQE MVGSFSKRLA PFGITV GEM TGDAQMSKEQ FMATQVIVCT PEKYDVVTRK GGERAYNQMV RLLIIDEIHL LHDDRGPVLE SIVVRTIRQM EQNHDEC RL VGLSATLPNY QDVATFLRVK PEHLHFFDNS YRPVPLEQQY IGVTEKKALK RFQAMNEVVY DKIMEHAGKS QVLVFVHS R KETAKTAKAI RDACLEKDTL SAFMREGSAS TEILRTEAEQ AKNLDLKDLL PYGFAIHHAG MNRVDRTLVE DLFADRHIQ VLFSTATLAW GVNLPAHTVI IKGTQIYNPE KGRWTELGAL DIMQMLGRAG RPQYDDRGEG ILITNHSELQ YYLSLMNQQL PVESQMVSR LTDMLNAEVV LGTVSSVSEA TNWLGYTFLF VRMLKNPTLY GITHEQARAD PLLEQRRADL IHTACVLLDK A GLIKYDKR SGIIQATELG RIASHFYCTY ESMQTYNKLL VETCSDIDLF RIFSMSSEFK LLSVRDEEKL ELQKMAEHAP IP IKENLDE ASAKTNVLLQ AYISQLKLEG FALQADMVFV AQSAGRLFRA LFEIVLWRGW AGLAQKVLTL CKMVTQRQWG SLN PLHQFK KIPSEVVRSI DKKNYSFDRL YDLDQHQLGD LIKMPKMGKP LFKFIRQFPK LEMTTLIQPI TRTTMRIELT ITPD FKWDE KVHGSAEGFW IFIEDTDGEK ILHHEFFLLK QKFCSDEHVV KMIVPMFDPM PPLYYVRIVS DRWIGAETVL PISFR HLIL PEKYPPPTEL LDLQPLPISA VTNKEFQTVF AESGFKVFNP IQTQVFRTVF ESNENVIVCA PNGSGKTAIA ELAVLR HFE NTPEAKAVYI TPMEDMATKV YADWKRRLEP AIGHTIVLLT GEQTMDLKLA QRGQLIISTP ERWDNISRRW KQRKSVQ NV KLFIADDLHM IGASNGAVFE VVCSRTRYIS SQLESAVRVV ALSSSLTNAR DLGMWLGCSA SATFNFMPST RPVPLDLE I KSFNLSHNAS RFAAMERPVY QAICRHAGKL EPKPALVFVP VRRQTRPVAV ALLTMALADG APKRFLRLAE HDDTFQALL ADIEDESLRE SVSCGVGFLH EGTAPKDVHI VQQLFESNAI QVCVVPRGMC YQIEMSAYLV VVMDTQFYNG KYHVYEDYPI ADMLHMVGL ANRPILDSDA KCVVMCQTSK RAYYKKFLCD PLPVESHLDH CLHDHFNAEI VTKTIENKQD AIDYLTWTLL Y RRMTQNPN YYNLQGTTHR HLSDALSELV ELTLKDLENS KCIAVKDEMD TVSLNLGMIA SYYYISYQTI ELFSMSLKEK TK TRALIEI ISASSEFGNV PMRHKEDVIL RQLAERLPGQ LKNQKFTDPH VKVNLLIHAH LSRVKLTAEL NKDTELIVLR ACR LVQACV DVLSSNGWLS PAIHAMELSQ MLTQAMYSNE PYLKQLPHCS AALLERAKAK EVTSVFELLE LENDDRSDIL QMEG AELAD VARFCNHYPS IEVATELEND VVTSNDNLML AVSLERDNDI DGLAPPVVAP LFPQKRKEEG WWLVIGDSES NALLT IKRL VINEKSSVQL DFAAPRPGHH KFKLFFISDS YLGADQEFDV AFKVEEPGRS NRKRKHEKEE D

UniProtKB: U5 small nuclear ribonucleoprotein 200 kDa helicase

+
Macromolecule #6: Tr-type G domain-containing protein

MacromoleculeName: Tr-type G domain-containing protein / type: protein_or_peptide / ID: 6 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 110.612859 KDa
SequenceString: MDSDLYDEFG NYIGPELDSD DDAGDIDDNG DDEDRSDVDE DDEPDRMEED DAEEIPQNQV VLHEDKKYYA TALEVYGEGV ETLVQEEDA QPLTEPIVKP VSKKKFQAAE RFLPETVYKK EYLADLMDCP HIMRNVAIAG HLHHGKTTFL DCLMEQTHPE F YRAEDADA ...String:
MDSDLYDEFG NYIGPELDSD DDAGDIDDNG DDEDRSDVDE DDEPDRMEED DAEEIPQNQV VLHEDKKYYA TALEVYGEGV ETLVQEEDA QPLTEPIVKP VSKKKFQAAE RFLPETVYKK EYLADLMDCP HIMRNVAIAG HLHHGKTTFL DCLMEQTHPE F YRAEDADA RFTDILFIEK QRGCSIKSQP VSIVAQDSRS KSYLLNIIDT PGHVNFSDEM TASYRLADGV VVMVDAHEGV MM NTERAIR HAIQERLAVT LCISKIDRLL LELKLPPADA YFKLRLIIDQ VNNILSTFAE EDVPVLSPLN GNVIFSSGRY NVC FSLLSF SNIYAKQHGD SFNSKEFARR LWGDIYFEKK TRKFVKKSPS HDAPRTFVQF ILEPMYKIFS QVVGDVDTCL PDVM AELGI RLSKEEQKMN VRPLIALICK RFFGDFSAFV DLVVQNIKSP LENAKTKIEQ TYLGPADSQL AQEMQKCNAE GPLMV HTTK NYPVDDATQF HVFGRVMSGT LEANTDVRVL GENYSIQDEE DCRRMTVGRL FVRVASYQIE VSRVPAGCWV LIEGID QPI VKTATIAELG YEEDVYIFRP LKFNTRSCVK LAVEPINPSE LPKMLDGLRK VNKSYPLLTT RVEESGEHVL LGTGEFY MD CVMHDMRKVF SEIDIKVADP VVTFNETVIE TSTLKCFAET PNKKNKITMM AEPLEKQLDE DIENEVVQIG WNRRRLGE F FQTKYNWDLL AARSIWAFGP DTTGPNILLD DTLPSEVDKH LLSTVRESLV QGFQWATREG PLCEEPIRQV KFKLLDAAI ATEPLYRGGG QMIPTARRCA YSAFLMATPR LMEPYYTVEV VAPADCVAAV YTVLAKRRGH VTTDAPMPGS PMYTISAYIP VMDSFGFET DLRIHTQGQA FCMSAFHHWQ LVPGDPLDKS IVIKTLDVQP TPHLAREFMI KTRRRKGLSE DVSVNKFFDD P MLLELAKQ QDYTGF

UniProtKB: Tr-type G domain-containing protein

+
Macromolecule #7: Protein isy-1

MacromoleculeName: Protein isy-1 / type: protein_or_peptide / ID: 7 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 31.326172 KDa
SequenceString: MARNAEKAMT ALARWRRMKE EEERGPIARR PHDVKDCRNL SDAERFRREI VRDASKKITA IQNPGLGEFK LRDLNDEVNR LIKLKHAWE QRIRELGGTD YRKYAQKELD AIGRETGNSR GYKYFGAAKD LPGVRELFEK STEGEEQRRH RADLLRNIDA H YFGYLDDE ...String:
MARNAEKAMT ALARWRRMKE EEERGPIARR PHDVKDCRNL SDAERFRREI VRDASKKITA IQNPGLGEFK LRDLNDEVNR LIKLKHAWE QRIRELGGTD YRKYAQKELD AIGRETGNSR GYKYFGAAKD LPGVRELFEK STEGEEQRRH RADLLRNIDA H YFGYLDDE DGRLIPLEKL IEEKNIERIN KEFAEKQAQK QQTASDAAPE NIYKVEEDDD DDLETQESTV IGEDGRPMTI RH VLLPTQQ DIEEMLLEQK KQELMAKYLD

UniProtKB: Protein isy-1

+
Macromolecule #8: WD_REPEATS_REGION domain-containing protein

MacromoleculeName: WD_REPEATS_REGION domain-containing protein / type: protein_or_peptide / ID: 8 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 36.865559 KDa
SequenceString: MALVTSSGQQ LVSSGFPQQT AQRFSNLMAP TMVLLGHEGE IYTGAFSPDG TCLATSGYDQ KIFFWNVYGE CENFSTIKGH SGAVMDLKF TTDSSSLVSC GTDKSVRVWD METGTCARRF RTHTDFVNAV HPSRRGVTLV ASASDDGTCR VHDMRTKEPV K TYTNRYQQ ...String:
MALVTSSGQQ LVSSGFPQQT AQRFSNLMAP TMVLLGHEGE IYTGAFSPDG TCLATSGYDQ KIFFWNVYGE CENFSTIKGH SGAVMDLKF TTDSSSLVSC GTDKSVRVWD METGTCARRF RTHTDFVNAV HPSRRGVTLV ASASDDGTCR VHDMRTKEPV K TYTNRYQQ TAVTFNDSSD QVISGGIDNV LKVWDMRRDE ITYTLTGHRD TITGISLSPS GKFIISNSMD CTVRQWDIRP FV PGQRSVG VFAGHNHNFE KNLLKCSWSP CERFITAGSS DRFLYVWETL SKKIVYKLPG HMGSVNCTDF HPKEPIMLSC GSD KRVFLG EIDMS

UniProtKB: WD_REPEATS_REGION domain-containing protein

+
Macromolecule #9: Pre-mRNA-splicing factor SYF1

MacromoleculeName: Pre-mRNA-splicing factor SYF1 / type: protein_or_peptide / ID: 9 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 99.675094 KDa
SequenceString: MADKENATKI EKMPNSETMK GISSEDVPFE EDIIRNPTSV NCWQRYIDHK LQNKSPAKQM FLIYERALAV FERSYKLWYH YLKYRESTI VNKCPTDNSW RALCDTYERC LMRLHKMPRI WICYCEVMIK RGLITETRRV FDRALRSLPV TQHMRIWTLY I GFLTSHDL ...String:
MADKENATKI EKMPNSETMK GISSEDVPFE EDIIRNPTSV NCWQRYIDHK LQNKSPAKQM FLIYERALAV FERSYKLWYH YLKYRESTI VNKCPTDNSW RALCDTYERC LMRLHKMPRI WICYCEVMIK RGLITETRRV FDRALRSLPV TQHMRIWTLY I GFLTSHDL PETTIRVYRR YLKMNPKARE DYVEYLIERD QIDEAAKELT TLVNQDQNVS EKGRTAHQLW TQLCDLISKN PV KIFSLNV DAIIRQGIYR YTDQVGFLWC SLADYYIRSA EFERARDVYE EAIAKVSTVR DFAQVYDAYA AFEEREVSIM MQE VEQSGD PEEEVDLEWM FQRYQHLMER KNELMNSVLL RQNPHNVGEW LNRVNIYEGN YNKQIETFKE AVKSVNPKIQ VGKV RDLWI GLAKLYEDNG DLDAARKTFE TAVISQFGGV SELANVWCAY AEMEMKHKRA KAALTVMQRA CVVPKPGDYE NMQSV QARV HRSPILWAMY ADYEECCGTV ESCRKVYDKM IELRVASPQM IMNYAMFLEE NEYFELAFQA YEKGIALFKW PGVFDI WNT YLVKFIKRYG GKKLERARDL FEQCLENCPP THAKYIFLLY AKLEEEHGLA RHALSIYNRA CSGVDRADMH SMYNIYI KK VQEMYGIAQC RPIFERAISE LPEDKSRAMS LRYAQLETTV GEIDRARAIY AHAAEISDPK VHVKFWDTWK NFEVAHGN E ATVRDMLRVR RSVEASYNVN VTLTSVQMRV DAERKAQETT TSSNPMDSLD QQQQQPSDGA GSITQVSMNK GNISFVRGA GKTVQQNTTE NPDEIDLDED DDDEEDDGGD ADISVKVVPA QIFGNLKLAE EEEEA

UniProtKB: Pre-mRNA-splicing factor SYF1

+
Macromolecule #10: TPR_REGION domain-containing protein

MacromoleculeName: TPR_REGION domain-containing protein / type: protein_or_peptide / ID: 10 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 88.116 KDa
SequenceString: MSDDEAAVPG NKPIRLPKKA AKVKNKAPAQ LQITAEQLLR EAKERELELI PPAPKTKITD PDELKEYQRK KRKEFEDGIR KNRMQLANW IKYGKWEESI GEIQRARSVF ERALDVDHRS ISIWLQYAEM EMRCKQINHA RNVFDRAITI MPRAMQFWLK Y SYMEEVIE ...String:
MSDDEAAVPG NKPIRLPKKA AKVKNKAPAQ LQITAEQLLR EAKERELELI PPAPKTKITD PDELKEYQRK KRKEFEDGIR KNRMQLANW IKYGKWEESI GEIQRARSVF ERALDVDHRS ISIWLQYAEM EMRCKQINHA RNVFDRAITI MPRAMQFWLK Y SYMEEVIE NIPGARQIFE RWIEWEPPEQ AWQTYINFEL RYKEIDRARS VYQRFLHVHG INVQNWIKYA KFEERNGYIG NA RAAYEKA MEYFGEEDIN ETVLVAFALF EERQKEHERA RGIFKYGLDN LPSNRTEEIF KHYTQHEKKF GERVGIEDVI ISK RKTQYE KMVEENGYNY DAWFDYLRLL ENEETDREEV EDVYERAIAN IPPHSEKRYW RRYIYLWINY ALYEELVAKD FDRA RQVYK ACIDIIPHKT FTFAKVWIMF AHFEIRQLDL NAARKIMGVA IGKCPKDKLF RAYIDLELQL REFDRCRKLY EKFLE SSPE SSQTWIKFAE LETLLGDTDR SRAVFTIAVQ QPALDMPELL WKAYIDFEIA CEEHEKARDL YETLLQRTNH IKVWIS MAE FEQTIGNFEG ARKAFERANQ SLENAEKEER LMLLEAWKEC ETKSGDQEAL KRVETMMPRR VKKRRQIQTE DGVDAGW EE YFDYIFPQDQ AAKGSFKLLE AAARWKRERE EAAARAAQEL DAPIPEGDDD EEKEEAGKDA EEKVREGDSD TDLSESSS S SDSESSSSSS SDSSDSSDDD EDK

UniProtKB: TPR_REGION domain-containing protein

+
Macromolecule #11: Pre-mRNA-splicing factor SPF27

MacromoleculeName: Pre-mRNA-splicing factor SPF27 / type: protein_or_peptide / ID: 11 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 27.679885 KDa
SequenceString: MSSKPLALTG GSGSSQLQDD QVLVDALPYL DTEYNEADRQ LAMKLVEHEC KTFRPTKNYL THLPVPDYDA FLTKCMLKEM DRMKKKEEM GKLDMSRCEL PAPSAVKGVD RKLWAKVLRN AKAQNEHLLM RQINLELMDE YAAESYLQRN KVMEDLLTHA E KELRKTKE ...String:
MSSKPLALTG GSGSSQLQDD QVLVDALPYL DTEYNEADRQ LAMKLVEHEC KTFRPTKNYL THLPVPDYDA FLTKCMLKEM DRMKKKEEM GKLDMSRCEL PAPSAVKGVD RKLWAKVLRN AKAQNEHLLM RQINLELMDE YAAESYLQRN KVMEDLLTHA E KELRKTKE AVMEVHANRK MAQLKAGEKV KQLEQSWVSM VTNNYRMEME NRQIDSDNRK QIKALKLDPT KLDDKEDQEN

UniProtKB: Pre-mRNA-splicing factor SPF27

+
Macromolecule #12: Cell division cycle 5-like protein

MacromoleculeName: Cell division cycle 5-like protein / type: protein_or_peptide / ID: 12 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 85.843469 KDa
SequenceString: MVRVIIKGGV WKNTEDEILK AAIMKYGKNQ WSRIASLLHR KSAKQCKARW FEWLDPGIKK TEWSREEDEK LLHLAKLMPT QWRTIAPIV GRTSAQCLER YEHLLDEAQR KAEGLDEEAT ETRKLKPGEI DPTPETKPAR PDPIDMDDDE LEMLSEARAR L ANTQGKKA ...String:
MVRVIIKGGV WKNTEDEILK AAIMKYGKNQ WSRIASLLHR KSAKQCKARW FEWLDPGIKK TEWSREEDEK LLHLAKLMPT QWRTIAPIV GRTSAQCLER YEHLLDEAQR KAEGLDEEAT ETRKLKPGEI DPTPETKPAR PDPIDMDDDE LEMLSEARAR L ANTQGKKA KRKARERQLS DARRLASLQK RREMRAAGLA FARKFKPKRN QIDYSEEIPF EKHVPAGFHN PSEDRYVVED AN QKAIEDH QKPRGREIEM EMRREDREKL KKRKEQGEAD AVFNIKEKKR SKLVLPEPQI SDRELEQIVK IGHASDSVRQ YID GTATSG LLTDYTESAR ANAVAARTMR TPMLKDTVQL ELENLMALQN TESALKGGLN TPLHESELGK GVLPTPKVAA TPNT VLHAI AATPGTQSQF PGSTPGGFAT PAGSVAATPF RDQMRINEEI AGSALEQKAS LKRALASLPT PKNDFEVVGP DDDEV EGAV EDESNQDEDG WIEDASERAE NKAKRNAENR VRNMKMRSQV IQRSLPKPTK VNEQATRATN SSADDMVKAE MSKLLA WDV DNKPPSVIYS REELDAAADL IKQEAESGPE LNSLMWKVVE QCTSEIILSK DKFTRIAILP REEQMKALND EFQMYRG WM NQRAKRAAKV EKKLRVKLGG YQAIHDKLCK KYQEVTTEIE MANIEKKTFE RLGEHELKAI NKRVGRLQQE VTTQETRE K DLQKMYSKLS NKQWKLSQIE IHDAASTTSA PITY

UniProtKB: Cell division cycle 5-like protein

+
Macromolecule #13: Pre-mRNA-splicing factor syf-2

MacromoleculeName: Pre-mRNA-splicing factor syf-2 / type: protein_or_peptide / ID: 13 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 27.719021 KDa
SequenceString: MSSESQSSSS GPSSSGSKMK DFNQRFRDLH KLRQRARKEN HEQVVEEDRR SKLPKNHEAK KERDQWQVKE LQDRKAAEDK GLDYERVRS LEMSADVTEK LEQKRKRKKN PDQGFTSYED MTLRQHTRLT AALDPDLDSY KKMRECVGGE QFYPTADTLI H GNHYPTTA ...String:
MSSESQSSSS GPSSSGSKMK DFNQRFRDLH KLRQRARKEN HEQVVEEDRR SKLPKNHEAK KERDQWQVKE LQDRKAAEDK GLDYERVRS LEMSADVTEK LEQKRKRKKN PDQGFTSYED MTLRQHTRLT AALDPDLDSY KKMRECVGGE QFYPTADTLI H GNHYPTTA AMDKLTKDVH GQVKRREQYH RRRLYDPDAP IDYINEKNKK FNKKLDKYYG KYTEDIKDDL ERGTAI

UniProtKB: Pre-mRNA-splicing factor syf-2

+
Macromolecule #14: Protein BUD31 homolog

MacromoleculeName: Protein BUD31 homolog / type: protein_or_peptide / ID: 14 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 17.153879 KDa
SequenceString:
MSLATKLRRV RKSPPEGWDL IEPTLEQFEA KMREAETEPH EGKRKTEINW PIFRIHHQRS RYVYDMYYKK AEISRELYEF CLTAKFADA ALIAKWKKQG YENLCCVKCV NTRDSNFGTA CICRVPKSKL DAERVIECVH CGCHGCSG

UniProtKB: Protein BUD31 homolog

+
Macromolecule #15: Pre-mRNA-splicing factor RBM22

MacromoleculeName: Pre-mRNA-splicing factor RBM22 / type: protein_or_peptide / ID: 15 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 45.90275 KDa
SequenceString: MSMSKSSYSQ YNRKNWEDSD FPILCETCLG NNPYMRMMKD KYGRECKICE RPFTTFRWQP GKGARYKNTE LCQTCAKVKN VCQTCMFDL EYGLPVQVRD HELQIADNIP KQGANRDFFL QNVERTLGQG DGTQPIAQIA NNMDQAAHDR LRRMGRTQPY Y KRNAPHIC ...String:
MSMSKSSYSQ YNRKNWEDSD FPILCETCLG NNPYMRMMKD KYGRECKICE RPFTTFRWQP GKGARYKNTE LCQTCAKVKN VCQTCMFDL EYGLPVQVRD HELQIADNIP KQGANRDFFL QNVERTLGQG DGTQPIAQIA NNMDQAAHDR LRRMGRTQPY Y KRNAPHIC SFFVKGECKR GEECPYRHEK PTDPDDPLSR QNIRDRYYGT NDPVAEKILN RAAAAPTLSP PADTTITTLY IG NLGPSGA QQVTEKDLND FFYQYGDIRC LRVLTEKGCA FIEFTTREAA ERAAERSFNK TFIKGKRLTI RWGEPQAKRA ADN SNYVTP VPSVPILPVP DGLAPSTSSQ QRFTGSMPRP PAPPTFAAPR SLVVPNVRPV KAGESSGASS SSSIYYPSQD PTRL GAKGD VIE

UniProtKB: Pre-mRNA-splicing factor RBM22

+
Macromolecule #16: Spliceosome-associated protein CWC15 homolog

MacromoleculeName: Spliceosome-associated protein CWC15 homolog / type: protein_or_peptide / ID: 16 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 26.154846 KDa
SequenceString: MTTAHRPTFH PARGGTARGE GDLSKLSNQY SSKDMPSHTK MKYRQTGQET EADLRKKDLR RELEDKERNA IREKRARDSA SSSSSHSKR QRMDQIAAES AASVDADEAV DELNSSDDDD SDEDDTAALM AELEKIKKER AEEKAARDEE IKEKEEKQRM E NILAGNPL ...String:
MTTAHRPTFH PARGGTARGE GDLSKLSNQY SSKDMPSHTK MKYRQTGQET EADLRKKDLR RELEDKERNA IREKRARDSA SSSSSHSKR QRMDQIAAES AASVDADEAV DELNSSDDDD SDEDDTAALM AELEKIKKER AEEKAARDEE IKEKEEKQRM E NILAGNPL LNDTPAGSST SGGDFTVKRR WDDDVVFKNC AKGVEERKKE VTFINDAIRS EFHKKFMDKY IK

UniProtKB: Spliceosome-associated protein CWC15 homolog

+
Macromolecule #17: GCF C-terminal domain-containing protein

MacromoleculeName: GCF C-terminal domain-containing protein / type: protein_or_peptide / ID: 17 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 94.244891 KDa
SequenceString: MFRKPKAKGA IRQRKSDGWD EPDAENQQVV SAIEVKQPAV PRPAMSFDAD EGADSTFKLK KDKKKVEELK RQHKLEEEAE KLYKEEKIR KEALDKIVKK EKLSKEDKKT KNERHKYLDK YRDKSAKHIS NSESYEYEEN LDIDAEAISS VSNKFNSAFE G IPDSRAVF ...String:
MFRKPKAKGA IRQRKSDGWD EPDAENQQVV SAIEVKQPAV PRPAMSFDAD EGADSTFKLK KDKKKVEELK RQHKLEEEAE KLYKEEKIR KEALDKIVKK EKLSKEDKKT KNERHKYLDK YRDKSAKHIS NSESYEYEEN LDIDAEAISS VSNKFNSAFE G IPDSRAVF EAKKRRERAR REGNQDGYIP LDDTQKLKSK SERNRLIRED ENDDSDEECT NKFYSARELL RTEEDRRREE QE GFLEREN GDIDEAERIK GDDDSENEEW EKQQIRKAVS RREIGQLRTE KRNTSKLFGH TVPVEDDTAM DMDIDLDMDV QVI GKPEFT GPSNTGGVVK IEDILAKLKL RIQERDEALN FRKEEKRKLE QNIEENKSMI AKIEMELPNQ STKYTMYQEL RVYS RSLLE CLNEKVGEIN SIIDKKRDCG KSRTSRLSVR RRQDMRDQHA ECMQGRNARM GEAAGRAAER DARRGRRRRE REFTL ARIN HEEGLSTDDE EPTPQSMNDQ KICDEVEAVA SVLFADALDE YSDLRKVFGR MTDWLAVDPK SFQDAYVYLC IPKLSS PYV RLQILRADFL RKETILTSMQ WFHIAMLAGS ENAEIDQSHE ILVELAPAIV EKVVIPFLID TVKEEWDPMS LRQTRHL TT FCSLFEKLPN LTEKSKQFNA FLNAIRERIC DCISEDLFMP IFMPNALEQP ICRQFHDRQF WTCIKLIKSI NALSPLIS I AARFELVVEK CVNSQCVMAL RTGSKNDVTA ERKVRGLLAE LDDSLLKMGG RTSFRQLIGT LELIAEEQSK AGRSFHKEI RKFLEKLER

UniProtKB: GCF C-terminal domain-containing protein

+
Macromolecule #18: Intron-binding protein aquarius

MacromoleculeName: Intron-binding protein aquarius / type: protein_or_peptide / ID: 18 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 170.397688 KDa
SequenceString: MVTKRHQEAV VTRGAIENDT ISAVAAKFWA PFTAETHENF DAKLIDTIYD NEMLKTSFNS RKIMMLEFSQ YLEAYLWPNY VPEKASKAW NMSIVVMINE KFRERNLDSW NCFTKKSEHF PHFFKSILQL SLQEEGLASS EHCALLTFLV NAFGSVETPI V HKETRKLV ...String:
MVTKRHQEAV VTRGAIENDT ISAVAAKFWA PFTAETHENF DAKLIDTIYD NEMLKTSFNS RKIMMLEFSQ YLEAYLWPNY VPEKASKAW NMSIVVMINE KFRERNLDSW NCFTKKSEHF PHFFKSILQL SLQEEGLASS EHCALLTFLV NAFGSVETPI V HKETRKLV SIEIWAGLLD SQREDLFKKQ KKLKKIWENV RQKMTAAAAD NNEFERTYLW NLIEKFKRVL NSLEPNEAQE SE EGEVRDP IDSIKYCERF IELLIDLESI LQTRRFFNSV LHSSHILTHC LLSSLISTDA GSLFFQLVQL LKFYARFEID DLS GRQLTH KEVSEQHYQS VTRLQKAAFR LFNETMKEFY VLNVSGVDTR RALQKQFGDM NHAEVYRFAE YLHLVPAFGE DPNH QTSLL HLYPHQHLVE TITLHCERRP NQLTQLNEKP LFPTEKVIWD ENIIPYENYT GDGVLALDKL NLQFLTLHDY LLRNF NLFQ LESTYEIRQD LEDVLFRMKP FQHESRNETV FSGWARMALQ IDHFQISEVA KPLVGEKSPA VVRGVVTVNI GRRQDI RQE WENLRKHDVC FLVACRSRKS ASGLKFDVRR PFSEQIEVLS VRGCDVEGML DQDGHLLEEF TAWEKKAKIP GDLRKFR LL LDPNQYRIDM EQGTKDDIYD TFNLIVRRDS KTNNFKAVLQ TIRDLLNTEC VVPDWLTDVI LGYGEPDSAH YSKLSSAV P ELDFNDTFLS FAHVKESFPG YKIELADGFD EKEAVPPFKL EFKELERRQD VEIKPGELRT ILVTPLTRKK VTPYSYDPR KNQVKFTPSQ VEAIKSGMQP GLTMVVGPPG TGKTDVAVQI ISNIYHNWPN QRTLIVTHSN QALNQLFEKI IALDVDERHL LRMGHGEEA LETEKDFSRY GRVNYVLKER LQLLNCVEKL AKALKIVGDV AYTCENAGYF FRFSVCRVWE EFLAKVTSKG C NKLAEGII SEIFPFTGFF KDIPDLFSGN NSADLKVAHS CWRHIEQIFE KLDEFRAFEL LRNGRDRTEY LLVKEAKIIA MT CTHAALR RNELVKLGFR YDNIVMEEAA QILEVETFIP LLLQNPQDGH NRLKRWIMIG DHHQLPPVVQ NQAFQKYSNM EQS LFARLV RLSVPNVQLD RQGRARAQIA ELYQWRYNGL GNLPHVDGLP QFQNANAGFA FPFQFIDIPD FNGHGETQPS PHFY QNLGE AEYACALYTY MRILGYPAEK ISILTTYNGQ AQLIRDVFQR RCDTNPLIGM PAKVSTVDKY QGQQNDFIIL SLVKT RNIG HIRDVRRLVV ALSRARLGLY VLGRSKVFMD CLELTPAMRI FAKYPRKLVI LPFEAHPTIR KWNERSKDGE PMEIQD TLH MTHFVHEFYM SNLPAMRDAY EQAMNEYMES QRLLNPPIDE TQMDVETEHE KKHREAMERK KKQEMDDKKE ADIHFED MD HEMQEPAATA APAPGAPAVE EPPPK

UniProtKB: Intron-binding protein aquarius

+
Macromolecule #19: Uncharacterized protein T27F2.1

MacromoleculeName: Uncharacterized protein T27F2.1 / type: protein_or_peptide / ID: 19 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 60.303516 KDa
SequenceString: MSMKLRDILP APVAADEAAS QIRRDPWFGG RDNEPSAALV SKEPPPYGKR TSFRPRGPED FGDGGAFPEI HVAQFPLGLG LGDMRGKPE NTLALQYGTD GKLQHDAIAR IGHVKDKVVY SKLNDMKAKT WNEDDDDIQK PDDDAVIDAT EKTRMALEKI V NSKVASAL ...String:
MSMKLRDILP APVAADEAAS QIRRDPWFGG RDNEPSAALV SKEPPPYGKR TSFRPRGPED FGDGGAFPEI HVAQFPLGLG LGDMRGKPE NTLALQYGTD GKLQHDAIAR IGHVKDKVVY SKLNDMKAKT WNEDDDDIQK PDDDAVIDAT EKTRMALEKI V NSKVASAL PVRHADKLAP AQYIRYTPSQ QNGAAGSQQR IIRMVEEQKD PMEPPKFKIN QKIPRAPPSP PAPVMHSPPR KM TAKDQND WKIPPCISNW KNPKGFTVGL DKRLAADGRG LQQTHINENF AKLADALYIA DRKAREEVET RAQLERRVAQ NKK SEQEAK MAEAAAKARQ ERSAMRRKDD EDDEQVKVRE EIRRDRLDDI RKERNIARSR PDKADKLRKE RERDISEKIV LGLP DTNQK RTGEPQFDQR LFDKTQGLDS GAMDDDTYNP YDAAWRGGDS VQQHVYRPSK NLDNDVYGGD LDKIIEQKNR FVADK GFSG AEGSSRGSGP VQFEKDQDVF GLSSLFEHTK EKKRGGDGGD SRGESKRSRR D

UniProtKB: Uncharacterized protein T27F2.1

+
Macromolecule #20: Peptidyl-prolyl cis-trans isomerase

MacromoleculeName: Peptidyl-prolyl cis-trans isomerase / type: protein_or_peptide / ID: 20 / Number of copies: 1 / Enantiomer: LEVO / EC number: peptidylprolyl isomerase
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 18.547002 KDa
SequenceString:
MPAPINDQAP YVILDTTMGK IALELYWNHA PRTCQNFSQL AKRNYYNGTI FHRIIADFMI QGGDPTGTGR GGASIYGDKF SDEIDERLK HTGAGILSMA NAGPNTNGSQ FFITLAPTQH LDGKHTIFGR VAAGMKVIAN MGRVDTDNHD RPKIEIRILK A YPSESSVL S

UniProtKB: Peptidyl-prolyl cis-trans isomerase

+
Macromolecule #21: WD_REPEATS_REGION domain-containing protein

MacromoleculeName: WD_REPEATS_REGION domain-containing protein / type: protein_or_peptide / ID: 21 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 54.766215 KDa
SequenceString: MSASVSDPYE QMPAAPTDDD LEDKPEADKK ALLNQVFKSL KRAQDLFYHD YAQPPPMPEE NDSLIRSMKR KHEYGNVIKK VEEMKVRRE NEMLALPTSQ PMHGTGSVIA SAGTPLAITD GSGKLVNQQQ GSAKSGTLLP LVPLGNSSKG EDNTTRSLLP S KAPMMMKP ...String:
MSASVSDPYE QMPAAPTDDD LEDKPEADKK ALLNQVFKSL KRAQDLFYHD YAQPPPMPEE NDSLIRSMKR KHEYGNVIKK VEEMKVRRE NEMLALPTSQ PMHGTGSVIA SAGTPLAITD GSGKLVNQQQ GSAKSGTLLP LVPLGNSSKG EDNTTRSLLP S KAPMMMKP KWHAPWKLYR VASGHTGWVR AVDVEPGNQW FASGGADRII KIWDLASGQL KLSLTGHISS VRAVKVSPRH PF LFSGGED KQVKCWDLEY NKVIRHYHGH LSAVQALSVH PSLDVLVTCA RDSTARVWDM RTKAQVHCFA GHTNTVADVV CQS VDPQVI TASHDATVRL WDLAAGRSMC TLTHHKKSVR ALTIHPRLNM FASASPDNIK QWKLPKGEFM QNLSGHNAII NTLS SNDDG VVVSGADNGS LCFWDWRSGF CFQKIQTKPQ PGSIESEAGI YASCFDKTGL RLITAEADKT IKMYKEDDEA TEESH PIVW RPEIVKKKAY

UniProtKB: WD_REPEATS_REGION domain-containing protein

+
Macromolecule #22: Septin and tuftelin-interacting protein 1 homolog

MacromoleculeName: Septin and tuftelin-interacting protein 1 homolog / type: protein_or_peptide / ID: 22 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 94.421539 KDa
SequenceString: MEDDDGRESF EINDMDLEYA MNPGGRRRFQ NKDQATYGVF APDSDDDDDE QGTSRGPYKK RSKISAPMSF VSGGIQQGNK IDKDDPASL NLNLGGEKKP KEDDEGSIQI DFDKRTKKAP KQNGAQVFAG MRSSANHGAA DINQFGSWMR GDGNSNKIMK M MQAMGYKP ...String:
MEDDDGRESF EINDMDLEYA MNPGGRRRFQ NKDQATYGVF APDSDDDDDE QGTSRGPYKK RSKISAPMSF VSGGIQQGNK IDKDDPASL NLNLGGEKKP KEDDEGSIQI DFDKRTKKAP KQNGAQVFAG MRSSANHGAA DINQFGSWMR GDGNSNKIMK M MQAMGYKP GEGLGAQGQG IVEPVQAQLR KGRGAVGAYG KESTATGPKF GESAADAQKR MAQEGTSSRP TNDDQEKSGL KI KGSWKKS QTVKTKYRTI EDVMEEGMSA SRPASHQQSQ QYSNIKVIDM TGKQQKIYSG YDSFSMKTRS EYDTVDDEER TVF DVPELI HNLNLLVDLT EEGIRRSNQQ LISLKDQTTA LEYDLQQVQK SLGTEEQEAQ HIKDVYELID GFSSNRSPSM EECQ ELFRR LRSEFPHEYE LYSLETVAIP TVLPLIQKYF VAWKPLEDKN YGCELISTWR DILDDSKNGR KMTFGHNKTK GDEIR AYDR IIWEGILPSI RRACLQWDPS TQMHEMIELV EQWIPLLSAW ITENILEQLV VPKIAERVNQ WDPMTDEIPI HEWLVP WLV LLGDRIQTVM PPIRQKLSKA LKLWDPMDRS ALETLRPWQN VWSAATFSAF IAQNIVPKLG VALDTMELNP TMNPEYP EW TACMEWLEFT HPDAIANIVT KYFFPRFYNC LCLWLDSPGV DYNEVKRWYG SWKARIPQVL VNYPTVNENL RRSMIAIG R SLQGEKVGGL QATPIAPMAP PPPMAPHFTQ AAPVQKLSLK EIIEYTAGKN GFTYHPQKDR YKDGRQVFWF GALSIYLDS EMVYVMDPIE FVWRPSGLNE LIQMAQGAQG

UniProtKB: Septin and tuftelin-interacting protein 1 homolog

+
Macromolecule #23: WD_REPEATS_REGION domain-containing protein

MacromoleculeName: WD_REPEATS_REGION domain-containing protein / type: protein_or_peptide / ID: 23 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 65.385664 KDa
SequenceString: MDALQAYGGS DSEHSDDDAS MDQVAKGKSS TLLERAIVTA PDVESKSAIR QVAIVDPKTK EIKSNPKFDQ LFKPESGPVN HFKSEQQRS QKNTLTGFVE PAHLNEFHFN RQIRSFDTLG YAQNPTAESG TTHFVGDVKK AEAEKGVSLF ESKKTGGEKR K RVRNDDSA ...String:
MDALQAYGGS DSEHSDDDAS MDQVAKGKSS TLLERAIVTA PDVESKSAIR QVAIVDPKTK EIKSNPKFDQ LFKPESGPVN HFKSEQQRS QKNTLTGFVE PAHLNEFHFN RQIRSFDTLG YAQNPTAESG TTHFVGDVKK AEAEKGVSLF ESKKTGGEKR K RVRNDDSA DIDGYTGPWS RFIDEKTVAK PTPELQKQMD EIVKKRQEKS RRFKKEKEDS EQMAEESSTL HLKEAEDYQG RS FLVPPSF TGVNLREDYV PERCFVPKKL VHTYRGHNKG VNFLQWFPKS AHLFLSCSMD TKIKLWEVYD RQRVVRTYAG HKL PVREVA FNNEGTEFLS ASFDRYVKLW DTETGQVKQR FHTGHVPYCL KYHPDDDKNH MFLVGMQNKK IIQWDSRSGE IVQE YDRHL QAVNSITFFD KNRRFASTSD DKSVRIWEWE IPVDTKLIQN VGLHAIPTMT KSPNDKWVVG QCMDNRIVLF QLVDD KLRF SKKKAFRGHN AAGYACNIDF SPDQSFLISG DADGKLFIWD WRTHKIVGKW KAHDSTCIAA LWHPHEKSRM ITAGWD GLI KMWN

UniProtKB: WD_REPEATS_REGION domain-containing protein

+
Macromolecule #24: Coiled-coil domain-containing protein 12

MacromoleculeName: Coiled-coil domain-containing protein 12 / type: protein_or_peptide / ID: 24 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 19.138486 KDa
SequenceString:
MTDKNNSDSD EDIESLTNHE TSLEAAAKAR KRRLLAMKSK IHGIEMQEED YDEGETSTKK SREVGREFRN HKPDDAVGTQ NVDMDLDIV QREITEHLKD VLHEKAIDSV DLAMLAPKKI DWDLKRDIES KLQKLERRTQ KAVATIIRQR LAEGKGDLAA T VNAAAAQN L

UniProtKB: Coiled-coil domain-containing protein 12

+
Macromolecule #25: Small nuclear ribonucleoprotein Sm D3

MacromoleculeName: Small nuclear ribonucleoprotein Sm D3 / type: protein_or_peptide / ID: 25 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 14.836212 KDa
SequenceString:
MTSVGVPIKI LHEAEGHMVT LETVTGEVYR GKLSEAEDNM NCQLAETVVT FRDGRSHQLD NVFIRGNKIR FMILPDMLKN APMFKNIGR AQKGAIGMGL GGLDQRGRGR GTAFRRPMGR GGPRGMSRPG GAPTFRG

UniProtKB: Small nuclear ribonucleoprotein Sm D3

+
Macromolecule #26: Probable small nuclear ribonucleoprotein-associated protein B

MacromoleculeName: Probable small nuclear ribonucleoprotein-associated protein B
type: protein_or_peptide / ID: 26 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 16.768627 KDa
SequenceString:
MTISKNNKMM AHLNYRMKII LQDGRTFIGF FKAFDKHMNI LLAECEEHRQ IKPKAGKKTD GEEKRILGLV LVRGEHIVSM TVDGPPPRD DDSVRLAKAG GAGGVGQAKP GGRGMPAMPG MPGMPPGGAP GGLSGAMRGH GGPGMAAMQP GYGGPPGGRP F

UniProtKB: Probable small nuclear ribonucleoprotein-associated protein B

+
Macromolecule #27: Small nuclear ribonucleoprotein Sm D1

MacromoleculeName: Small nuclear ribonucleoprotein Sm D1 / type: protein_or_peptide / ID: 27 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 13.72407 KDa
SequenceString:
MKLVRFLMKL SHETVNIELK NGTQVSGTIM GVDVAMNTHL RAVSMTVKNK EPVKLDTLSI RGNNIRYIIL PDPLALDTLL IDDEPRKKA RAARAGASRG RGGRGGMRGG RGGRGRGRGG PRGAGPRR

UniProtKB: Small nuclear ribonucleoprotein Sm D1

+
Macromolecule #28: Probable small nuclear ribonucleoprotein Sm D2

MacromoleculeName: Probable small nuclear ribonucleoprotein Sm D2 / type: protein_or_peptide / ID: 28 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 13.291529 KDa
SequenceString:
MSAQAKPRSE MTAEELAAKE DEEFNVGPLS ILTNSVKNNH QVLINCRNNK KLLGRVKAFD RHCNMVLENV KEMWTEVPKT GKGKKKAKS VAKDRFISKM FLRGDSVILV VKNPLAQAE

UniProtKB: Probable small nuclear ribonucleoprotein Sm D2

+
Macromolecule #29: Probable small nuclear ribonucleoprotein E

MacromoleculeName: Probable small nuclear ribonucleoprotein E / type: protein_or_peptide / ID: 29 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 10.625318 KDa
SequenceString:
MSTRKLNKVM VQPVNLIFRY LQNRTRVQIW LYEDVTHRLE GYIIGFDEFM NVVFDEAEEV NMKTKGRNKI GRILLKGDNI TLIHAAQQE A

UniProtKB: Probable small nuclear ribonucleoprotein E

+
Macromolecule #30: Probable small nuclear ribonucleoprotein F

MacromoleculeName: Probable small nuclear ribonucleoprotein F / type: protein_or_peptide / ID: 30 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 9.256534 KDa
SequenceString:
MSAVQPVNPK PFLNSLTGKF VVCKLKWGME YKGVLVAVDS YMNLQLAHAE EYIDGNSQGN LGEILIRCNN VLYVGGVDGE NETSA

UniProtKB: Probable small nuclear ribonucleoprotein F

+
Macromolecule #31: Probable small nuclear ribonucleoprotein G

MacromoleculeName: Probable small nuclear ribonucleoprotein G / type: protein_or_peptide / ID: 31 / Number of copies: 2 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 8.756209 KDa
SequenceString:
MSKTHPPELK KYMDKEMDLK LNGNRRVSGI LRGFDPFMNM VIDEAVEYQK DGGSVNLGMT VIRGNSVVIM EPKERIS

UniProtKB: Probable small nuclear ribonucleoprotein G

+
Macromolecule #32: Probable U2 small nuclear ribonucleoprotein A'

MacromoleculeName: Probable U2 small nuclear ribonucleoprotein A' / type: protein_or_peptide / ID: 32 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 28.9059 KDa
SequenceString: MVRLTTELFA ERPQFVNSVN MREINLRGQK IPVIENMGVT RDQFDVIDLT DNDIRKLDNF PTFSRLNTLY LHNNRINYIA PDIATKLPN LKTLALTNNN ICELGDIEPL AECKKLEYVT FIGNPITHKD NYRMYMIYKL PTVRVIDFNR VRLTEREAAK K MFKGKSGK ...String:
MVRLTTELFA ERPQFVNSVN MREINLRGQK IPVIENMGVT RDQFDVIDLT DNDIRKLDNF PTFSRLNTLY LHNNRINYIA PDIATKLPN LKTLALTNNN ICELGDIEPL AECKKLEYVT FIGNPITHKD NYRMYMIYKL PTVRVIDFNR VRLTEREAAK K MFKGKSGK KARDAIQKSV HTEDPSEIEP NENSSGGGAR LTDEDREKIK EAIKNAKSLS EVNYLQSILA SGKVPEKGWN RQ MDQNGAD GEAMES

UniProtKB: Probable U2 small nuclear ribonucleoprotein A'

+
Macromolecule #33: RRM domain-containing protein

MacromoleculeName: RRM domain-containing protein / type: protein_or_peptide / ID: 33 / Number of copies: 1 / Enantiomer: LEVO
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 24.881344 KDa
SequenceString: MADINPNHTI YVNNLNEKVK KDELKRSLHM VFTQFGEIIQ LMSFRKEKMR GQAHIVFKEV SSASNALRAL QGFPFYGKPM RIQYAREDS DVISRAKGTF VEKRQKSTKI AKKPYEKPAK NGKSAAEPTQ KEPQETDGPG LPNNILFCSN IPEGTEPEQI Q TIFSQFPG ...String:
MADINPNHTI YVNNLNEKVK KDELKRSLHM VFTQFGEIIQ LMSFRKEKMR GQAHIVFKEV SSASNALRAL QGFPFYGKPM RIQYAREDS DVISRAKGTF VEKRQKSTKI AKKPYEKPAK NGKSAAEPTQ KEPQETDGPG LPNNILFCSN IPEGTEPEQI Q TIFSQFPG LREVRWMPNT KDFAFIEYES EDLSEPARQA LDNFRITPTQ QITVKFASK

UniProtKB: RRM domain-containing protein

+
Macromolecule #34: Pre-mRNA-processing factor 19

MacromoleculeName: Pre-mRNA-processing factor 19 / type: protein_or_peptide / ID: 34 / Number of copies: 4 / Enantiomer: LEVO / EC number: RING-type E3 ubiquitin transferase
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 53.272633 KDa
SequenceString: MSFVCGISGE LTEDPVVSQV SGHIFDRRLI VKFIAENGTD PISHGELSED QLVSLKSGGT GSAPRNVSGT SIPSLLKMLQ DEWDTVMLN SFSLRQQLQI ARQELSHSLY QHDAACRVIS RLSKELTAAR EALSTLKPHT SAKVDDDVSI DESEDQQGLS E AILAKLEE ...String:
MSFVCGISGE LTEDPVVSQV SGHIFDRRLI VKFIAENGTD PISHGELSED QLVSLKSGGT GSAPRNVSGT SIPSLLKMLQ DEWDTVMLN SFSLRQQLQI ARQELSHSLY QHDAACRVIS RLSKELTAAR EALSTLKPHT SAKVDDDVSI DESEDQQGLS E AILAKLEE KSKSLTAERK QRGKNLPEGL AKTEELAELK QTASHTGIHS TGTPGITALD IKGNLSLTGG IDKTVVLYDY EK EQVMQTF KGHNKKINAV VLHPDNITAI SASADSHIRV WSATDSSSKA IIDVHQAPVT DISLNASGDY ILSASDDSYW AFS DIRSGK SLCKVSVEPG SQIAVHSIEF HPDGLIFGTG AADAVVKIWD LKNQTVAAAF PGHTAAVRSI AFSENGYYLA TGSE DGEVK LWDLRKLKNL KTFANEEKQP INSLSFDMTG TFLGIGGQKV QVLHVKSWSE VVSLSDHSGP VTGVRFGENA RSLVT CSLD KSLRVFSF

UniProtKB: Pre-mRNA-processing factor 19

+
Macromolecule #35: Peptidyl-prolyl cis-trans isomerase E

MacromoleculeName: Peptidyl-prolyl cis-trans isomerase E / type: protein_or_peptide / ID: 35 / Number of copies: 1 / Enantiomer: LEVO / EC number: peptidylprolyl isomerase
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 36.469277 KDa
SequenceString: MNTNFPHNRK RTLYVGGFTE DVTEKVLMAA FIPFGDVVAI SIPMDYESGK HRGFGFVEFD MAEDAAMAID NMNESELFGK TIRVNFARP PKATERSQKP VWADDEWLKK YGRGGEAAAE EDGDAEKAAT SSSSASTKLP RVYLGVKIGI RYIGRIVIEL R TDVTPKTA ...String:
MNTNFPHNRK RTLYVGGFTE DVTEKVLMAA FIPFGDVVAI SIPMDYESGK HRGFGFVEFD MAEDAAMAID NMNESELFGK TIRVNFARP PKATERSQKP VWADDEWLKK YGRGGEAAAE EDGDAEKAAT SSSSASTKLP RVYLGVKIGI RYIGRIVIEL R TDVTPKTA ENFRCLCTGE RGFGYEGSIF HRIIPKFMLQ GGDFTKGDGT GGKSIYGTKF DDENFTLRHT MPGTVSMANC GA NTNGSQF FICTEKTDWL DGKHVVFGHV VEGMNIVRQV EQQGTPSGKP QMVVKIVESG EIEPEKRIAA EKLAQKAVVP GAE IQEPLP QAMET

UniProtKB: Peptidyl-prolyl cis-trans isomerase E

+
Macromolecule #36: Intron lariat RNA

MacromoleculeName: Intron lariat RNA / type: dna / ID: 36 / Number of copies: 1 / Classification: DNA
Source (natural)Organism: Caenorhabditis elegans (invertebrata)
Molecular weightTheoretical: 3.680845 KDa
SequenceString:
(N)(N)(N)(N)(N)(N)(N)(N)(N)(N) (N)(N)(N)(N)(N)(N)(N)(N)(N)

+
Macromolecule #37: MAGNESIUM ION

MacromoleculeName: MAGNESIUM ION / type: ligand / ID: 37 / Number of copies: 7 / Formula: MG
Molecular weightTheoretical: 24.305 Da

+
Macromolecule #38: INOSITOL HEXAKISPHOSPHATE

MacromoleculeName: INOSITOL HEXAKISPHOSPHATE / type: ligand / ID: 38 / Number of copies: 2 / Formula: IHP
Molecular weightTheoretical: 660.035 Da
Chemical component information

ChemComp-IHP:
INOSITOL HEXAKISPHOSPHATE

+
Macromolecule #39: GUANOSINE-5'-TRIPHOSPHATE

MacromoleculeName: GUANOSINE-5'-TRIPHOSPHATE / type: ligand / ID: 39 / Number of copies: 1 / Formula: GTP
Molecular weightTheoretical: 523.18 Da
Chemical component information

ChemComp-GTP:
GUANOSINE-5'-TRIPHOSPHATE / GTP, energy-carrying molecule*YM

+
Macromolecule #40: ZINC ION

MacromoleculeName: ZINC ION / type: ligand / ID: 40 / Number of copies: 6 / Formula: ZN
Molecular weightTheoretical: 65.409 Da

-
Experimental details

-
Structure determination

Methodcryo EM
Processingsingle particle reconstruction
Aggregation stateparticle

-
Sample preparation

BufferpH: 7.9
VitrificationCryogen name: ETHANE
DetailsCrosslinked with glutaraledhyde

-
Electron microscopy

MicroscopeFEI TITAN KRIOS
Image recordingFilm or detector model: GATAN K3 BIOQUANTUM (6k x 4k) / Average electron dose: 60.0 e/Å2
Electron beamAcceleration voltage: 300 kV / Electron source: FIELD EMISSION GUN
Electron opticsIllumination mode: FLOOD BEAM / Imaging mode: BRIGHT FIELD / Nominal defocus max: 2.0 µm / Nominal defocus min: 0.7000000000000001 µm
Experimental equipment
Model: Titan Krios / Image courtesy: FEI Company

-
Image processing

Startup modelType of model: PDB ENTRY
PDB model - PDB ID:
Final reconstructionResolution.type: BY AUTHOR / Resolution: 2.9 Å / Resolution method: FSC 0.143 CUT-OFF / Number images used: 879523
Initial angle assignmentType: MAXIMUM LIKELIHOOD
Final angle assignmentType: MAXIMUM LIKELIHOOD

-
Atomic model buiding 1

Initial modelChain - Source name: AlphaFold / Chain - Initial model type: in silico model
RefinementProtocol: AB INITIO MODEL
Output model

PDB-8ro0:
Structure of the C. elegans Intron Lariat Spliceosome primed for disassembly (ILS')

+
About Yorodumi

-
News

-
Feb 9, 2022. New format data for meta-information of EMDB entries

New format data for meta-information of EMDB entries

  • Version 3 of the EMDB header file is now the official format.
  • The previous official version 1.9 will be removed from the archive.

Related info.:EMDB header

External links:wwPDB to switch to version 3 of the EMDB data model

-
Aug 12, 2020. Covid-19 info

Covid-19 info

URL: https://pdbjlvh1.pdbj.org/emnavi/covid19.php

New page: Covid-19 featured information page in EM Navigator.

Related info.:Covid-19 info / Mar 5, 2020. Novel coronavirus structure data

+
Mar 5, 2020. Novel coronavirus structure data

Novel coronavirus structure data

Related info.:Yorodumi Speices / Aug 12, 2020. Covid-19 info

External links:COVID-19 featured content - PDBj / Molecule of the Month (242):Coronavirus Proteases

+
Jan 31, 2019. EMDB accession codes are about to change! (news from PDBe EMDB page)

EMDB accession codes are about to change! (news from PDBe EMDB page)

  • The allocation of 4 digits for EMDB accession codes will soon come to an end. Whilst these codes will remain in use, new EMDB accession codes will include an additional digit and will expand incrementally as the available range of codes is exhausted. The current 4-digit format prefixed with “EMD-” (i.e. EMD-XXXX) will advance to a 5-digit format (i.e. EMD-XXXXX), and so on. It is currently estimated that the 4-digit codes will be depleted around Spring 2019, at which point the 5-digit format will come into force.
  • The EM Navigator/Yorodumi systems omit the EMD- prefix.

Related info.:Q: What is EMD? / ID/Accession-code notation in Yorodumi/EM Navigator

External links:EMDB Accession Codes are Changing Soon! / Contact to PDBj

+
Jul 12, 2017. Major update of PDB

Major update of PDB

  • wwPDB released updated PDB data conforming to the new PDBx/mmCIF dictionary.
  • This is a major update changing the version number from 4 to 5, and with Remediation, in which all the entries are updated.
  • In this update, many items about electron microscopy experimental information are reorganized (e.g. em_software).
  • Now, EM Navigator and Yorodumi are based on the updated data.

External links:wwPDB Remediation / Enriched Model Files Conforming to OneDep Data Standards Now Available in the PDB FTP Archive

-
Yorodumi

Thousand views of thousand structures

  • Yorodumi is a browser for structure data from EMDB, PDB, SASBDB, etc.
  • This page is also the successor to EM Navigator detail page, and also detail information page/front-end page for Omokage search.
  • The word "yorodu" (or yorozu) is an old Japanese word meaning "ten thousand". "mi" (miru) is to see.

Related info.:EMDB / PDB / SASBDB / Comparison of 3 databanks / Yorodumi Search / Aug 31, 2016. New EM Navigator & Yorodumi / Yorodumi Papers / Jmol/JSmol / Function and homology information / Changes in new EM Navigator and Yorodumi

Read more