Yorodumi Papers
- Database of articles cited by EMDB/PDB/SASBDB data -

Structure paper

Title: Discovery of CRISPR-Cas12a clades using a large language model.
Journal, issue, pages: Nat Commun, Vol. 16, Issue 1, Page 7877, Year 2025
Publish date: Aug 23, 2025
Authors: Yuanyuan Feng / Junchao Shi / Zhanwei Li / Yongqian Li / Jiaxi Yang / Shisheng Huang / Jinfang Zheng / Wei Han / Yunbo Qiao / Jun Zhang / Qi Liu / Yao Yang / Chunyi Hu / Lina Wu / Xiaokang Zhang / Jin Tang / Xingxu Huang / Peixiang Ma
PubMed Abstract: CRISPR-Cas systems revolutionize life science. Metagenomes contain millions of unknown Cas proteins. Traditional mining relies on protein sequence alignments. In this work, we employ an evolutionary scale language model (ESM) to learn the information beyond sequences. Trained with CRISPR-Cas data, ESM accurately identifies Cas proteins without alignment. Limited experimental data restricts feature prediction, but integrating with machine learning enables trans-cleavage activity prediction of uncharacterized Cas12a. We discover 7 undocumented Cas12a subtypes with unique CRISPR loci. Structural analyses reveal 8 subtypes of Cas1, Cas2, and Cas4. Cas12a subtypes display distinct 3D-folds. CryoEM analyses unveil unique RNA interactions with the uncharacterized Cas12a. These proteins show distinct double-strand and single-strand DNA cleavage preferences and broad PAM recognition. Finally, we establish a specific detection strategy for the oncogene SNP without traditional Cas12a PAM. This study highlights the potential of language models in exploring undocumented Cas protein function via gene cluster classification.
External links: Nat Commun / PubMed:40849498 / PubMed Central
Methods: EM (single particle)
Resolution: 2.9 Å
Structure data

EMDB-37219, PDB-8kgf:
Structure of AmCas12a with crRNA
Method: EM (single particle) / Resolution: 2.9 Å

Chemicals

ChemComp-MG:
Unknown entry

Source
  • Anaeroglobus (bacteria)
Keywords: DNA BINDING PROTEIN/RNA / CRISPR/Cas12a / DNA binding protein / DNA BINDING PROTEIN-RNA complex
