[English] 日本語

- EMDB-50040: Scalable protein design using hallucination in a relaxed sequence... -
+
Open data
-
Basic information
Entry | ![]() | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Title | Scalable protein design using hallucination in a relaxed sequence space | |||||||||
![]() | Main unsharpened map | |||||||||
![]() |
| |||||||||
![]() | De novo designed protein K12 / DE NOVO PROTEIN | |||||||||
Biological species | synthetic construct (others) | |||||||||
Method | single particle reconstruction / cryo EM / Resolution: 3.36 Å | |||||||||
![]() | Frank CJ / Dietz H | |||||||||
Funding support | European Union, ![]()
| |||||||||
![]() | ![]() Title: Scalable protein design using optimization in a relaxed sequence space. Authors: Christopher Frank / Ali Khoshouei / Lara Fuβ / Dominik Schiwietz / Dominik Putz / Lara Weber / Zhixuan Zhao / Motoyuki Hattori / Shihao Feng / Yosta de Stigter / Sergey Ovchinnikov / Hendrik Dietz / ![]() ![]() ![]() Abstract: Machine learning (ML)-based design approaches have advanced the field of de novo protein design, with diffusion-based generative methods increasingly dominating protein design pipelines. Here, we ...Machine learning (ML)-based design approaches have advanced the field of de novo protein design, with diffusion-based generative methods increasingly dominating protein design pipelines. Here, we report a "hallucination"-based protein design approach that functions in relaxed sequence space, enabling the efficient design of high-quality protein backbones over multiple scales and with broad scope of application without the need for any form of retraining. We experimentally produced and characterized more than 100 proteins. Three high-resolution crystal structures and two cryo-electron microscopy density maps of designed single-chain proteins comprising up to 1000 amino acids validate the accuracy of the method. Our pipeline can also be used to design synthetic protein-protein interactions, as validated experimentally by a set of protein heterodimers. Relaxed sequence optimization offers attractive performance with respect to designability, scope of applicability for different design problems, and scalability across protein sizes. | |||||||||
History |
|
-
Structure visualization
Supplemental images |
---|
-
Downloads & links
-EMDB archive
Map data | ![]() | 18.4 MB | ![]() | |
---|---|---|---|---|
Header (meta data) | ![]() ![]() | 17.9 KB 17.9 KB | Display Display | ![]() |
FSC (resolution estimation) | ![]() | 6.9 KB | Display | ![]() |
Images | ![]() | 52.4 KB | ||
Masks | ![]() | 36.3 MB | ![]() | |
Filedesc metadata | ![]() | 6.2 KB | ||
Others | ![]() ![]() ![]() | 34.3 MB 33.8 MB 33.8 MB | ||
Archive directory | ![]() ![]() | HTTPS FTP |
-Related structure data
Related structure data | ![]() 9exkMC ![]() 8s89C ![]() 8yl4C ![]() 8yl8C ![]() 9exzC ![]() 9f0lC M: atomic model generated by this map C: citing same article ( |
---|
-
Links
EMDB pages | ![]() ![]() |
---|
-
Map
File | ![]() | ||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Annotation | Main unsharpened map | ||||||||||||||||||||||||||||||||||||
Projections & slices | Image control
Images are generated by Spider. | ||||||||||||||||||||||||||||||||||||
Voxel size | X=Y=Z: 1.31 Å | ||||||||||||||||||||||||||||||||||||
Density |
| ||||||||||||||||||||||||||||||||||||
Symmetry | Space group: 1 | ||||||||||||||||||||||||||||||||||||
Details | EMDB XML:
|
-Supplemental data
-Mask #1
File | ![]() | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Projections & Slices |
| ||||||||||||
Density Histograms |
-Additional map: Additional, sharpened map used for model building
File | emd_50040_additional_1.map | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Annotation | Additional, sharpened map used for model building | ||||||||||||
Projections & Slices |
| ||||||||||||
Density Histograms |
-Half map: #2
File | emd_50040_half_map_1.map | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Projections & Slices |
| ||||||||||||
Density Histograms |
-Half map: #1
File | emd_50040_half_map_2.map | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Projections & Slices |
| ||||||||||||
Density Histograms |
-
Sample components
-Entire : De Novo designed Protein K12
Entire | Name: De Novo designed Protein K12 |
---|---|
Components |
|
-Supramolecule #1: De Novo designed Protein K12
Supramolecule | Name: De Novo designed Protein K12 / type: organelle_or_cellular_component / ID: 1 / Parent: 0 / Macromolecule list: all |
---|---|
Source (natural) | Organism: synthetic construct (others) |
Molecular weight | Theoretical: 114 KDa |
-Macromolecule #1: De novo designed protein K12
Macromolecule | Name: De novo designed protein K12 / type: protein_or_peptide / ID: 1 / Number of copies: 1 / Enantiomer: LEVO |
---|---|
Source (natural) | Organism: synthetic construct (others) |
Molecular weight | Theoretical: 115.735977 KDa |
Recombinant expression | Organism: ![]() ![]() |
Sequence | String: MSGAVYFLLL DLRAEVDEEI AWARRLGLDD LVAALEAVRA LIEGALATLE SADFDYLEFT QRLADALSSL VRVYDDLIAR LEEQPATTL RRAYRILLEY RRKEVRELLE AVQELRDVLE TLERLSRRLG RPDFAGWLVS FVLDHYGELV APDILTNPAK G FRALAHLL ...String: MSGAVYFLLL DLRAEVDEEI AWARRLGLDD LVAALEAVRA LIEGALATLE SADFDYLEFT QRLADALSSL VRVYDDLIAR LEEQPATTL RRAYRILLEY RRKEVRELLE AVQELRDVLE TLERLSRRLG RPDFAGWLVS FVLDHYGELV APDILTNPAK G FRALAHLL RAFLYVLLAL KLRSPDEELR EEARRAVAFL YGEEFVKAHS DEELAELLLE RAREAILEAA RYNSALREEF DA AGGPEGR EAWLERQLLR LRGLVERFLE LWENSELRAG PDGELVAVPG VKGLEIIKKL LEEGKGVNLA LWTLGRLLRA LDL SPEARA AYEAALEALR RARLQLQYVQ SERYEGSDRE RAEAIRAAFE TIRAAAETIR AVIEADTSLP AELKAAYIEV IYAY LLQVA REVRDALWRL AEEILPEYIE KFFKGSEEEQ RLTLYELLRA LGEDYFFLDL EKEGYSEEEL RELFRNAKLE VINAD ESGK IKLYNLILDA KKLNRKVLIK ITLTELSEGS YIITIEVFKS PDAEIPEYEI RVAAVGATSE EILKYLEELK EKAKEG ELI RELLLLYVDR QIAELEEKVA NADKIDPVVA RLAIEEARAR GEELTEADVI EGTRAGYQAA LDVLRRIKAE LEKEKSP EN PFYQFYDKLT EKLKEKGFVS EEEAFEIARE TFGFPADLPP LAAAALRDFA STVLTILEIF KTAEDFSKWY KENKEKLI E LAGLSEEELD KIVRKTLTLL LEALARSVFG SKLGRELLNE ALGTFIKELL ESFFRTHYGL TRGDAVIDFD AKTGILSLR FTPRAYARIR VKEYRDPSLG EKFDNLLDVL SSNPSLKGQV DRLRVSYAFG TPVGTTPALR DATAEDLETD PRLKRHRDFI EEVENLYAE LLIRLEEALK DEPETVEILT EIIGRHLKEV IHDPDVINAL LDRRDLSPEE FAARARAVLD EIIAEEKKLQ E KLLEAVED NPEAKKIVEE IFPKIIATIE RYREWPEREL AGLPLGGSHH HHHH |
-Experimental details
-Structure determination
Method | cryo EM |
---|---|
![]() | single particle reconstruction |
Aggregation state | particle |
-
Sample preparation
Concentration | 3.6 mg/mL |
---|---|
Buffer | pH: 7.4 / Component - Concentration: 0.4 mg/ml / Component - Formula: PBS / Component - Name: PBS / Details: Gibco PBS |
Vitrification | Cryogen name: ETHANE / Chamber humidity: 95 % / Chamber temperature: 283 K |
-
Electron microscopy
Microscope | FEI TITAN KRIOS |
---|---|
Image recording | Film or detector model: TFS FALCON 4i (4k x 4k) / Average electron dose: 50.0 e/Å2 |
Electron beam | Acceleration voltage: 300 kV / Electron source: ![]() |
Electron optics | Illumination mode: SPOT SCAN / Imaging mode: BRIGHT FIELD / Cs: 2.7 mm / Nominal defocus max: 3.74 µm / Nominal defocus min: 0.1 µm / Nominal magnification: 120000 |
Experimental equipment | ![]() Model: Titan Krios / Image courtesy: FEI Company |
+
Image processing
-Atomic model buiding 1
Initial model | Chain - Source name: AlphaFold / Chain - Initial model type: in silico model |
---|---|
Refinement | Protocol: FLEXIBLE FIT |
Output model | ![]() PDB-9exk: |