fix: LBP pipeline broken due to wrong Chai-1 FASTA generator#6
Open
Xinping-Liu wants to merge 1 commit intoOTeam-AI4S:mainfrom
Open
fix: LBP pipeline broken due to wrong Chai-1 FASTA generator#6Xinping-Liu wants to merge 1 commit intoOTeam-AI4S:mainfrom
Xinping-Liu wants to merge 1 commit intoOTeam-AI4S:mainfrom
Conversation
The LBP pipeline was broken because _prepare_refold_chai1 called make_chai1_fasta_multi_process, which is AME-specific and requires an AME CSV to look up task names and pre-built ligand SMILES files. LBP has no AME CSV, so all 24 FASTA files failed to generate and Chai-1 received zero inputs. Fix by adding make_chai1_fasta_from_backbone_dir to ReFold, which: - reads ligand residue names directly from backbone PDB HETATM records - looks up SMILES from the CCD components.cif database - reads LigandMPNN-generated sequences from inverse_fold/seqs/ Update _prepare_refold_chai1 in pipeline_framework.py to use the new method for LBP and interface tasks. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Problem
The LBP (and interface) pipeline failed silently at the
refold_preparestage._prepare_refold_chai1calledmake_chai1_fasta_multi_process, which is AME-specific: it looks up task names from an AME CSV and reads pre-built ligand SMILES fromassets/ame/standard_nonprotein_fasta/. LBP has no AME CSV, so all FASTA files failed to generate with"Failed to get task name"warnings, and Chai-1 received zero inputs.Fix
Add
make_chai1_fasta_from_backbone_dirtoReFold, a general-purpose method that:components.cifdatabase (no AME CSV required)inverse_fold/seqs/Update
_prepare_refold_chai1inpipeline_framework.pyto use the new method for LBP/interface tasks. AME continues to usemake_chai1_fasta_multi_processunchanged.Test
Ran end-to-end on 3 LBP backbone structures (× 8 sequences = 24 designs):
refold_prepare: 24/24 FASTA files generated ✓raw_data.csv✓Files changed
refold/refold_api.py: addmake_chai1_fasta_from_backbone_dirmethodscripts/pipeline_framework.py: update_prepare_refold_chai1to use new method