Usage Overview¶
credtools provides a comprehensive suite of commands for multi-ancestry fine-mapping analysis. This page provides an overview of all available commands and their typical usage patterns.
Command Summary¶
credtools includes 7 main subcommands that can be used individually or as part of integrated workflows:
Data Preparation Commands¶
munge - Summary Statistics Munging¶
Standardizes GWAS summary statistics from various formats into credtools-compatible format.
Use for: Converting raw GWAS data, handling multi-ancestry studies, standardizing column formats.
chunk - Loci Identification and Chunking¶
Identifies independent genetic loci and splits data into locus-specific files.
Use for: Defining independent loci, creating analysis-ready chunks, handling genome-wide data.
prepare - LD Matrix Preparation¶
Extracts LD matrices from reference panels and creates final fine-mapping inputs.
Use for: Matching summary stats with LD data, handling multi-ancestry reference panels, optimizing for fine-mapping.
Analysis Commands¶
meta - Meta-Analysis¶
Combines summary statistics and LD matrices across ancestries or studies.
Use for: Multi-ancestry meta-analysis, combining studies, increasing statistical power.
qc - Quality Control¶
Performs comprehensive quality checks on fine-mapping inputs.
Use for: Validating data quality, identifying problematic loci, ensuring analysis reliability.
finemap - Fine-Mapping Analysis¶
Runs statistical fine-mapping to identify causal variants.
Use for: Identifying causal variants, calculating posterior probabilities, generating credible sets.
Workflow Commands¶
pipeline - Complete Pipeline¶
Runs the full analysis workflow in a single command.
credtools pipeline prepared_loci.txt results/ --tool multisusie```
**Use for:** Automated end-to-end analysis, production workflows, consistent parameter application.
## Typical Workflows
### Single-Ancestry Analysis
```bash
# 1. Standardize summary statistics
credtools munge gwas_eur.txt munged/
# 2. Identify loci and chunk data
credtools chunk munged/gwas_eur.munged.txt.gz chunked/
# 3. Prepare LD matrices
echo '{"EUR": "/path/to/eur_reference"}' > genotype_config.json
credtools prepare chunked/chunk_info.txt genotype_config.json prepared/
# 4. Run fine-mapping
credtools finemap prepared/final_loci_list.txt results/ --tool susie
# 5. Results are saved in results/ directory
Multi-Ancestry Analysis¶
# 1. Prepare multi-ancestry file mapping
echo '{
"EUR": "gwas_eur.txt",
"ASN": "gwas_asn.txt",
"AFR": "gwas_afr.txt"
}' > ancestry_files.json
# 2. Munge all ancestries
credtools munge ancestry_files.json munged/
# 3. Identify shared loci
credtools chunk munged/ chunked/ --merge-overlapping
# 4. Prepare multi-ancestry LD matrices
echo '{
"EUR": "/path/to/eur_reference",
"ASN": "/path/to/asn_reference",
"AFR": "/path/to/afr_reference"
}' > genotype_config.json
credtools prepare chunked/chunk_info.txt genotype_config.json prepared/
# 5. Run complete pipeline with meta-analysis
credtools pipeline prepared/final_loci_list.txt results/ \
--meta-method meta_all --tool multisusie
# 6. Results are saved in results/ directory
Quality-Focused Workflow¶
# Standard preparation steps
credtools munge ancestry_files.json munged/
credtools chunk munged/ chunked/
credtools prepare chunked/chunk_info.txt genotype_config.json prepared/
# Meta-analysis with quality control
credtools meta prepared/final_loci_list.txt meta/
credtools qc meta/meta_all/loci_list.txt qc/
# Fine-mapping only on QC-passed loci
credtools finemap qc/passed_loci_list.txt finemap/
# Results are saved in finemap/ directory
Comparative Analysis Workflow¶
# Prepare data once
credtools munge ancestry_files.json munged/
credtools chunk munged/ chunked/
credtools prepare chunked/chunk_info.txt genotype_config.json prepared/
# Compare different meta-analysis strategies
for method in meta_all meta_by_population no_meta; do
credtools pipeline prepared/final_loci_list.txt results_${method}/ \
--meta-method $method --tool susie
done
# Compare different fine-mapping tools
for tool in susie abf finemap multisusie; do
credtools finemap meta/meta_all/loci_list.txt results_${tool}/ --tool $tool
done
# Compare results in results_comparison/ directory
Command Selection Guide¶
Choose Commands Based on Your Needs¶
Starting with raw GWAS data?
→ Begin with munge
to standardize formats
Have genome-wide association data?
→ Use chunk
to identify independent loci
Need LD information?
→ Use prepare
to extract LD matrices from reference panels
Multiple ancestries or studies?
→ Use meta
to combine evidence appropriately
Concerned about data quality?
→ Use qc
to validate inputs before analysis
Ready for fine-mapping?
→ Use finemap
for detailed variant-level analysis
Want automated workflow?
→ Use pipeline
for streamlined end-to-end analysis
Need to explore results? → Examine JSON and CSV output files
Individual Commands vs Pipeline¶
Use individual commands when: - Learning the credtools workflow - Need custom intermediate processing - Debugging analysis issues - Comparing different strategies - Maximum control over each step
Use pipeline when: - Running standard analysis workflows - Production or batch processing - Want consistent parameter application - Automated processing pipelines - Time-sensitive analyses
Common Parameter Patterns¶
Computational Resources¶
# High-performance multi-threading
--threads 16
# Memory-efficient processing
--threads 4
# Single-threaded for debugging
--threads 1
Multi-Ancestry Strategies¶
# Maximum power through combination
--meta-method meta_all# Population-specific analysis
--meta-method meta_by_population# Individual ancestry analysis
--meta-method no_meta```
### Fine-Mapping Configuration
```bash
# Conservative single-signal analysis
--tool abf --max-causal 1
# Standard multi-signal analysis
--tool susie --max-causal 3
# Complex multi-signal analysis
--tool susie --max-causal 10 --estimate-residual-variance
# State-of-the-art multi-ancestry
--tool multisusie--max-causal 5
Getting Help¶
Command-Specific Help¶
# Get help for any command
credtools munge --help
credtools finemap --help
credtools pipeline --help
# See all available commands
credtools --help
Detailed Documentation¶
Each command has comprehensive documentation:
- munge - Summary statistics munging
- chunk - Loci identification and chunking
- prepare - LD matrix preparation
- meta - Meta-analysis
- qc - Quality control
- finemap - Fine-mapping analysis
- pipeline - Complete pipeline
Additional Resources¶
- Tutorial - Step-by-step guides
- API Documentation - Programmatic usage
- Installation - Setup instructions
- Contributing - Development guidelines
Tips for Success¶
- Start simple: Begin with default parameters and single ancestry
- Plan your workflow: Understand which commands you'll need before starting
- Use appropriate resources: Match computational requirements to available hardware
- Validate inputs: Use QC command to catch issues early
- Save intermediate results: Keep outputs from each step for troubleshooting
- Document parameters: Record command-line options for reproducibility
- Explore results thoroughly: Examine output files and summary statistics