
# Usage Overview

credtools provides a comprehensive suite of commands for multi-ancestry fine-mapping analysis. This page gives an overview of all available commands and their typical usage patterns.

## Command Summary

credtools includes 6 main subcommands that can be used individually or as part of integrated workflows:

### Data Preparation Commands

#### munge - Summary Statistics Munging

Standardizes GWAS summary statistics from various formats into the credtools-compatible format.

```bash
credtools munge input_files.json output_dir/
```

**Use for:** Converting raw GWAS data, handling multi-ancestry studies, standardizing column formats.
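
The JSON manifest enumerates the raw summary-statistics files to standardize. The sketch below is purely hypothetical (the keys and layout are illustrative, not the actual specification; run `credtools munge --help` for the real schema):

```bash
# Hypothetical manifest -- keys and layout are illustrative only;
# consult `credtools munge --help` for the actual schema
cat > input_files.json <<'EOF'
{
  "EUR": "raw/gwas_eur.txt.gz",
  "EAS": "raw/gwas_eas.txt.gz"
}
EOF

credtools munge input_files.json munged/
```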

#### chunk - Loci Identification and Chunking

Identifies independent genetic loci and splits data into locus-specific files.

```bash
credtools chunk munged_files.json chunked_output/
```

**Use for:** Defining independent loci, creating analysis-ready chunks, handling genome-wide data.

### Analysis Commands

#### meta - Meta-Analysis

Combines summary statistics and LD matrices across ancestries or studies.

```bash
credtools meta prepared_loci.txt meta_output/ --meta-method meta_all
```

**Use for:** Multi-ancestry meta-analysis, combining studies, increasing statistical power.

#### qc - Quality Control

Performs comprehensive quality checks on fine-mapping inputs.

```bash
credtools qc loci_list.txt qc_output/
```

**Use for:** Validating data quality, identifying problematic loci, ensuring analysis reliability.

#### finemap - Fine-Mapping Analysis

Runs statistical fine-mapping to identify causal variants.

```bash
credtools finemap loci_list.txt finemap_output/ --tool susie --max-causal 3
```

**Use for:** Identifying causal variants, calculating posterior probabilities, generating credible sets.

### Workflow Commands

#### pipeline - Complete Pipeline

Runs the full analysis workflow in a single command.

```bash
credtools pipeline prepared_loci.txt results/ --tool multisusie
```

**Use for:** Automated end-to-end analysis, production workflows, consistent parameter application.


## Typical Workflows

### Single-Ancestry Analysis

```bash
# 1. Standardize summary statistics
credtools munge gwas_eur.txt munged/

# 2. Identify loci, chunk data, and extract LD matrices
#    (include ld_ref column in population config for automatic LD extraction)
credtools chunk population_config.txt chunked/

# 3. Run fine-mapping
credtools finemap chunked/loci_list.txt results/ --tool susie
```

### Multi-Ancestry Analysis

```bash

# 1. Munge all ancestries
credtools munge population_config.txt munged/

# 2. Identify shared loci, chunk data, and extract LD matrices
#    (population config with ld_ref column handles everything)
credtools chunk munged/sumstat_info_updated.txt chunked/ --merge-overlapping

# 3. Run complete pipeline with meta-analysis
credtools pipeline chunked/loci_list.txt results/ \
  --meta-method meta_all --tool multisusie
```

### Quality-Focused Workflow

```bash

# Standard preparation steps
credtools munge population_config.txt munged/
credtools chunk munged/sumstat_info_updated.txt chunked/

# Meta-analysis with quality control
credtools meta chunked/loci_list.txt meta/
credtools qc meta/meta_all/loci_list.txt qc/

# Fine-mapping only on QC-passed loci
credtools finemap qc/passed_loci_list.txt finemap/
```

### Comparative Analysis Workflow

```bash

# Prepare data once
credtools munge population_config.txt munged/
credtools chunk munged/sumstat_info_updated.txt chunked/

# Compare different meta-analysis strategies
for method in meta_all meta_by_population no_meta; do
  credtools pipeline chunked/loci_list.txt results_${method}/ \
    --meta-method $method --tool susie
done

# Compare different fine-mapping tools
for tool in susie abf finemap multisusie; do
  credtools finemap meta/meta_all/loci_list.txt results_${tool}/ --tool $tool
done

# Results land in results_${method}/ and results_${tool}/ for comparison
```

## Command Selection Guide

### Choose Commands Based on Your Needs

**Starting with raw GWAS data?** → Begin with `munge` to standardize formats

**Have genome-wide association data?** → Use `chunk` to identify independent loci

**Need LD information?** → Include `ld_ref` in your population config and `chunk` will extract LD matrices automatically (see the hypothetical config sketch after this list)

**Multiple ancestries or studies?** → Use `meta` to combine evidence appropriately

**Concerned about data quality?** → Use `qc` to validate inputs before analysis

**Ready for fine-mapping?** → Use `finemap` for detailed variant-level analysis

**Want an automated workflow?** → Use `pipeline` for streamlined end-to-end analysis

**Need to explore results?** → Examine the JSON and CSV output files
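
As referenced above, `chunk` reads LD reference locations from the population config. The sketch below is hypothetical (the column names and layout are illustrative, not the documented schema; see the chunk documentation for the actual format):

```bash
# Hypothetical population config -- column names are illustrative only;
# see the chunk documentation for the actual schema. The ld_ref column
# points chunk at each ancestry's LD reference panel.
cat > population_config.txt <<'EOF'
population  sumstat                   ld_ref
EUR         munged/eur.munged.txt.gz  ld_panels/eur
EAS         munged/eas.munged.txt.gz  ld_panels/eas
EOF
```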

### Individual Commands vs Pipeline

**Use individual commands when:**

- Learning the credtools workflow
- Need custom intermediate processing
- Debugging analysis issues
- Comparing different strategies
- Maximum control over each step

**Use pipeline when:**

- Running standard analysis workflows
- Production or batch processing
- Want consistent parameter application
- Automated processing pipelines
- Time-sensitive analyses

## Common Parameter Patterns

### Computational Resources

```bash
# High-performance multi-threading
--threads 16

# Memory-efficient processing
--threads 4

# Single-threaded for debugging
--threads 1
```
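
In context, these flags are appended to the commands shown earlier; for example (a sketch, assuming `--threads` is accepted by the analysis commands as this section indicates):

```bash
# Fine-map every locus in the list using 16 worker threads
credtools finemap chunked/loci_list.txt results/ --tool susie --threads 16
```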

### Multi-Ancestry Strategies

```bash
# Maximum power through combination
--meta-method meta_all

# Population-specific analysis
--meta-method meta_by_population

# Individual ancestry analysis
--meta-method no_meta
```
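
Each strategy plugs into `meta` or `pipeline` unchanged; for instance, a population-specific meta-analysis of the chunked loci (a sketch based on the commands above):

```bash
# Combine evidence within each population rather than across all studies
credtools meta chunked/loci_list.txt meta/ --meta-method meta_by_population
```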

### Fine-Mapping Configuration

```bash
# Conservative single-signal analysis
--tool abf --max-causal 1

# Standard multi-signal analysis
--tool susie --max-causal 3

# Complex multi-signal analysis
--tool susie --max-causal 10 --estimate-residual-variance

# State-of-the-art multi-ancestry
--tool multisusie --max-causal 5
```
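
Assembled into a full command using the inputs from the workflows above (a sketch):

```bash
# Complex multi-signal analysis: up to 10 causal variants per locus
credtools finemap chunked/loci_list.txt finemap/ \
  --tool susie --max-causal 10 --estimate-residual-variance
```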

## Getting Help

### Command-Specific Help

```bash
# Get help for any command
credtools munge --help
credtools finemap --help
credtools pipeline --help

# See all available commands
credtools --help
```

### Detailed Documentation

Each command has comprehensive documentation:

- munge - Summary statistics munging
- chunk - Loci identification and chunking
- meta - Meta-analysis
- qc - Quality control
- finemap - Fine-mapping analysis
- pipeline - Complete pipeline

## Additional Resources

### Tips for Success

1. **Start simple:** Begin with default parameters and a single ancestry
2. **Plan your workflow:** Understand which commands you'll need before starting
3. **Use appropriate resources:** Match computational requirements to available hardware
4. **Validate inputs:** Use the qc command to catch issues early
5. **Save intermediate results:** Keep outputs from each step for troubleshooting
6. **Document parameters:** Record command-line options for reproducibility
7. **Explore results thoroughly:** Examine output files and summary statistics