# OptiPFair Documentation
Welcome to the OptiPFair documentation. OptiPFair is a Python library for structured pruning of large language models, with a focus on GLU architectures and fairness analysis.
## Why Prune Language Models?
Pruning reduces the size and computational requirements of large language models, making them:
- Faster for inference
- More efficient in terms of memory usage
- Easier to deploy on resource-constrained devices
## Why Analyze Bias in Language Models?
Understanding bias in language models is crucial for:
- Ensuring fairness in AI applications
- Identifying problematic patterns in model behavior
- Developing mitigation strategies through pruning
- Making informed decisions about model deployment
## Key Features
- GLU Architecture-Aware Pruning: Maintains the paired nature of `gate_proj` and `up_proj` layers
- Multiple Neuron Selection Methods: MAW, VOW, and PON for different pruning strategies
- Flexible Pruning Targets: Support for both pruning percentage and target expansion rate
- Bias Visualization Tools: Comprehensive analysis of how models process demographic attributes
- Quantitative Bias Metrics: Numeric measurements for consistent evaluation
- Simple API and CLI: Easy-to-use interfaces for Python and the command line (see the sketch after this list)
- Progress Tracking: Visual progress bars and detailed statistics
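As a rough illustration, a pruning call from Python might look like the sketch below. The entry point `prune_model` and the parameter names shown (`pruning_type`, `neuron_selection_method`, `pruning_percentage`) are assumptions inferred from the feature list above, not a verbatim copy of the library's interface; consult the API Reference for the exact signature.

```python
from transformers import AutoModelForCausalLM

# `prune_model` and its parameters are illustrative assumptions;
# see the API Reference for the actual interface.
from optipfair import prune_model

# Load a GLU-based model such as LLaMA
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-1B")

# Prune 20% of MLP neurons using the MAW selection method
pruned_model = prune_model(
    model=model,
    pruning_type="MLP_GLU",
    neuron_selection_method="MAW",
    pruning_percentage=20,
)

pruned_model.save_pretrained("./llama-pruned")
```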
## Getting Started
- Installation: How to install OptiPFair
- Usage: Basic usage examples for pruning
- Bias Visualization: Analyzing fairness in transformers
- API Reference: Detailed API documentation
- Examples: In-depth examples and tutorials
## Supported Model Architectures
OptiPFair is designed to work with transformer-based models that use a GLU architecture in their MLP components (see the sketch after this list), including:
- LLaMA family (LLaMA, LLaMA-2, LLaMA-3)
- Mistral models
- And other models with similar GLU architectures
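For reference, the GLU-style MLP block these models share combines a gate projection and an up projection element-wise before projecting back down. The minimal sketch below follows the Hugging Face layer-naming convention and uses SiLU, as in LLaMA; it is a simplified illustration, not OptiPFair code.

```python
import torch
import torch.nn as nn

class GLUMLP(nn.Module):
    """Simplified GLU-style MLP block as used in LLaMA-family models."""

    def __init__(self, hidden_size: int, intermediate_size: int):
        super().__init__()
        # gate_proj and up_proj share the same output dimension:
        # removing an intermediate neuron means removing the same row
        # from both, plus the matching column from down_proj.
        self.gate_proj = nn.Linear(hidden_size, intermediate_size, bias=False)
        self.up_proj = nn.Linear(hidden_size, intermediate_size, bias=False)
        self.down_proj = nn.Linear(intermediate_size, hidden_size, bias=False)
        self.act_fn = nn.SiLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Element-wise product ties each gate_proj output to the
        # corresponding up_proj output.
        return self.down_proj(self.act_fn(self.gate_proj(x)) * self.up_proj(x))
```

Because the element-wise product couples each `gate_proj` output to the corresponding `up_proj` output, the two layers must be pruned together, which is exactly the pairing OptiPFair preserves.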
## Citation
If you use OptiPFair in your research, please cite: