Pere Martra

Pere Martra

AI Engineer & Applied Researcher specializing in LLM optimization, structured pruning, and fairness-aware systems.

I help ML engineering teams optimize large language models for production through practical research and hands-on implementation.

Current Work

Training

Hands-on workshops for engineering teams on LLM optimization and creation.

What I Offer

  • LLM optimization techniques (pruning, quantization, distillation)
  • Fairness-aware model development
  • Production deployment strategies

Format

Short workshops • Small teams • Customized content

Limited availability: 1-2 engagements maximum

Consulting

Specialized consulting services for production LLM challenges.

LLM Optimization

Hands-on optimization for production deployments. Structured pruning, quantization, and efficiency improvements.

Agent Migration

Research Partnership

Collaborative framework development for migrating agentic systems from proprietary models (GPT-4, Claude) to open-source alternatives (Llama, Mistral).

Limited availability: 1-2 engagements maximum

Selected Work

Books

Rearchitecting LLMs (Manning, forthcoming)
Large Language Models Projects (Apress, 2024)

Open Source

OptiPfair – Fairness-aware structured pruning
LLM course – LLMs: Hands-on Projects
1,700+ GitHub stars combined