Changelog
All notable changes to querygym will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
0.2.0 - 2026-04-12
Added
- ThinkQE reformulation method: multi-round query expansion via iterative corpus feedback
- Strips `<think>...</think>` reasoning traces for compatibility with DeepSeek-R1 style models
- Configurable rounds (`num_interaction`), expansion accumulation, and novelty-based passage filtering
- Prompt template `thinkqe.v1` added to prompt bank
- Full documentation in methods reference and reformulation guide
- ThinkQE listed in README supported methods table
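
The trace-stripping step noted above can be illustrated with a short sketch. This is not querygym's implementation: `strip_think` and `_THINK_BLOCK` are hypothetical names, and the sketch assumes the model wraps its reasoning in a well-formed `<think>...</think>` pair ahead of the final answer.

```python
import re

# Hypothetical helper for illustration; not taken from the querygym codebase.
# DOTALL lets "." match newlines, since reasoning traces usually span several lines.
_THINK_BLOCK = re.compile(r"<think>.*?</think>", flags=re.DOTALL | re.IGNORECASE)


def strip_think(text: str) -> str:
    """Remove every <think>...</think> block and trim surrounding whitespace."""
    return _THINK_BLOCK.sub("", text).strip()


raw = (
    "<think>The user asks about solar cells.\n"
    "Consider synonyms.</think>\n"
    "photovoltaic panel efficiency"
)
print(strip_think(raw))
```

The non-greedy `.*?` keeps each match confined to a single block, so text between two separate traces is preserved.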
Fixed
- Removed misspelled `reqeat_weight` fallback key in `ThinkQE._get_repeat_weight()`
- Renamed `test_method_thinkqe.py` to `test_methods_thinkqe.py` for naming consistency
0.1.4 - 2025-11-21
Added
- Docker support with pre-built images on GitHub Container Registry (GHCR)
- GPU-enabled image (`ghcr.io/ls3-lab/querygym:latest`)
- CPU-only image (`ghcr.io/ls3-lab/querygym:cpu`)
- Multi-platform support (linux/amd64, linux/arm64 for CPU)
- Docker Compose configuration for easy setup
- Comprehensive Docker documentation and examples
- GitHub Actions workflow for automated Docker image building and publishing
- Example notebooks and scripts in `examples/` directory
- Code snippets for quick reference
- Docker examples with Jupyter notebooks
- QueryGym + Pyserini integration examples
- DOCKER_SETUP.md quick reference guide
- Makefile for local Docker development
Changed
- License changed from MIT to Apache License 2.0
- Reorganized examples into three categories: snippets, docker, querygym_pyserini
- Updated to OpenJDK 21 (from 17) in Docker images
- Migrated to Docker Compose V2 syntax (`docker compose` instead of `docker-compose`)
- Improved Docker image tagging strategy (separate tags for GPU and CPU)
- Enhanced README with Docker installation options and citation information
- Updated all documentation to use modern Docker Compose commands
Fixed
- Docker build issues with OpenJDK availability in Debian Trixie
- Docker Compose tag conflicts between GPU and CPU images
- Inconsistent command examples across documentation
0.1.3 - 2025-11-19
Added
- Apache License 2.0
- Citation information with arXiv paper reference in README
Changed
- Package metadata updated with Apache 2.0 license
0.1.2 - 2025-11-18
Changed
- Updated author information with email addresses for Mert Incesu and Negar Arabzadeh
- Version bump for metadata updates
0.1.1 - 2025-11-18
Added
- Added `tqdm` as core dependency for progress bars
- Complete Read the Docs documentation setup
- MkDocs configuration with Material theme
- Comprehensive API reference documentation
- GitHub Actions workflow for automated PyPI publishing
Changed
- Updated package name from `queryGym` to `querygym` (lowercase)
- Updated author information in package metadata
- Improved dependency management
Fixed
- Fixed missing `tqdm` dependency in core package
0.1.0 - 2025-11-17
Added
- Initial release of querygym
- Core reformulation framework
- Eight query reformulation methods:
  - GenQR (generic keyword expansion)
  - GenQR Ensemble
  - Query2Doc
  - QA Expand
  - MuGI
  - LameR
  - Query2E
  - CSQE
- Prompt bank system with YAML configuration
- DataLoader for queries, qrels, and contexts
- BEIR and MS MARCO format loaders
- Pyserini and PyTerrier adapters
- CLI interface with `querygym` command
- OpenAI-compatible LLM client
- Comprehensive test suite
- Example scripts and documentation
Features
- Dependency-free core data loading
- Optional integrations (BEIR, Pyserini, HuggingFace)
- Flexible LLM client supporting any OpenAI-compatible API
- Batch processing with progress bars
- Context-based reformulation support
- Export to TSV/JSONL formats
- Script generation for Pyserini + trec_eval
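
As a rough illustration of the TSV/JSONL export listed above, the sketch below serializes a small set of reformulated queries using only the standard library. It does not use querygym's own export API: the helper names `to_tsv` and `to_jsonl`, the `id`/`query` field names, and the two-column TSV layout are all assumptions made for the example.

```python
import csv
import io
import json


def to_tsv(queries: dict[str, str]) -> str:
    """Serialize {query_id: text} as tab-separated lines (assumed layout: id, text)."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    for qid, text in queries.items():
        writer.writerow([qid, text])
    return buf.getvalue()


def to_jsonl(queries: dict[str, str]) -> str:
    """Serialize {query_id: text} as one JSON object per line (assumed field names)."""
    return "".join(
        json.dumps({"id": qid, "query": text}, ensure_ascii=False) + "\n"
        for qid, text in queries.items()
    )


# Illustrative data, not output from any real run.
reformulated = {
    "q1": "solar cell efficiency photovoltaic",
    "q2": "python gil threading",
}
print(to_tsv(reformulated), end="")
print(to_jsonl(reformulated), end="")
```

A two-column TSV keyed by query ID is a layout commonly used for topic files in retrieval toolkits, while JSONL leaves room for extra fields per query.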