Artifact 2f6f21190ba3c825464f14d0db5c72e75f566b7c0683d8a411e8ba6da0c8f86d:
- File docs/home.md — part of check-in [74dd286374] at 2025-09-08 22:38:55 on branch docs — updates home.md to add the pixi trick (user: mathos size: 1985) [more...]
Query generation
This is a query generation project. We take a database and generate queries on that database. The queries generated can be used as input data for machine learning pipelines involving query optimization, cardinality estimation or latency prediction.
The queries are generated based on foreign keys, and stats collected from the database itself.
Important links
Installation of requirements
We use pixi for managing the python environment. For running the LLM augmentation we use ollama
You can install pixi in Linux and Mac by running:
curl -fsSL https://pixi.sh/install.sh | sh
You can check pixi was succesfully installed by running:
pixi run main --help
which will give you a list of our endpoints along with some documentation
To install ollama you can run
curl -fsSL https://ollama.com/install.sh | sh
For Mac you can use the installer.
Pixi cheatsheet
If you want to run the formatter and the test make sure you are in the dev environment and run it with
pixi shell -e dev
pixi run lint
If all went well there are no errors on the formatting or the tests.
Running the generator
We support various ways of running queries. To see our supported pipelines see the query-generation pipelines available.
Authors and contact
This project was made by Gabriel Lozano under the supervision of Yanlei Diao and Guillaum Lachaud at École Polytechnique. You may contact the main collaborator via email gabriel.lozano@lix.polytechnique.fr