Query generation
This is a query generation project. We take a database and generate queries on that database. The queries generated can be used as input data for machine learning pipelines involving query optimization, cardinality estimation or latency prediction.
The queries are generated based on foreign keys, and stats collected from the database itself.
Index
Installation of requirements
We use pixi for managing the python environment. For running the LLM augmentation we use ollama
You can install pixi in Linux and Mac by running:
curl -fsSL https://pixi.sh/install.sh | sh
You can check pixi was succesfully installed by running:
pixi run main --help
which will give you a list of our endpoints along with some documentation
To install ollama you can run
curl -fsSL https://ollama.com/install.sh | sh
For Mac you can use the installer.
Running the generator
We support various ways of running queries. You may check them ... TODO
Authors and contact
This project was made by Gabriel Lozano under the supervision of Yanlei Diao and Guillaum Lachaud at École Polytechnique. You may contact the main collaborator via email gabriel.lozano@lix.polytechnique.fr