Query-generation

Home
Login

Home

Query generation

This is a query generation project. We take a database and generate queries on that database. The queries generated can be used as input data for machine learning pipelines involving query optimization, cardinality estimation or latency prediction.

The queries are generated based on foreign keys, and stats collected from the database itself.

Index

  1. Installation of requirements
  2. Running the Code
  3. Authors and Contact

Installation of requirements

We use pixi for managing the python environment. For running the LLM augmentation we use ollama

You can install pixi in Linux and Mac by running:

curl -fsSL https://pixi.sh/install.sh | sh

You can check pixi was succesfully installed by running:

pixi run main --help

which will give you a list of our endpoints along with some documentation

To install ollama you can run

curl -fsSL https://ollama.com/install.sh | sh

For Mac you can use the installer.

Running the generator

We support various ways of running queries. You may check them ... TODO

Authors and contact

This project was made by Gabriel Lozano under the supervision of Yanlei Diao and Guillaum Lachaud at École Polytechnique. You may contact the main collaborator via email gabriel.lozano@lix.polytechnique.fr