A Python framework to transform natural language questions to queries in a database query language.

Last update: Dec 18, 2022

Overview

  __ _ _   _  ___ _ __  _   _
 / _` | | | |/ _ \ '_ \| | | |
| (_| | |_| |  __/ |_) | |_| |
 \__, |\__,_|\___| .__/ \__, |
    |_|          |_|    |___/

What's quepy?

Quepy is a Python framework to transform natural language questions to queries in a database query language. It can be easily customized for different kinds of natural language questions and database queries, so with little coding you can build your own system for natural language access to your database.

Currently Quepy supports the SPARQL and MQL query languages. We plan to extend it to other database query languages.

An example

To illustrate what you can do with quepy, we included an example application to access DBpedia contents via their SPARQL endpoint.

You can try the example online here: Online demo

Or, you can try the example yourself by doing:

python examples/dbpedia/main.py "Who is Tom Cruise?"

And it will output something like this:

SELECT DISTINCT ?x1 WHERE {
    ?x0 rdf:type foaf:Person.
    ?x0 rdfs:label "Tom Cruise"@en.
    ?x0 rdfs:comment ?x1.
}

Thomas Cruise Mapother IV, widely known as Tom Cruise, is an...

The transformation from natural language to SPARQL is done by first using a special form of regular expressions:

person_name = Group(Plus(Pos("NNP")), "person_name")
regex = Lemma("who") + Lemma("be") + person_name + Question(Pos("."))

And then using a convenient way to express semantic relations:

person = IsPerson() + HasKeyword(person_name)
definition = DefinitionOf(person)
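
To make these two steps concrete, here is a minimal, self-contained sketch in plain Python that does not use quepy itself. It assumes the question has already been annotated with lemmas and Penn Treebank part-of-speech tags. The Word type and the functions match_who_is and person_definition are hypothetical names chosen for illustration, and the hand-written matcher only stands in for the regular expression shown above.

```python
from collections import namedtuple

# Hypothetical stand-in for a tagged word: surface token, lemma and POS tag.
Word = namedtuple("Word", ["token", "lemma", "pos"])


def match_who_is(words):
    """Mimic Lemma("who") + Lemma("be") + Plus(Pos("NNP")) + Question(Pos(".")).

    Returns the matched person name, or None if the pattern does not apply.
    """
    if len(words) < 3:
        return None
    if words[0].lemma != "who" or words[1].lemma != "be":
        return None
    rest = words[2:]
    # An optional trailing punctuation mark, like Question(Pos(".")).
    if rest and rest[-1].pos == ".":
        rest = rest[:-1]
    # One or more proper nouns, like Plus(Pos("NNP")).
    if not rest or any(w.pos != "NNP" for w in rest):
        return None
    return " ".join(w.token for w in rest)


def person_definition(name):
    """Mimic DefinitionOf(IsPerson() + HasKeyword(name)) as plain triples."""
    return [
        ("?x0", "rdf:type", "foaf:Person"),
        ("?x0", "rdfs:label", '"%s"@en' % name),
        ("?x0", "rdfs:comment", "?x1"),
    ]


# Hand-tagged input; a real system would get this from a POS tagger.
question = [
    Word("Who", "who", "WP"),
    Word("is", "be", "VBZ"),
    Word("Tom", "tom", "NNP"),
    Word("Cruise", "cruise", "NNP"),
    Word("?", "?", "."),
]

name = match_who_is(question)
triples = person_definition(name)
```

The matcher captures the run of proper nouns as the person name, and the triples mirror the three conditions of the query that the example application prints.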

The rest of the transformation is handled automatically by the framework to finally produce this SPARQL query:

SELECT DISTINCT ?x1 WHERE {
    ?x0 rdf:type foaf:Person.
    ?x0 rdfs:label "Tom Cruise"@en.
    ?x0 rdfs:comment ?x1.
}

Using a very similar procedure, you could generate an MQL query for the same question, obtaining:

[{
    "/common/topic/description": [{}],
    "/type/object/name": "Tom Cruise",
    "/type/object/type": "/people/person"
}]
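
One way to see why the same question can yield either query is to keep the meaning of the question separate from the target language. The following is a rough sketch of that idea in plain Python, not quepy's actual code generator. The facts dictionary and the functions to_sparql and to_mql are hypothetical, and the vocabulary for each language is simply copied from the two outputs shown above.

```python
import json

# Hypothetical language-neutral description of "Who is Tom Cruise?".
facts = {"type": "person", "name": "Tom Cruise", "ask": "description"}

# Vocabulary taken from the SPARQL and MQL outputs shown above.
SPARQL_TERMS = {"person": "foaf:Person", "description": "rdfs:comment"}
MQL_TERMS = {
    "person": "/people/person",
    "description": "/common/topic/description",
}


def to_sparql(facts):
    """Render the facts as a SPARQL SELECT query string."""
    lines = [
        "SELECT DISTINCT ?x1 WHERE {",
        "    ?x0 rdf:type %s." % SPARQL_TERMS[facts["type"]],
        '    ?x0 rdfs:label "%s"@en.' % facts["name"],
        "    ?x0 %s ?x1." % SPARQL_TERMS[facts["ask"]],
        "}",
    ]
    return "\n".join(lines)


def to_mql(facts):
    """Render the same facts as an MQL query, which is a JSON document."""
    query = [{
        MQL_TERMS[facts["ask"]]: [{}],
        "/type/object/name": facts["name"],
        "/type/object/type": MQL_TERMS[facts["type"]],
    }]
    return json.dumps(query, indent=4, sort_keys=True)


sparql = to_sparql(facts)
mql = to_mql(facts)
```

Only the two small vocabulary tables and the rendering functions depend on the query language; the description of the question stays the same.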

Installation

You need to have docopt and numpy installed. Other than that, you can just type:

pip install quepy

You can get more details on the installation here:

http://quepy.readthedocs.org/en/latest/installation.html

Learn more

You can find a tutorial here:

http://quepy.readthedocs.org/en/latest/tutorial.html

And the full documentation here:

http://quepy.readthedocs.org/

Join our mailing list

Contribute!

Want to help develop quepy? Welcome aboard! Find us at http://groups.google.com/group/quepy


Owner

Machinalis
