Usage guide

This page explains, step by step, how to set up SPARQLess for a given SPARQL endpoint. If you want to use SPARQLess as a black box, you may prefer to use the Docker image instead, as described here.

Installing dependencies

The project is written in TypeScript and uses Node.js as its runtime.

You will need the following installed before you proceed:

Node 16.13.0 (Gallium LTS) - easily managed with nvm
npm

Once they are installed, run npm install in the project root directory to install all required dependencies.

Library interface

The main functionality is encapsulated by the SPARQLess class in src/api/library.ts. This class contains functions for the main tasks users may want to perform.

Most notably, the buildSchemaAndRunEndpoint function is provided for users who want as little configuration as possible. It is enough to set up a Config as shown in the next section, and run SPARQLess like so:

const config = new SPARQLessConfigBuilder()
    .sparqlEndpoint('https://data.gov.cz/sparql')
    .build();

const sparqless = new SPARQLess();

// This method returns a Promise, so you should await it
// if you want to do something after the server has started.
sparqless.buildSchemaAndRunEndpoint(config);

The SPARQLess class also contains other functions which perform smaller parts of the whole algorithm. You may want to use them if your task is more complex than "convert this SPARQL endpoint into a GraphQL endpoint". For example, the observeAndBuildSchema function performs all the steps up to and including schema building, so you could use it to perform other operations on the generated GraphQL schema rather than starting a GraphQL server immediately.

Edit configuration

There is one required configuration step before you run the library - configuring the SPARQL endpoint you want SPARQLess to run against. This is done by creating a Config object, which you will pass to SPARQLess functions. There is a pre-defined list of known online and working endpoints in src/observation/endpoints.ts in case you just want to try the project without having a specific SPARQL endpoint in mind, but you can easily define your own.

A very basic configuration can look like this:

const config = new SPARQLessConfigBuilder()
    .sparqlEndpoint('https://data.gov.cz/sparql')
    .build();

As you can see, the only required configuration value when using the builder is specifying the SPARQL endpoint. This example uses the SPARQLessConfigBuilder to build the configuration. You may of course create the Config object manually, but using the builder lets you make use of pre-defined sane defaults, allowing you to only specify the configuration values you care about.
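To illustrate why a builder with defaults is convenient, here is a minimal sketch of the pattern. It is not the actual SPARQLessConfigBuilder: the ExampleConfig type, its field names and the port method are assumptions made up for this example. Only the default values (port 4000, counting up to 1000 occurrences) are taken from this guide.

```typescript
// Illustrative only: these field names are assumptions,
// not the real SPARQLess Config definition.
interface ExampleConfig {
    sparqlEndpoint: string;
    port: number;
    maxPropertyCount: number;
}

type OptionalSettings = Omit<ExampleConfig, 'sparqlEndpoint'>;

// Defaults mirror values mentioned in this guide.
const EXAMPLE_DEFAULTS: OptionalSettings = {
    port: 4000,
    maxPropertyCount: 1000,
};

class ExampleConfigBuilder {
    private endpoint?: string;
    private overrides: Partial<OptionalSettings> = {};

    // The only required value: the SPARQL endpoint URL.
    sparqlEndpoint(url: string): this {
        this.endpoint = url;
        return this;
    }

    // Optional override; anything left unset keeps its default.
    port(port: number): this {
        this.overrides.port = port;
        return this;
    }

    build(): ExampleConfig {
        if (this.endpoint === undefined) {
            throw new Error('A SPARQL endpoint URL is required');
        }
        // Later spreads win, so overrides replace defaults.
        return {
            ...EXAMPLE_DEFAULTS,
            ...this.overrides,
            sparqlEndpoint: this.endpoint,
        };
    }
}

const exampleConfig = new ExampleConfigBuilder()
    .sparqlEndpoint('https://data.gov.cz/sparql')
    .build();
```

Building the object by hand would require spelling out every field, whereas the builder only asks for the values that differ from the defaults.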

There are other configuration values which you are free to modify, but they have sensible defaults in case you just want to get started. If you want to find out more about additional configuration options, you can refer to the configuration guide. Alternatively, you can examine their definition in the code, where they are also documented.

Run it

Putting together the above code examples is enough to start a SPARQLess instance, which will observe the configured SPARQL endpoint and convert it to a GraphQL one. Alternatively, if you are running SPARQLess from this repository, you can run npm start. This runs src/main.ts, an entrypoint meant for starting the server from within this repository. In that case, edit your configuration there.

In the logs, you will see that it first runs some observations on the target endpoint to collect information about its schema and the data contained within. This may take a while, especially for large datasets, depending on the configuration parameters used. As an example, with the CZ Government open data endpoint shown in the configuration above, it takes under 10 minutes to have a fully functional GraphQL instance running.

After observation is finished, you will see that a fully functional GraphQL interface is available for you to explore and query at the configured port (localhost:4000 by default).

Explore the schema

When you open the GraphQL interface in your browser, you will see an interface provided by Apollo Server. It allows you to explore the schema including all of the available classes, as well as their properties, relations, how many times they occur in the dataset and other metadata.

[Image: GraphQL interface]

The interface also allows you to interactively build and execute queries against the endpoint.

Hot reloading

You may notice that with the default configuration, another round of observation will start right after the GraphQL server is created.

This is hot reloading in action. Because fully observing the entire dataset in the endpoint may take a long time, the initial observations are limited by the default configuration: observation will only count up to 1000 occurrences for each property in the dataset.

After the server has started, it is fully available and functional for the purposes of schema exploration and querying. In the background, SPARQLess continues to make more detailed observations and seamlessly updates the GraphQL schema in the GraphQL endpoint.
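One way to picture a capped count is a SPARQL subquery with a LIMIT, wrapped in a COUNT. The sketch below builds such a query as a string. It is an assumption about how a limited observation could be expressed, not the query SPARQLess actually sends; the function name and the example property IRI are hypothetical.

```typescript
// Hypothetical helper: builds a SPARQL query that counts the occurrences
// of a property, optionally stopping at maxCount matches.
function buildCappedCountQuery(propertyIri: string, maxCount?: number): string {
    // Reject characters that are not allowed inside a SPARQL IRI reference.
    if (/[\s<>"{}|\\^`]/.test(propertyIri)) {
        throw new Error(`Invalid IRI: ${propertyIri}`);
    }
    if (maxCount !== undefined && (!Number.isInteger(maxCount) || maxCount <= 0)) {
        throw new RangeError('maxCount must be a positive integer');
    }
    // Without a cap, the inner query scans every occurrence.
    const limitClause = maxCount === undefined ? '' : ` LIMIT ${maxCount}`;
    return [
        'SELECT (COUNT(*) AS ?count) WHERE {',
        `  { SELECT ?s ?o WHERE { ?s <${propertyIri}> ?o }${limitClause} }`,
        '}',
    ].join('\n');
}

// Quick initial observation: count at most 1000 occurrences.
const quickQuery = buildCappedCountQuery('http://purl.org/dc/terms/title', 1000);

// Detailed follow-up observation: no cap.
const fullQuery = buildCappedCountQuery('http://purl.org/dc/terms/title');
```

The capped variant lets the endpoint stop early, which is what makes a fast first pass followed by a slower, more detailed pass worthwhile.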

You can read more about hot reloading here.

Query the endpoint

If you are not yet familiar with how to compose GraphQL queries, you can learn more about GraphQL here.

You can use the interface to query the data, either writing the query yourself, or using the graphical query editor to compose the query.

[Image: GraphQL querying]

NOTE: be careful running queries which have an unlimited result size! If your SPARQL endpoint contains a large dataset, the query could either run for a very long time, or it might just fail due to a timeout. You should use the limit and offset parameters provided in each query field as shown in the image above.
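As a rough sketch of paging with limit and offset, the helper below builds GraphQL query strings page by page. The field name dataset and the subfield title are placeholders: the real names depend on the schema generated for your endpoint. Only the limit and offset arguments come from this guide.

```typescript
// Builds a GraphQL query for one page of results.
// `field` and `subfields` are placeholders for names from your generated schema.
function buildPagedQuery(
    field: string,
    subfields: string[],
    limit: number,
    offset: number,
): string {
    if (!Number.isInteger(limit) || limit <= 0) {
        throw new RangeError('limit must be a positive integer');
    }
    if (!Number.isInteger(offset) || offset < 0) {
        throw new RangeError('offset must be a non-negative integer');
    }
    return `{ ${field}(limit: ${limit}, offset: ${offset}) { ${subfields.join(' ')} } }`;
}

// JSON body for a GraphQL-over-HTTP POST request.
function buildRequestBody(query: string): string {
    return JSON.stringify({ query });
}

// Queries for the first three pages, 100 results each.
const pageSize = 100;
const pagedQueries = [0, 1, 2].map((page) =>
    buildPagedQuery('dataset', ['title'], pageSize, page * pageSize),
);

console.log(pagedQueries[0]);
// prints: { dataset(limit: 100, offset: 0) { title } }
```

Each request body can then be sent as a JSON POST to the GraphQL endpoint with whichever HTTP client you prefer, stopping once a page comes back with fewer than pageSize results.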