Skip to content

Commit 67d9948

Browse files
authored
Update README.md
1 parent bf66544 commit 67d9948

File tree

1 file changed

+7
-29
lines changed

1 file changed

+7
-29
lines changed

README.md

Lines changed: 7 additions & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -1,38 +1,14 @@
1-
# RDF Processing Toolkit
1+
# RDF Processing Toolkit (RPT)
2+
3+
RPT makes RDF/SPARQL workflows on the command line easy. The RDF Processing Toolkit (RPT) integrates several of our tools into a single CLI frontend: It features commands for running SPARQL-statements on triple and quad based data both streaming and static. SPARQL extensions for working with CSV, JSON and XML are included. So is an RML toolkit that allows one to convert RML to SPARQL (or TARQL). RPT embeds several SPARQL engines, including Jena’s ARQ and TDB, as well as one of our own for SPARQL-based batch processing using Apache Spark.
24

35
## News
46

5-
* 2023-05-19 New quality of life features: `cpcat` command and the canned queries `tree.rq` and `gtree.rq`.
6-
* 2023-04-04 Release v1.9.5! RPT now ships with `sansa` (Apache Spark based tooling) and `rmltk` (RML Toolkit) features. A proper GitHub release will follow once Apache Jena 4.8.0 is out as some code depends on its latest SNAPSHOT changes.
7-
* 2023-03-28 Started updating documentation to latest changes (ongoing)
7+
* 2024-09-18 Improved [documentation](https://smartdataanalytics.github.io/RdfProcessingToolkit/)!
88

99
[Previous entries](#History)
1010

1111

12-
## Example Usage
13-
14-
* `integrate` allows one to load multiple RDF files and run multiple queries on them in a single invocation. Further prefixes from a snapshot of [prefix.cc](https://prefix.cc) are predefined and we made the SELECT keyword of SPARQL optional in order to make scripting less verbose. The `--jq` flag enables JSON output for interoperability with the conventional `jq` tool
15-
16-
17-
```
18-
rpt integrate data.nt update.ru more-data.ttl query.rq
19-
20-
rpt integrate --jq file.ttl '?s { ?s a foaf:Person }' | jq '.[].s'
21-
```
22-
23-
* `ngs` is your well known bash tooling such as `head`, `tail`, `wc` adapted to named graphs instead of lines of text
24-
```
25-
# Group RDF into graph based on consecutive subjects and for each named graph count the number of triples
26-
cat file.ttl | ngs subjects | ngs map --sparql 'CONSTRUCT { ?s eg:triples ?c} { SELECT ?s COUNT(*) { ?s ?p ?o } GROUP ?s }
27-
28-
# Count number of named graphs
29-
rpt ngs wc file.trig
30-
31-
# Output the first 3 graphs produced by another command
32-
./produce-graphs.sh | ngs head -n 3
33-
```
34-
35-
3612
## Example Use Cases
3713

3814
* [Lodservatory](https://github.com/SmartDataAnalytics/lodservatory) implements SPARQL endpoint monitoring uses these tools in this [script](https://github.com/SmartDataAnalytics/lodservatory/blob/master/update-status.sh) called from this [git action](https://github.com/SmartDataAnalytics/lodservatory/blob/master/.github/workflows/main.yml).
@@ -51,5 +27,7 @@ The dependency tree can be viewed with Maven using `mvn dependency:tree`.
5127

5228
## History
5329

54-
* (no entry yet)
30+
* 2023-05-19 New quality of life features: `cpcat` command and the canned queries `tree.rq` and `gtree.rq`.
31+
* 2023-04-04 Release v1.9.5! RPT now ships with `sansa` (Apache Spark based tooling) and `rmltk` (RML Toolkit) features. A proper GitHub release will follow once Apache Jena 4.8.0 is out as some code depends on its latest SNAPSHOT changes.
32+
* 2023-03-28 Started updating documentation to latest changes (ongoing)
5533

0 commit comments

Comments
 (0)