Commit 8e732b9b authored by Mateusz Pawlik's avatar Mateusz Pawlik

Tidied up the main README.

parent 739a838e
# Tree Edit Distance similarity join - datasets scripts # Datasets for tree edit distance experiments
This repository contains all resources to download and process the datasets for This repository contains all resources to acquire datasets for experimenting
tree similarity join experiments. on tree edit distance algorithms.
**We do not store the datasets**, only the scripts to obtain and prepare them. **We do not store the datasets**, only the scripts to obtain and prepare them.
## Datasets description ## Datasets description
Currently we support the following datasest: Currently we support the following datasest:
- **bozen** - Bozen streets - **Bolzano** - Residential addresses in the city of Bolzano.
- **dblp** - DBLP - **DBLP** - Bibliographic XML data.
- **Python** - Abstract syntax trees of Python source code in JSON.
The details about each dataset can be found in the corresponding README files. - **Sentiment** - Semantic trees of movie reviews in the PennTreeBank format.
- **Swissprot** - Protein sequence data in XML.
The details about each dataset can be found in the README files in the
datasets subdirectories.
## Repository organisation ## Repository organisation
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment