Commit b1c8a1af authored by Bernhard Liebl

added note about sbert_contextual folder

parent ee924a83
@@ -21,7 +21,8 @@ We introduce the Vectorian as a framework that allows researchers to try out dif
 +-- data
 | +-- processed_data
 | | +-- corpus: preprocessed Vectorian document data for parts of gold.json (e.g. tokenization)
-| +-- doc_embeddings.*: precompute Sentence-BERT embeddings for parts of gold.json
+| | +-- doc_embeddings.*: precomputed Sentence-BERT document embeddings for parts of gold.json
+| | +-- sbert_contextual: precomputed Sentence-BERT contextual token embeddings
 | +-- raw_data
 | | +-- gold.json: gold standard data for Shakespeare text reuse as JSON
 +-- environment.yml: Python dependencies needed to run the notebook in a conda environment