Due to a storage error important parts of the scripts were lost.
The most important parts of these have been restored -- still missing the visualization and interactive parts. But with this the core calculations are covered and datasets can be created to be explored.
Might have add those later.
- Loads .tsv/.csv files and uses two columns for id and text respectively. The rest are kept for metadata.
- The SCA metadata are stored on exit and can be reloaded.
- Stopwords can be loaded from various languages thanks to NLTK
- Stopwords can be addd (also stored and reloaded from file)
- Stopwords can be removed
- Stopwords do not have to be used
Large portions of the added features have been written with the help of Gemini, Claude and Cursor.