A set of linguistic tools in Go with a web interface in PHP for quick Stable Lexical Marker Analysis (key keyword analysis) and for exploring the results.
Table of Contents
- Contact information
- Supported formats
- Citing snelSLiM
- How to use
If you have suggestions for features, feedback or a question, feel free to create a GitHub issue or contact me about snelSLiM at bert.vandepoel AT uclouvain DOT be.
Having issues building or installing my software? I would love to have more users at different universities, so I don't mind if you email me for help or advice!
Alpino XML, TEI XML BNC/Brown Corpus Variant, CoNLL, DCOI XML, Eindhoven corpus, FoLiA XML, Gysseling corpus, PRAAT TextGrid, XCES GrAF, plain text and XML with XPath query.
A demo version is available on http://demo.snelslim.org/. While several features are locked in demo mode to prevent abusive use, it can come in handy to get a general idea of what snelSLiM can be used for.
An initial publication introducing snelSLiM and comparing it to other tools is available to cite in research using snelSLiM (or of course as a general introduction and reference):
- Van de Poel, B., & Speelman, D. (2020). SnelSLiM, a user-friendly and fast tool to perform better keyword analysis through Stable Lexical Marker Analysis. Computational Linguistics in the Netherlands Journal, 10, 147-160. Retrieved from https://clinjournal.org/clinj/article/view/111
To (also) refer to the tool itself as software:
SnelSLiM is a web application that can easily be installed on cheap shared web hosting, a VPS or a private (virtual or physical) server. This makes it possible for individuals, research groups, companies and faculties to deploy snelSLiM on infrastructure that suits their needs. Of course, more storage makes it possible to store larger corpora, and more CPU cores and processing power greatly increase analysis speed.
Please refer to the installation guide in the INSTALL.md file for details about the requirements, installation and configuration of snelSLiM.
How to use
Besides the help pages within the application, a user manual is also available detailing what the interfaces look like, what is possible, and explaining how to go through the scientific process using snelSLiM.
I'm certainly willing to accept pull requests for bug fixes, extra formats, code cleanup and new functionality (if useful). By submitting a pull request, you agree to license your submission under the same license as this project, the AGPL.
Feel free to report issues and file feature requests on GitHub.
Under the terms of the AGPL you are free to adapt my format parsers for your project (or any other part of the code of course). I hope they can be of broader use in the future.
This project is licensed under the AGPL; see the LICENSE file for details. Some libraries used within this project are licensed under other licenses; you can find more details in the ACKNOWLEDGEMENTS file.
- This project was initially developed as my Bachelor Paper for my Bachelor in Linguistics and Literature under the supervision of Prof. Dr. Dirk Speelman
- I have largely rewritten the application, correcting mistakes in the statistics, extending the functionality and improving user experience as my Master Thesis for my Master in Linguistics under the supervision of Prof. Dr. Dirk Speelman
- Development has continued for the Thesis of my Advanced Master in Advanced Studies of Linguistics under the supervision of Prof. Dr. Dirk Speelman, introducing new visualizations, checks for corpus quality, new corpus formats, the option to share reports, demo mode, multithreading and many small improvements
- I now maintain this project on the side while working on my PhD at UCLouvain
- For further acknowledgements, see the ACKNOWLEDGEMENTS file