The original dataset is not sorted chronologically but organized in time chunks. Integration is required for easier usage.
Run ./Pre-processing/make_llamps_dataset.ipynb
Use LLM to process the data, score user texts, and perform analysis.
Run ./llamps/Pre-processing/LLMprocess.py separately for each of the following four paths. Running them concurrently can improve efficiency:
./llamps/Pre-processing/llamps-dataset/negative_examples_anonymous./llamps/Pre-processing/llamps-dataset/negative_examples_test./llamps/Pre-processing/llamps-dataset/positive_examples_anonymous./llamps/Pre-processing/llamps-dataset/positive_examples_test
Note: Modify the url and api_key in GPTRank.py and GPTAnalyze.py. These are the interfaces for the LLM.
Run ./llamps/Pre-processing/process_sentence_embedding.py
Run ./llamps/Pre-processing/train_fast_dataset.py
Execute the following command in the terminal:
bash runme_combine16.sh
- File:
llamps/erisk_infer_analyze.ipynb - Calculate the ERDE (Early Risk Detection Error) metric for model evaluation
- File:
llamps/schedule_visulize.ipynb - Visualize and analyze sleep characteristics and patterns from the data
- File:
llamps/t-sne.ipynb - Apply t-SNE for dimensionality reduction and visualize high-dimensional data patterns