Skip to content

A machine learning-based approach to identify the likelihood of Autism Spectrum Disorder (ASD) using questionnaire data. The project uses structured features and achieves 86.88% accuracy with Logistic Regression, supporting early screening and diagnosis.

Notifications You must be signed in to change notification settings

Devatva24/Autism-Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

4 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Early Autism Detection Using Machine Learning

๐Ÿ“Œ Objective

To develop a machine learning model that can predict the likelihood of Autism Spectrum Disorder (ASD) in individuals based on questionnaire-based screening data.


๐Ÿ›‘ Problem Statement

Early detection of Autism Spectrum Disorder is crucial for timely intervention and support. Traditional diagnosis often requires specialized clinical assessment, which may not be easily accessible in all regions. This project aims to assist early screening using machine learning models trained on structured questionnaire responses.


๐Ÿ“Š Dataset

  • Source: Kaggle - Autism Screening Data
  • Features include:
    • Age
    • Gender
    • Ethnicity
    • Family history of ASD
    • Jaundice at birth
    • Screening test responses (Q1 to Q10)
    • Result from the Autism Spectrum Quotient test
    • Class/target: ASD or Not

โš™๏ธ Methodology

1. Data Preprocessing

  • Handle missing values and inconsistent entries
  • Convert categorical data to numerical using encoding
  • Normalize/standardize features as needed
  • Train-test split (e.g., 80-20)

2. Exploratory Data Analysis (EDA)

  • Understand class imbalance
  • Visualize age distribution and test response trends
  • Check correlation between features

3. Model Selection

  • Test multiple models:
    • Logistic Regression โœ… (Best Performing)
    • Random Forest
    • Support Vector Machine (SVM)
    • Naive Bayes
  • Use GridSearchCV or cross-validation for tuning

4. Evaluation Metrics

  • Accuracy
  • Precision, Recall, F1-Score
  • Confusion Matrix
  • ROC-AUC Score

5. Final Model

  • Logistic Regression achieved:
    • Accuracy: 86.88%
    • High Recall: Suitable for screening tasks

6. Deployment (optional)

  • Streamlit app to collect questionnaire responses and display predictions
  • User-friendly interface for non-technical audiences

๐Ÿ› ๏ธ Technologies Used

  • Python
  • Pandas, NumPy
  • Matplotlib, Seaborn
  • Scikit-learn
  • (Optional) Streamlit for frontend

โœ… Results

  • Best model (Logistic Regression) achieved 86.88% accuracy
  • Balanced performance with good recall for detecting potential ASD cases

๐Ÿ“ Conclusion

The project showcases the potential of machine learning in supporting early autism screening using easily available data from questionnaires. With further development and validation, such tools can complement clinical assessments, especially in resource-constrained settings.


๐Ÿ”ฎ Future Work

  • Train on larger and more diverse datasets
  • Implement ensemble learning methods
  • Enhance the user interface for deployment
  • Collaborate with healthcare professionals for clinical validation

About

A machine learning-based approach to identify the likelihood of Autism Spectrum Disorder (ASD) using questionnaire data. The project uses structured features and achieves 86.88% accuracy with Logistic Regression, supporting early screening and diagnosis.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published