Grow better with AI.

This GitHub repository contains all the code necessary for the proper functioning of the project we named FloraFlow. This cornerstone project was carried out during our Data Scientist training at Data Scientest.

Project overview :

This project aims to develop a deep learning model capable of accurately classifying different plant species, with the goal of improving crop management.

The main steps of our project include:

In-depth exploration of the dataset,
Data preparation,
The design and implementation of a learning model,
And the analysis of its performance.

Our ultimate goal is to provide the foundations of a robust tool for plant classification, thereby offering farmers more efficient means to manage their crops.

Behind FloraFlow are four individuals in training at DataScientest:

Gilles de Peretti Linkedin - GitHub
Hassan ZBIB Linkedin - GitHub
Dr. Iréné AMIEHE ESSOMBA Linkedin - GitHub
Olivier MOTELET Linkedin - GitHub

About the dataset:

We used the Kaggle's dataset V2 Plant Seedlings Dataset, which includes a collection of plant images belonging to various species.

The V2 Plant Seedlings dataset consists of 5539 images representing seedlings at the germination stage. They are grouped into 12 classes, each representing a variety/species of plants.

Structure du projet

├── LICENSE
├── README.md
├── models
├── notebooks
│   ├── 0- Data exploration.ipynb
│   ├── 1- Trainer demo.ipynb
│   ├── 2- Campaign demo.ipynb
│   └── 3- Model results.ipynb
├── packages.txt
├── references
├── reports
│   └── figures
├── requirements.txt
└── src
    ├── __init__.py
    ├── features
    │   ├── __init__.py
    │   ├── data_builder.py
    │   └── segmentation.py
    ├── main.py
    ├── models
    │   ├── __init__.py
    │   ├── campaign.py
    │   ├── final_test
    │   │   ├── __init__.py
    │   │   ├── stage1.py
    │   │   ├── stage2.py
    │   │   ├── stage3.py
    │   │   └── stage4.py
    │   ├── model_wrapper.py
    │   ├── models.py
    ├── streamlit
    │   ├── app.py
    │   ├── config.toml
    |   ├── mods
    |   |    ├── __init__.py
    |   |    └── histogram_color.py
    │   └── fichiers
    │       └── gallery
    └── visualization
        ├── __init__.py
        ├── eda.py
        └── graphs.py

Installation

To start using FloraFlow, clone the repository and install the necessary dependencies:

git clone https://github.com/DataScientest-Studio/AU23_Plantes.git FloraFlow
cd FloraFlow
pip install -r requirements.txt

Then, download the Kaggle dataset mentioned earlier and drop it into your FloraFlow folder. It's best to avoid using the NonSegmented folder, as it might have pics of several plant species all mixed in one folder.

Utilisation

To get the Streamlit app going and interact with the user interface:

streamlit run streamlit/app.py

or consult the online version

To run the demo notebooks, just open Jupyter Notebook and navigate through the notebooks/ directory.

Project based on the cookiecutter data science project template. #cookiecutterdatascience

Name		Name	Last commit message	Last commit date
Latest commit History 158 Commits
.devcontainer		.devcontainer
.streamlit		.streamlit
models		models
notebooks		notebooks
references		references
reports		reports
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README french.md		README french.md
README.md		README.md
app.py		app.py
data		data
packages.txt		packages.txt
requirements.txt		requirements.txt

License

gillesdeperetti/AU23_Plantes

Folders and files

Latest commit

History

Repository files navigation

Grow better with AI.

Project overview :

About the dataset:

Structure du projet

Installation

Utilisation

About

Resources

License

Stars

Watchers

Forks

Languages