pymatgen api_miner/feature scraper

This repositories serves two key functionalities:
One is to use the Materials Project API to pull down the data from the Materials project AND the Battery Explorer.
The second is series of functions to mine the pulled data into the training/test data and deployment data for use in ML models
the primary output are a csv file of the features and a csv files of the labels used in our paper:

note that to use these scripts, you must have an account on Materials Project (free) so you can generate your own MAPI key (if you find my MAPI key on my scripts, please let me know.

Dependencies

pymatgen 2018.3.23

folder structure

add a directory tree diagram here

mining the api extracted data

In general, using the api to query the data over an internet connection is slow, so we want to mine everything into a local set of folders so we can use efficiently
there is an additional set of functions which further mines the raw database to generate features and labels for battery machine learning, which is a separate repository you can check to see state of the art machine learning algorithms

database_reader_functions

compilation of functions designed to read data from the databases mined by the api-miners materials_project_reader: reads .json formatted text files
Since the contents of the structure base are just pickled python objects, structure_base_reader is just pickle.loads(file)

csv_processed_datasets

csv is important because ONLY csv files are allowed three files in fact are stored here

training/test labels
training/test features
deployment features (sometimes called prediction set) these files should be transferred to the battery machine learning repo for further analysis. This is the FINAL PRODUCT OF THE PYMATGEN API MINER

data dump

we include a folder called data-dump where all raw .csv files which are successfully mined from the data-miner

FeatureLabelMining

Idea to have separate scripts for different features is simple. First, we can classify different features Second, different types of features may be more or less expensive to compute

scripts

contains two folders, one which focuses on the Materials Project, one which focuses on the Battery Explorer

Battery API Miner Folder

one script which queries the Battery Explorer and mines them to a text file in a directory that has the default name: Battery_Explorer

Materials API Miner Folder

two core scripts

one which mines materials data and structure data to json format in one file each (Materials_Project_Database)
second one mines the structure into a pymatgen structure object, stored as a pickle file (structure_base)

Data Scraping

performs all feature construction

##final_processing concatenates all the mp data and the train data into the final csv files that we can transport to the machine learning module

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.idea		.idea
Shannon_Radii		Shannon_Radii
__pycache__		__pycache__
archive		archive
csv_processed_datasets		csv_processed_datasets
database_reader_functions		database_reader_functions
feature_miner_functions		feature_miner_functions
label_miner_functions		label_miner_functions
scripts		scripts
.gitignore		.gitignore
README.md		README.md
log		log
settings.py		settings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pymatgen api_miner/feature scraper

Dependencies

folder structure

mining the api extracted data

database_reader_functions

csv_processed_datasets

data dump

FeatureLabelMining

scripts

Battery API Miner Folder

Materials API Miner Folder

Data Scraping

About

Releases

Packages

Languages

zhaonat/pymatgen_materials_project_api_miner

Folders and files

Latest commit

History

Repository files navigation

pymatgen api_miner/feature scraper

Dependencies

folder structure

mining the api extracted data

database_reader_functions

csv_processed_datasets

data dump

FeatureLabelMining

scripts

Battery API Miner Folder

Materials API Miner Folder

Data Scraping

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages