A Deep Learning–based Audio-in-Image Watermarking Scheme

Some notes one dataset preparation

Need to import your own Audio and Image Dataset. For our training run we used the Speech Commands Dataset from Kaggle and resampled with a sampling rate of 8192. For images we used the MS COCO Dataset rescaled to 128x128 pixels.
Since we are training the model in TPU, it is necessary to offload the dataset into the memory first. That's why we are using preprocessed image and audio numpy blocks for dataset.
Audio dataset expected numpy shape: (Dataset_length, 8192, 1)
Image dataset expected numpy shape: (Dataset_length, 128, 128, 3)
After Data preparation, install the dependencies from requirements.txt

Cite

@inproceedings{das2021deep,
  title={A Deep Learning-based Audio-in-Image Watermarking Scheme},
  author={Das, Arjon and Zhong, Xin},
  booktitle={2021 International Conference on Visual Communications and Image Processing (VCIP)},
  pages={1--5},
  year={2021},
  organization={IEEE}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
DeepAIWM.ipynb		DeepAIWM.ipynb
README.md		README.md
components.py		components.py
model.py		model.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

DeepAIWM.ipynb

DeepAIWM.ipynb

README.md

README.md

components.py

components.py

model.py

model.py

requirements.txt

requirements.txt

Repository files navigation

A Deep Learning–based Audio-in-Image Watermarking Scheme

Some notes one dataset preparation

Cite

About

Releases

Packages

Languages

arjondas/deep-aiwm

Folders and files

Latest commit

History

Repository files navigation

A Deep Learning–based Audio-in-Image Watermarking Scheme

Some notes one dataset preparation

Cite

About

Resources

Stars

Watchers

Forks

Languages