Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[PROJECT PROPOSAL] #6

Open
nehavish006 opened this issue May 20, 2023 · 3 comments · May be fixed by #115
Open

[PROJECT PROPOSAL] #6

nehavish006 opened this issue May 20, 2023 · 3 comments · May be fixed by #115
Assignees
Labels
assigned The project is assigned to a contributor GSSOC 23 This issue is part of Girl script summer of Code

Comments

@nehavish006
Copy link

Project Request

Automatic captioning of videos based on cooking by understanding the action behind the scene and providing Nutrtional information based on the ingredients used.

Field Deep Learning
About Video Captioning Using CNN-RNN Model
Github nehavish006
Email nehavish006@gmail.com
Label Project Request

https://github.com/nehavish006


Define You

GSSOC Participant
Contributor - Snekha C | Contributor

Project Name

Description

Video Captioning is a task of automatic captioning a video by understanding the action and event in the video which can help in the retrieval of the video efficiently through text. Video Captioning is an encoder decoder mode based on sequence to sequence learning. Automated video caption generator helps searching of videos in websites better and make content easier. The video information is considered as a sequence of images with 10 to 12 seconds short video clips. In the proposed system, Convolutional Neural Networks (VGG 16) is used for feature extraction and LSTM is used for encoding and decoding the features. Greedy search algorithm is used for predicting the efficient caption and gives speedy word extraction. Finally, the language converter is used for specific people language to understand the captions.

Scope

Objectives:

  • To help the people with Deaf and hard of hearing individuals to watch videos helps people to focus on and remember the information more easily.
  • To build the feature extraction using the convolutional neural network (VGG 16).
  • To examine how LSTM is used for encoding and decoding the features.
  • To explore how the greedy search algorithm predict the efficient caption.

Deliverables:

  • A trained CNN RNN model capable of analyzing the video and generating the captions.
  • Well-documented guidelines, including dataset preparation, training, inference, and any specific requirements.

Timeline

Start date : The date of assignment
End date: June 15 2023

Video Links or Support Links

Sandeep Samleti , Ashish Mishra , Alok Jhajhria , Shivam Kumar Rai, Gaurav Malik, 2021, "Real Time Video Captioning Using Deep Learning", International Journal Of Engineering Research & Technology (IJERT) Volume 10, Issue 12 (December 2021).

@tushtithakur
Copy link

Hi, I am very much interested and would like to contribute as a part of @GSSOC'23.
Please assign the task to me. Thank you.

@adithya-s-k adithya-s-k added assigned The project is assigned to a contributor GSSOC 23 This issue is part of Girl script summer of Code labels May 20, 2023
@adithya-s-k
Copy link
Owner

@nehavish006 I have assigned this particular project to you under GSSOC 23.
Will you be able to manage the project alone or would you like to collaborate with other contributors

All the best contributing
if you have any doubts on the project structure feel free to contact me

@nehavish006
Copy link
Author

nehavish006 commented May 20, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
assigned The project is assigned to a contributor GSSOC 23 This issue is part of Girl script summer of Code
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants