[GSoC 2019] Growing the model garden 🌳
#1684
Comments
@nsthorat (as I promised here), @dsmilkov, @manrajgrover, let me know if you have an opinion on which models you would like to see first. I would also love to hear any ideas and high-level views on the plan and the implementation details.
Mask-RCNN and YOLO would be very helpful to see first, @sdll @nsthorat @dsmilkov @manrajgrover, because they are widely used in computer vision. In my own research, I'm trying to use them for robotics control to improve accessibility for people with disabilities. Of all the candidate models, these would do the most to increase control and independence for people with disabilities. In fact, I could really use some help running Mask-RCNN and YOLO in the browser for robotics control as soon as possible. I hope you can help.
@kavikode, TF.js implementations of YOLO already exist. To convert Mask-RCNN as described here, you might try the following:
However, Mask-RCNN is quite a heavyweight model and may require further optimizations to run smoothly. |
Thank you so much, @sdll.
What is the process to pitch for moving items from the |
@beriberikix, thank you for your interest! I have not dived into the details, but DeepSpeech seems to be resource-intensive, given the discussion in your other issue. Have you had any success converting the model using |
I haven't had any success with the converter tool. I've asked for some clarification on how the saved model is generated. I'll update if and when I get the converter working.
I've worked some with DeepSpeech in the past. As I recall, it has both significant compute work for generating the input audio features (a la MFCC) as well as decoding via the language model. I'm not sure how much of that functionality also has corresponding JS libraries, so it's possible that porting it may be an involved effort. By any chance, have you looked into the supporting work apart from the TF model yet to see how easy or hard it might be? |
Thanks, everyone, for the suggestions. Since the GSoC 2019 run has come to an end, I have added the final report to the issue and updated the status of each port. Let me know if you have any other ideas on porting text detection/text recognition or would like to work on other models.
Hey, I'm looking to implement MaskRCNN inference with tfjs for a production project I'm working on. I would love to get some help and would very much like to contribute to getting MaskRCNN into the model garden if possible. Please let me know where and how we can discuss this and take it forward. From what I have achieved so far, it looks like the following things are required for my particular case: Rewrite |
This issue tracked the progress of the GSoC'19 project "Reasonable Effectiveness of Mobile Inference: Adaptive Growth of the TensorFlow.js Model Garden".
The final report is here.
Active
DeepLab
EfficientNet
Text Detection
Text Recognition
Scratchpad
VGGish
DeepGaze
Style Transfer
Pix2Pix
Speech to Text
Face Swap
Sketch Simplification
DeOldify
HNAtt
VAMPIRE
CycleGAN
EDANet
ShuffleNet
ErfNet
YOLO
Mask-RCNN