Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TIKA-94] Speech-to-text transcription #406

Merged
merged 163 commits into from
May 3, 2021
Merged

Conversation

lewismc
Copy link
Member

@lewismc lewismc commented Feb 26, 2021

This is a WIP on the work we are doing as fulfillment of the Hackillinois program.
We will be adding to this and I will be making comments in here.
Great work team on the work so far...

Copy link
Member Author

@lewismc lewismc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great first pass guys. Please take a look at my comment and let iterate.

tika-core/pom.xml Outdated Show resolved Hide resolved
/**
* Interface for Transcriber services.
*
* @since Tika TODO
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Excellent. Thank you for adding this. We will populate it when we complete the pull request.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please just add 2.1

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also please describe the interface.

*/
public interface Transcriber {
/**
* @return
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

First, we need a description of the interface. This is REALLY important
Next we add parameters
Then we add @throws
then return

This method signature needs to change. It is too tighly coupled to the AWS transcribe input. Please model the interface on the tika-translate API.

import static org.junit.Assert.fail;

public class AmazonTranscribeGuessLanguageTest {
AmazonTranscribe transcriber;
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be

Transcribe transcriber;

public void AmazonTranscribeGuessLanguageAudioLongTest() {
String expected = "where is the bus stop? where is the bus stop?";
//TODO: "expected" should be changed to reflect the contents of LongAudioSample.mp3
String audioFilePath = "src/test/resources/LongAudioSample.mp3";
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where is this file?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We create a resource folder in the test folder to store test file

Necessary to get the correct file path from our test resource folder? */
//TODO: is the above commented block necessary to obtain the proper filepath for a file located in the tika-translate/test/resources directory?

String audioFilePath = "src/test/resources/ShortAudioSample.mp3";
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where is this file?

public void AmazonTranscribeGuessLanguageShortVideoTest() {
String expected = "where is the bus stop? where is the bus stop?";
//TODO: "expected" should be changed to reflect the contents of ShortVideoSample.mp4
String videoFilePath = "src/test/resources/ShortVideoSample.mp4";
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where is this file?

public void AmazonTranscribeGuessLanguageLongVideoTest() {
String expected = "hello sir";
//TODO: "expected" should be changed to reflect the contents of LongVideoSample.mp4
String videoFilePath = "src/test/resources/LongVideoSample.mp4";
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

?

tballison and others added 20 commits April 23, 2021 09:41
Updated POMs and link to Miredot docs.
…long with aliases for popular alternate mimetypes for it
Authored-by: Ryan <ryan@softnext.com>

Thank you!
…ponents in TikaConfig as default. Still more cleanup necessary...
@lewismc lewismc closed this May 3, 2021
@lewismc lewismc reopened this May 3, 2021
@lewismc lewismc closed this May 3, 2021
@lewismc lewismc reopened this May 3, 2021
@lewismc
Copy link
Member Author

lewismc commented May 3, 2021

@tballison I know you and I spoke about refactoring this as simple a parser interface...
I would like to merge it for the time being and I can begin to work on the refactoring in a separate ticket.

@lewismc lewismc merged commit 2d0f9e2 into apache:main May 3, 2021
@lewismc lewismc deleted the TIKA-94 branch May 3, 2021 23:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.