Speech: Connections stay open in ESTABLISHED or CLOSE_WAIT state #5570

dmarvp · 2018-07-04T21:44:26Z

OS: CentOS Linux release 7.2.1511 (Core)
Python 2.7.5
google-cloud-speech Version: 0.27.0

Steps to reproduce:
After using the Google Speech client, run:
lsof -p <pid>
or:
netstat -a
You'll notice that some TCP connections are still in the ESTABLISHED or CLOSE_WAIT state.

The foreign addresses belong to Google. These connections remain open and, if the client is used a lot, will create problems of the type "IOError: [Errno 24] Too many open files". I'm using this inside a cherrypy server, so it actually makes my server hang after a while.
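To watch the descriptor count climb toward the "Too many open files" limit from inside the process, something like the following could be logged before and after each call. This is only a sketch: the helper names `count_open_fds` and `soft_fd_limit` are made up for illustration, the count relies on `/proc` (so Linux only), and the limit relies on the Unix-only `resource` module.

```python
import os

try:
    import resource  # Unix only; not available on Windows
except ImportError:
    resource = None


def count_open_fds(pid="self"):
    """Return the number of open file descriptors for a process.

    Reads /proc/<pid>/fd, so it returns None where /proc is unavailable.
    For the current process the count may include the descriptor that is
    briefly opened to read the directory itself.
    """
    fd_dir = "/proc/{}/fd".format(pid)
    try:
        return len(os.listdir(fd_dir))
    except OSError:
        return None


def soft_fd_limit():
    """Return the soft RLIMIT_NOFILE limit, or None if it cannot be read."""
    if resource is None:
        return None
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft


if __name__ == "__main__":
    print("open fds: {} (soft limit: {})".format(
        count_open_fds(), soft_fd_limit()))
```

If the number keeps growing across Recognize calls and never drops back, that would match the behaviour described above.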

I noticed that some of the connections close themselves after some minutes, but some of them seem to remain there no matter what. I'm wondering if there is a way to close the connections opened by the speech client after finishing a Recognize call.

Code example

def audio_to_text(self, audio_segment, language_code=None,
                  phrases=None, frame_rate=16000, channels=1):
    if language_code is None:
        language_code = self.DEFAULT_LANGUAGE_CODE

    # Add a second of silence
    silence = pydub.AudioSegment.silent(duration=1000)
    audio_segment = silence + audio_segment

    # Change the frame rate if needed
    if audio_segment.frame_rate != frame_rate:
        audio_segment = audio_segment.set_frame_rate(frame_rate)

    # Change the number of channels if needed
    if audio_segment.channels != channels:
        audio_segment = audio_segment.set_channels(channels)

    # Sanitize the phrases array
    if phrases:
        phrases = sanitize_phrases(phrases)

    # Note: a new client is created on every call
    client = speech.SpeechClient(
        credentials=self.google_cloud_credentials())
    audio = types.RecognitionAudio(content=audio_segment.raw_data)
    config = types.RecognitionConfig(
        encoding=enums.RecognitionConfig.AudioEncoding.LINEAR16,
        sample_rate_hertz=audio_segment.frame_rate,
        language_code=language_code,
        speech_contexts=[speech.types.SpeechContext(phrases=phrases)],
    )

    try:
        response = client.recognize(config, audio)
    except Exception as e:
        print "failed to do audio_to_text: ", e
        return None

    return self.process_result(response, audio_segment)


dmarvp · 2018-07-05T15:37:07Z

Btw, this seems somewhat related to #5523 and similar to this 2-year-old question on Stack Overflow: https://stackoverflow.com/questions/34794516/gcloud-python-close-connection

theacodes · 2018-07-09T16:52:15Z

Seems this should maybe be moved to gRPC, @tseaver?

tseaver · 2018-07-09T17:45:33Z

@theacodes You likely know better than I. :)

theacodes · 2018-07-11T19:35:14Z

This issue was moved to grpc/grpc#15990

dmarvp · 2018-08-10T18:06:34Z

Hello @theacodes, the problem persists, and according to @srini100 on grpc/grpc#15990 it is not on the gRPC side of things. Is there anything we can do about it?

theacodes · 2018-08-10T18:10:35Z

If you're only using one client then I'm not sure. It could be requests?
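For reference, the code in the original report builds a new `speech.SpeechClient` on every `audio_to_text` call. One way to get down to a single client is to create it lazily once and hand the same instance to every request thread. The sketch below assumes nothing about the Speech library itself: `SharedClient` is a hypothetical helper, and the factory passed to it stands in for whatever constructs the real client. Whether this removes the lingering connections is not confirmed anywhere in this thread.

```python
import threading


class SharedClient(object):
    """Lazily build one client and return the same instance to every caller."""

    def __init__(self, factory):
        self._factory = factory      # zero-argument callable that builds the client
        self._lock = threading.Lock()
        self._client = None

    def get(self):
        # The lock keeps concurrent request threads (e.g. in a cherrypy
        # server) from each building their own client on first use.
        with self._lock:
            if self._client is None:
                self._client = self._factory()
            return self._client


# Hypothetical wiring, not taken from the report:
#   shared = SharedClient(lambda: speech.SpeechClient(credentials=creds))
#   response = shared.get().recognize(config, audio)
```

The point of the design is only that the number of clients, and so the number of channels they open, stays fixed no matter how many requests are served.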

jtromans · 2018-08-10T21:04:05Z

I'm also experiencing the same issue, which is similar to the one reported in #5523.

I get it on Windows and on Linux using Python 3.6.5. I am using requests module version '2.18.4' and I recently tried upgrading to '2.19.1'. It would appear as though I'm getting the same issue regardless of version. Regarding grpc, seems like I'm using '1.14.1'.

It seems to me as though grpc is the issue here in that "Since there is no guarantee that memory is the only resource consumed by a grpc.Channel and since some garbage collectors only collect garbage when memory is scarce, there is a liability that applications might run out of those other resources (file descriptors and so on) when they are in fact perfectly reclaimable." from here: grpc/grpc#12531
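The quoted passage is essentially an argument for releasing the channel explicitly instead of waiting for the garbage collector. A minimal illustration of that pattern, using a stand-in object and no real gRPC calls: `FakeChannel` is hypothetical and only mimics an object with a `close()` method. Whether the channel owned by a given client version can be reached and closed this way is an assumption to check against the installed library's documentation.

```python
from contextlib import closing


class FakeChannel(object):
    """Stand-in for a resource that holds a socket or file descriptor."""

    def __init__(self):
        self.closed = False

    def close(self):
        # A real channel would release its sockets here.
        self.closed = True


def use_channel(channel_factory):
    # closing() calls close() when the block exits, even if the body raises,
    # so the descriptor is released at a known point rather than whenever
    # the garbage collector gets around to it.
    with closing(channel_factory()) as channel:
        return channel


if __name__ == "__main__":
    channel = use_channel(FakeChannel)
    print("closed after use: {}".format(channel.closed))
```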

dmarvp · 2018-09-05T21:15:58Z

@theacodes At this point, should I create a Docker image with a minimal reproduction and share it with you? Or would you suggest that I stop using the library and start hitting the HTTP endpoint directly (is that even possible)?

JustinBeckwith added the "triage me" label Jul 5, 2018

theacodes mentioned this issue Jul 11, 2018

Speech: Connections stay open in ESTABLISHED or CLOSE_WAIT state grpc/grpc#15990

Closed

theacodes closed this as completed Jul 11, 2018

jtromans mentioned this issue Aug 10, 2018

PubSub: Errno 24: too many open files with multiple publishers #5523

Closed

davewalkexpel mentioned this issue Dec 3, 2018

Hundreds or thousands of open files when sending traces to Stackdriver Trace in gRPC>=1.12.0 in Python grpc/grpc#17379

Closed

JustinBeckwith assigned theacodes Feb 1, 2021
