Monitoring Java Servers On Mac OS Lion Causes Monitored Server To Segfault #277

giampaolo · 2014-05-23T14:59:21Z

From shane.c....@gmail.com on June 10, 2012 05:16:51

Running the following code to monitor a Java based server on Mac OS Lion causes 
the monitored server to seg fault. I tried this with several industrial grade 
servers (ActiveMQ, Tomcat etc) and each time the server crashed after several 
minutes when the script is run on five second loop. 

I want to be clear - it is not the python script that fails - it is the java 
server process that is being monitored that consistently segfaults. I tried 
with non-java processes (firefox etc) and did not observe the same behavior. 

    #!/usr/bin/env python

    import psutil
    import sys

    proc = None;

    #find the server we are looking for
    for ps in psutil.process_iter():
        #print ps.name
        try:
            if( ps.name == "java" ):
                for cmd in ps.cmdline:
                    if cmd.count("apache-activemq-5.4.2") > 0:
                        proc = ps;
                        break
            if proc is not None:
                break

        except Exception, e:
            pass

    if not proc:
        print "SERVER NOT RUNNING..."
        sys.exit(1)


    print " CPU:    {0:15.1f}&#37;".format(proc.get_cpu_percent())
    print " U Time: {0:15.1f}s".format(proc.get_cpu_times().user)
    print " S Time: {0:15.1f}s".format(proc.get_cpu_times().system)
    print " Memory: {0:15.1f}&#37;".format(proc.get_memory_percent())
    print " Threads:{0:13d}".format( proc.get_num_threads() )
    print " Files:  {0:13d}".format( len(proc.get_open_files()) )
    print " INET:   {0:13d}".format( len(proc.get_connections()) )



What is the expected output?  
The service being monitored should continue to run 



What do you see instead?  
Segmentation fault: 11

Original issue: http://code.google.com/p/psutil/issues/detail?id=277

The text was updated successfully, but these errors were encountered:

giampaolo · 2014-05-23T14:59:22Z

From jlo...@gmail.com on June 09, 2012 20:57:35

Hi Shane,

Since you're able to reproduce the problem simply on your system, can you try 
narrowing down the steps to reproduce to the smallest test case? For example, 
does the problem happen due to use of one of these specific calls below? 

print " CPU:    {0:15.1f}%".format(proc.get_cpu_percent())
print " U Time: {0:15.1f}s".format(proc.get_cpu_times().user)
print " S Time: {0:15.1f}s".format(proc.get_cpu_times().system)
print " Memory: {0:15.1f}%".format(proc.get_memory_percent())
print " Threads:{0:13d}".format( proc.get_num_threads() )
print " Files:  {0:13d}".format( len(proc.get_open_files()) )
print " INET:   {0:13d}".format( len(proc.get_connections()) )

It would be very helpful to determine specifically which feature of psutil 
seems to be causing a problem for the Java process. If you are getting a 
hotspot crash dump from the JVM that would also be helpful to include here. 

Thanks

giampaolo · 2014-05-23T14:59:23Z

From shane.c....@gmail.com on June 09, 2012 21:20:50

Yes. I had actually been doing this in the background - I tried running each 
one of these individually and could not reproduce the segfault after running 
~10 minutes each. Within minutes of starting them all again, the segfault 
happened again. So, it appears to not be a single call, but some combination of 
multiple. I will try combining and see what I can come up with.

giampaolo · 2014-05-23T14:59:23Z

From shane.c....@gmail.com on June 09, 2012 22:37:29

OK - I have caused it to happen with this combination:

print " S Time: {0:15.1f}s".format(proc.get_cpu_times().system)
print " Memory: {0:15.1f}%".format(proc.get_memory_percent())
print " Threads:{0:13d}".format( proc.get_num_threads() )
print " Files:  {0:13d}".format( len(proc.get_open_files()) )

This was the smallest combination that I could get it to happen with. Is it 
possible that this is a timing issue - and not really dependent on what we are 
doing - but how long we are doing it for (ie, the longer I spend working with 
the proc object, the greater the chance that the error will occur)? If so, I 
could reduce the amount of time by building the string and then printing it all 
at once - but I don't like the idea that the thing I am using to monitor my 
applications is the one that it murdering them :)

I will turn debugging on in the jvm and see if I can get more information there.

giampaolo · 2014-05-23T14:59:24Z

From g.rodola on February 24, 2013 13:59:45

Any news about this?

giampaolo · 2016-11-13T22:50:03Z

Closing as outdated.

giampaolo closed this as completed Nov 13, 2016

giampaolo added critical macos labels Nov 15, 2020

giampaolo removed the critical label Dec 29, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Monitoring Java Servers On Mac OS Lion Causes Monitored Server To Segfault #277

Monitoring Java Servers On Mac OS Lion Causes Monitored Server To Segfault #277

giampaolo commented May 23, 2014

giampaolo commented May 23, 2014

giampaolo commented May 23, 2014

giampaolo commented May 23, 2014

giampaolo commented May 23, 2014

giampaolo commented Nov 13, 2016

Monitoring Java Servers On Mac OS Lion Causes Monitored Server To Segfault #277

Monitoring Java Servers On Mac OS Lion Causes Monitored Server To Segfault #277

Comments

giampaolo commented May 23, 2014

giampaolo commented May 23, 2014

giampaolo commented May 23, 2014

giampaolo commented May 23, 2014

giampaolo commented May 23, 2014

giampaolo commented Nov 13, 2016