Container Runtime Interface support [SMAGENT-1205] by gnosek · Pull Request #1277 · draios/sysdig

gnosek · 2018-12-18T17:48:20Z

This PR adds support for talking the (gRPC-based) CRI protocol to get container metadata.

It's now out for review and testing but it's not ready to be merged yet. Apart from any issues to be found, the CRI socket is currently hardcoded and all containers are reported as Docker ones.

While I was here, I split container.cpp into per-engine files (it was hard to read even before I touched it).

@mattpag, this greatly conflicts with your async Docker work but we do have a plan to introduce async container metadata in a more general way.

mattpag · 2018-12-18T18:38:50Z

@gnosek It's not a problem, that PR is not supposed to be merged

adalton

This is awesome -- thank you!

To summarize my comments around the docker engine, it'd be nice if we can hide some of the protected/private things that are platform dependent to the appropriate platform-dependent file. I think that'll simply the public header a great deal (both in terms of includes/forward declarations, as well as in macro guards around features and platforms). Since the file is for only that class, scoping it at that level is equivalent to making it a static class member.

// Linux implementation file
...
namespace 
{ 

#if defined(HAS_CAPTURE)
CURLM* m_curlm;
CURL* m_curl;
#endif 

bool s_query_image_info;

size_t curl_write_cb(...)
{
        ...
}

inline bool parse_containerd_mounts(...)
{
        ...
}
...

} // end namespace

// implementation of the class.

adalton · 2018-12-18T18:17:58Z

userspace/libsinsp/container_docker_common.cpp

@@ -0,0 +1,280 @@
+/*


I recommend that we name name the file the same as the class, here sinsp_container_engine_docker.cpp (with an optional _win suffix for Windows-only content).

It'll make it easier when we get to the point that we need a symbol and expect to have it in a file with the same name rather than a layer of indirection to find it. Hopefully it'll also dissuade folks from dumping other content in the same file.

I have mixed feelings about this as the class name is pretty long and basically everything in the directory will be called sinsp_something. I like the idea of class<->file matching but I don't feel putting sinsp_ in front of everything will help much ;)

Maybe rename all the engines to sinsp::container_engine::docker etc. and move them into libsinsp/container_engine/docker.cpp and so on?

Sorry for the delay in getting back to you. That sounds great to me.

I just thought of something... we seem to include every directory in the include search path, so if you name this docker.{h,cpp} then you'll run into a name conflict with sysdig/userspace/libsinsp/docker.h.

(I really don't think we should include every directory in the include search path, but that's a different story.)

Also, there exists a class named sinsp, so I don't think we can get away with a namespace with the same name.

Maybe make this libsinsp::container_engine_docker in container_engine_docker.{h,cpp}?

userspace/libsinsp/container_docker.h

adalton · 2018-12-18T18:20:06Z

userspace/libsinsp/container_docker.h

+	static void set_query_image_info(bool query_image_info);
+	static void parse_json_mounts(const Json::Value &mnt_obj, std::vector<sinsp_container_info::container_mount_info> &mounts);
+
+protected:


I think (hope?) you can make this private

adalton · 2018-12-18T18:22:48Z

userspace/libsinsp/container_docker.h

+
+	static bool m_query_image_info;
+#if !defined(CYGWING_AGENT) && defined(HAS_CAPTURE)
+	static CURLM *m_curlm;


Since you've separated things out, these don't really need to be static members of the class, they can be static (or better yet, members of an anonymous namespace) in the implementation file.

Does it being protected hurt anything? I can see us e.g. subclassing the engine in some tests. We never subclass the engines in production code so this should be effectively a no-op change.

A couple of points here:

Ideally, tests should need access to the private state, so (again ideally) we shouldn't need a test to subclass this in order to get at this stuff.

If we really can't manage to test this using its public API, then I'd suggest just making a test class (not subclass) and explicitly making this a friend of that test class.

That said, if there are existing tests for this class that use the approach that you describe, then leaving it protected is ok.

adalton · 2018-12-18T18:27:14Z

userspace/libsinsp/container_docker.h

+
+protected:
+#if !defined(CYGWING_AGENT) && defined(HAS_CAPTURE)
+	static size_t curl_write_callback(const char* ptr, size_t size, size_t nmemb, std::string* json);


Similar to the comment about the static members, all these static functions I suspect could just be stand-alone functions in an anonymous namespace in the implementation file. I think doing this will simplify the includes in this file too.

It's not going to be pretty with configurable values (socket path, grpc timeout) but I'll move what I can to the cpp file. BTW it feels like !HAS_CAPTURE should have its own impl file as well.

userspace/libsinsp/container_info.h

adalton · 2018-12-18T18:40:21Z

userspace/libsinsp/container_lxc.h

+	bool resolve(sinsp_container_manager* manager, sinsp_threadinfo* tinfo, bool query_os_for_missing_info);
+};
+
+class sinsp_container_engine_libvirt_lxc


I suggest we put this in its own file too

userspace/libsinsp/container_mesos.cpp

adalton · 2018-12-18T18:44:15Z

userspace/libsinsp/container_mesos.h

+public:
+	bool resolve(sinsp_container_manager* manager, sinsp_threadinfo* tinfo, bool query_os_for_missing_info);
+	static bool set_mesos_task_id(sinsp_container_info* container, sinsp_threadinfo* tinfo);
+protected:


Can this be private? Same comment in other classes.

See other comment (tl;dr: I'm not sure it gives us anything)

As a general rule, I always prefer exposing the minimum possible privilege. I agree that if there are no production subclasses, then private and protected are functionally equivalent today, but who knows what tomorrow might bring :)

userspace/libsinsp/container_docker.h

gnosek · 2018-12-21T16:03:08Z

@adalton, please have another look. I deferred splitting the lxc engines until we decide about the name changes but I think I addressed everything else I could.

adalton

New changes look good to me.

userspace/libsinsp/container_info.h

adalton · 2018-12-27T13:49:49Z

userspace/libsinsp/container_mesos.h

+public:
+	bool resolve(sinsp_container_manager* manager, sinsp_threadinfo* tinfo, bool query_os_for_missing_info);
+	static bool set_mesos_task_id(sinsp_container_info* container, sinsp_threadinfo* tinfo);
+protected:


As a general rule, I always prefer exposing the minimum possible privilege. I agree that if there are no production subclasses, then private and protected are functionally equivalent today, but who knows what tomorrow might bring :)

adalton · 2018-12-27T16:54:30Z

userspace/libsinsp/container_docker.h

@@ -0,0 +1,61 @@
+/*
+Copyright (C) 2013-2018 Draios Inc dba Sysdig.


Nit: My understanding is we're now (within the last week or two) officially "Sysdig".

Oh neat, I didn't know that! If we do change it to Sysdig, we should tackle that separately and change all the headers in one step.

adalton · 2018-12-27T16:58:25Z

userspace/libsinsp/container_docker.h

+
+	bool resolve(sinsp_container_manager* manager, sinsp_threadinfo* tinfo, bool query_os_for_missing_info);
+	static void cleanup();
+	static void set_query_image_info(bool query_image_info);


I don't see an implementation for set_query_image_info() or parse_json_mounts()

they're in userspace/libsinsp/container_docker_common.cpp:

set_query_image_info(): https://github.com/draios/sysdig/pull/1277/files#diff-7c1ac911c0444c40b5263133df8f03e1R43

parse_json_mounts(): https://github.com/draios/sysdig/pull/1277/files#diff-7c1ac911c0444c40b5263133df8f03e1R29

adalton · 2018-12-27T17:39:28Z