Contribute NeighborClassifierFilter #1803

mrosen · 2018-02-08T21:48:18Z

From doc/stages/filters.knnassign.rst:

The knnassign filter allows you update the value of a dimension for specific points
to a value determined by a K-nearest neighbors vote. For each point, the k
nearest neighbors are queried and if more than half of them have the same
value for the specified dimension, the filter updates the selected point
accordingly

For example, if an automated classification procedure put/left erroneous
vegetation points near the edges of buildings which were largely classified
correctly, you could try using this filter to fix that problem.

Similiarly, some automated classification processes result in prediction for
only a subset of the original point cloud. This filter could be used to
extrapolate those predictions to the original.

abellgithub

I think this mostly looks OK. You should rename processOne to something else, because it conflicts with Stage::processOne and is confusing. It generates warning on some compilers.

mrosen · 2018-02-12T22:13:31Z

Is there any value to preserving the processOne override (which is used in several other filters)? The override is currenly broken b/c of the need to pass the KDIndex and a PointRef that can be used to de-reference the IDs it provides. These could be passed by other means restoring the signature.

abellgithub · 2018-02-12T23:13:12Z

The reason to override is to allow the generic use processOne() by callers. In your case, processOne() doesn't follow the conventions of behavior expected by a stage, so it doesn't make any sense to try to preserve the override. processOne() is used by the pipeline infrastructure to support streaming, but your filter can't do streaming because it needs to calculate a KD-tree before it starts processing points.

mrosen · 2018-02-13T00:05:13Z

thanks for this clarification. I've changed the name and removed the virtual specifier.

abellgithub · 2018-02-14T16:47:50Z

The operator << function seems unnecessary (and contains an infinite recursion).
I wonder if there's any reason to allow selecting a dimension. Does it make sense for anything other than classification? I always prefer doing something simple and making it more generic if there is call for it later.
I don't think the code that uses a second input set for a KD tree works. When you create the KD tree on the second set, all the point IDs refer to the points in the second set. It makes no sense to use IDs in the first set to try to find nearest neighbors in the second set.
You don't need to pass a temporary point into processPoint(). You can pass the view and just create a PointRef from the view, which seems much more clear to me:

PointRef temp(view, nearest_neighbor_id);

mrosen · 2018-02-16T22:51:29Z

<< function. Good catch. Seem unneeded, will remove.

Dimension optional. I agree with your focus on making common use cases easy. But really I think what we're seeing here is that the name wants to be KNNClassifyFilter. I'm inclined to remove "dimension" and change the name. Thoughts?

I think the "Candidate" KD Tree code is right. I create the KD Tree on the second (Candidate) set but the IDs still come from the input set. For each point in the input set, we vote based on the NNs found in the Candidate. My specific workflow is that the Candidate has a subset of the input points. There's a unit test that exercises this.

Temporary PointRef. Passing a copy of the View is only marginally better than passing a PointRef based on the View. The issue is that the KDIndex should expose some method/attribute that allows it's client to interpret its results. I considered exposing the View from the KDTree ... but thought that too disruptive for a first contribution. I'd be willing to do this if there's concurrence.

abellgithub · 2018-02-19T14:42:27Z

You're correct and I was wrong. I missed that you were passing a PointRef into KDIndex, which binds to the proper PointView when fetching the X/Y to find neighbors. That said, the test assumes that the code is correct, it doesn't really test the algorithm, though it will tell you if a change impacts the current behavior.

On the name, perhaps NeighborClassifier or some such?

mrosen · 2018-02-22T15:59:46Z

Can someone explain to me what is wrong with this? The reported failure from Travis:

Scanning dependencies of target pdal_plugin_writer_sqlite
[ 54%] Building CXX object plugins/sqlite/CMakeFiles/pdal_plugin_writer_sqlite.dir/io/SQLiteWriter.cpp.o
[ 54%] Linking CXX executable ../../../bin/pdal_filters_python_test
../../../lib/libpdal_base.so.6.1.0: undefined reference to `Json::Value::Value(long long)'
collect2: error: ld returned 1 exit status
make[2]: *** [plugins/python/filters/CMakeFiles/pdal_filters_python_test.dir/build.make:212: bin/pdal_filters_python_test] Error 1
make[1]: *** [CMakeFiles/Makefile2:1046: plugins/python/filters/CMakeFiles/pdal_filters_python_test.dir/all] Error 2
make[1]: *** Waiting for unfinished jobs....
[ 54%] Linking CXX shared library ../../lib/libpdal_plugin_writer_sqlite.so
[ 54%] Built target pdal_plugin_writer_sqlite
make: *** [Makefile:163: all] Error 2

I don't believe this link failure from SQLiteWriter has anything to do with the changes to the new NeighborClassifierFilter.

abellgithub · 2018-02-23T23:23:27Z

You're correct, that error has nothing to do with your PR.

chambbj · 2018-03-07T19:33:56Z

filters/NeighborClassifierFilter.cpp

+
+void KNNAssignFilter::doOneNoDomain(PointRef &point, PointRef &temp, KD3Index &kdi)
+{
+    std::vector<PointId> iSrc = kdi.neighbors(point, m_k);


In the case that the query point and the KD3Index are from the same point cloud, neighbors will return the query point. If that is not the intended behavior, you could detect the situation and increment the number of neighbors accordingly.

Let's see ... I had two use cases in mind:

A single point cloud where we need to smooth the classifications, so K > 1 (think a few stray mis-classifications)

Two point clouds, the candidate is completely classified and the source (where the query point comes from) is a superset of those points. We need to extrapolate the classifications, from the smaller candidate to the larger source. Here K == 1.

chambbj · 2018-03-07T19:36:07Z

Could be a later addition if it's useful, but have you also considered defining your neighborhood by radius?

abellgithub · 2018-03-07T20:05:09Z

It would also be good if you could change the name of the class to be consistent with the name of the filter. Do you have any other ideas on names that would better describe the behavior?

mrosen · 2018-03-07T22:30:29Z

It would also be good if you could change the name of the class to be consistent with the name of the filter.

smh uh, ... yeah. that's embarrassing. I'll certainly fix this. Don't tell anyone. ;-).

Michael Rosen added 2 commits February 8, 2018 12:30

Contribute KNNAssignFilter.

84a03f4

hook up KNNAssign filter to StageFactory

dc28f97

mrosen mentioned this pull request Feb 9, 2018

KNNFilter #1800

Closed

abellgithub reviewed Feb 12, 2018

View reviewed changes

rename processOne --> doOne and make non-virtual

dc5a22f

remove dimension, hard-code Classification as what we update

bc77e38

Michael Rosen added 3 commits February 21, 2018 10:46

merge

ce182ef

Merge branch 'master' of github.com:PDAL/PDAL

edad7fa

change name knnassignfilter --> neighborclassifierfilter

8ee8184

mrosen changed the title ~~Contribute KNNAssignFilter~~ Contribute NeighborClassifierFilter Feb 21, 2018

chambbj reviewed Mar 7, 2018

View reviewed changes

finish rename KNNAssignFilter --> NeighborClassifierFilter

e5f3dea

abellgithub merged commit dd5627e into PDAL:master Mar 13, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contribute NeighborClassifierFilter #1803

Contribute NeighborClassifierFilter #1803

mrosen commented Feb 8, 2018

abellgithub left a comment

mrosen commented Feb 12, 2018

abellgithub commented Feb 12, 2018

mrosen commented Feb 13, 2018

abellgithub commented Feb 14, 2018

mrosen commented Feb 16, 2018

abellgithub commented Feb 19, 2018 •

edited

mrosen commented Feb 22, 2018

abellgithub commented Feb 23, 2018

chambbj Mar 7, 2018

mrosen Mar 7, 2018

chambbj commented Mar 7, 2018

abellgithub commented Mar 7, 2018

mrosen commented Mar 7, 2018 •

edited

Contribute NeighborClassifierFilter #1803

Contribute NeighborClassifierFilter #1803

Conversation

mrosen commented Feb 8, 2018

abellgithub left a comment

Choose a reason for hiding this comment

mrosen commented Feb 12, 2018

abellgithub commented Feb 12, 2018

mrosen commented Feb 13, 2018

abellgithub commented Feb 14, 2018

mrosen commented Feb 16, 2018

abellgithub commented Feb 19, 2018 • edited

mrosen commented Feb 22, 2018

abellgithub commented Feb 23, 2018

chambbj Mar 7, 2018

Choose a reason for hiding this comment

mrosen Mar 7, 2018

Choose a reason for hiding this comment

chambbj commented Mar 7, 2018

abellgithub commented Mar 7, 2018

mrosen commented Mar 7, 2018 • edited

abellgithub commented Feb 19, 2018 •

edited

mrosen commented Mar 7, 2018 •

edited