Permalink
Commits on Jan 17, 2017
  1. Using fully qualified hdfs path. (#3705)

    * Using fully qualified hdfs path.
    
    * Review changes.
    
    * Remove unused imports.
    
    * Variable name change.
    akashdw committed with himanshug Jan 17, 2017
  2. Migrating extendedset from Metamarkets. (#3694)

    * Migrating extendedset from Metamarkets.
    
    * Notice change
    
    * More details in NOTICE
    
    * NOTICE formatting.
    
    * suppress header checkstlye for extendedset.
    akashdw committed with drcrallen Jan 17, 2017
Commits on Jan 12, 2017
  1. SQL support for nested groupBys. (#3806)

    * SQL support for nested groupBys.
    
    Allows, for example, doing exact count distinct by writing:
    
      SELECT COUNT(*) FROM (SELECT DISTINCT col FROM druid.foo)
    
    Contrast with approximate count distinct, which is:
    
      SELECT COUNT(DISTINCT col) FROM druid.foo
    
    * Add deeply-nested groupBy docs, tests, and maxQueryCount config.
    
    * Extract magic constants into statics.
    
    * Rework rules to put preconditions in the "matches" method.
    gianm committed with jon-wei Jan 12, 2017
Commits on Jan 11, 2017
  1. Enable auto license checking (#3836)

    * Enable license checking
    
    * Clean duplicated license headers
    jihoonson committed with fjy Jan 11, 2017
  2. Add an option to SearchQuery to choose a search query execution strat…

    …egy (#3792)
    
    * Add an option to SearchQuery to choose a search query execution strategy.
    
    Supported strategies are
    1) Index-only query execution
    2) Cursor-based scan
    3) Auto: choose an efficient strategy for a given query
    
    * Add SearchStrategy and SearchQueryExecutor
    
    * Address comments
    
    * Rename strategies and set UseIndexesStrategy as the default strategy
    
    * Add a cost-based planner for auto strategy
    
    * Add document
    
    * Fix code style
    
    * apply code style
    
    * apply comments
    jihoonson committed with gianm Jan 11, 2017
Commits on Jan 9, 2017
Commits on Jan 7, 2017
Commits on Jan 6, 2017
  1. Update caching.md typo (#3824)

    * Update caching.md
    
    Typo of Command vs Comma
    
    * Update index.md
    
    Fixing `Command` typo
    tranv94 committed with fjy Jan 6, 2017
  2. Randomly choose a server when multiple best servers are available (#3822

    )
    
    * Randomly choose a server when multiple best servers are available
    
    * Use one pass instead of two
    
    * Fix code style issues
    niketh committed with pjain1 Jan 6, 2017
  3. Remove makeMathExpressionSelector from ColumnSelectorFactory. (#3815)

    * Remove makeMathExpressionSelector from ColumnSelectorFactory.
    
    * Add @Nullable annotations in places, fix Number.class check.
    
    * Break up createBindings, add tests.
    
    * Add null check.
    gianm committed with fjy Jan 6, 2017
Commits on Jan 5, 2017
  1. log sizes of created smoosh files (#3817)

    * log when merging of intermediate segments starts during batch ingestion
    
    * log sizes of created smoosh files
    himanshug committed with fjy Jan 5, 2017
Commits on Jan 4, 2017
  1. Downgrade to avatica-server 1.8.0, skip avatica-core. (#3813)

    This matches the version bundled by Calcite 1.10.0.
    gianm committed with fjy Jan 4, 2017
Commits on Jan 3, 2017
  1. Filters: Use ColumnSelectorFactory directly for building row-based ma…

    …tchers. (#3797)
    
    * Filters: Use ColumnSelectorFactory directly for building row-based matchers.
    
    * Adjustments based on code review.
    
    - BoundDimFilter: fewer volatiles, rename matchesAnything to !matchesNothing.
    - HavingSpecs: Clarify that they are not thread-safe, and make DimFilterHavingSpec
      not thread safe.
    - Renamed rowType to rowSignature.
    - Added specializations for time-based vs non-time-based DimensionSelector in RBCSF.
    - Added convenience method DimensionHanderUtils.createColumnSelectorPlus.
    - Added singleton ZeroIndexedInts.
    - Added test cases for DimFilterHavingSpec.
    
    * Make ValueMatcherColumnSelectorStrategy actually use the associated selector.
    
    * Add RangeIndexedInts.
    
    * DimFilterHavingSpec: Fix concurrent usage guard on jdk7.
    
    * Add assertion to ZeroIndexedInts.
    
    * Rename no-longer-volatile members.
    gianm committed on GitHub Jan 3, 2017
  2. Allow users to specify additional command line args for creating drui…

    …d package (#3812)
    
    * Allow users to specify additional command line args for creating tar balls
    
    This PR allows users to specify additional command line options to the
    pull deps command while creating druid distribution.
    e.g. To also package graphite-emitter in druid tarball one can run -
    mvn package -Ddruid.distribution.pulldeps.opts='-c
    io.druid.extensions.contrib:graphite-emitter'
    
    * Set default to --clean instead of blank value
    nishantmonu51 committed with drcrallen Jan 3, 2017
Commits on Jan 2, 2017
  1. Exhibitor Support (#3664)

    * allow JsonConfigTesterBase to treat the fields of collections
    
    * [Feature] Exhibitor Support (#3664)
    
    This patch provides the integration of Druid & Netflix Exhibitor. Druid
    currently use Apache Curator as ZooKeeper client. Curator can be
    integrated with Exhibitor to achieve a live/updating list of the
    ZooKeeper ensemble. This patch enables Druid to use this features.
    yuusaku-t committed with drcrallen Jan 2, 2017
Commits on Dec 26, 2016
  1. Fix the build after #3697 (#3807)

    leventov committed with himanshug Dec 26, 2016
  2. Don't return leaked Objects back to StupidPool, because this is dange…

    …rous. Reuse Cleaners in StupidPool. Make StupidPools named. Add StupidPool.leakedObjectCount(). Minor fixes (#3631)
    leventov committed with himanshug Dec 26, 2016
Commits on Dec 24, 2016
  1. Lookup cache refactoring (the main part of #3667) (#3697)

    * Lookup cache refactoring (the main part of druid-io/druid#3667)
    
    * Use PowerMock's static methods in NamespaceLookupExtractorFactoryTest
    
    * Fix KafkaLookupExtractorFactoryTest
    
    * Use VisibleForTesting annotation instead of Javadoc comment
    
    * Create a NamespaceExtractionCacheManager separately for each test in NamespaceExtractionCacheManagersTest
    
    * Rename CacheScheduler.NoCache.ENTRY_DISPOSED to ENTRY_CLOSED
    
    * Reduce visibility of NamespaceExtractionCacheManager.cacheCount() and monitor() implementations, and don't run NamespaceExtractionCacheManagerExecutorsTest with off-heap cache (it didn't before)
    
    * In NamespaceLookupExtractorFactory, use safer idiom to check if CacheState is NoCache or VersionedCache
    
    * More logging in CacheHandler constructor and close(), VersionedCache.close()
    
    * PR comments addressed
    
    * Make CacheScheduler.EntryImpl AutoCloseable, avoid 'dispose' verb in comments, logging and naming in CacheScheduler in favor of 'close'
    
    * More Javadoc comments to CacheScheduler
    
    * Fix NPE
    
    * Remove logging in OnHeapNamespaceExtractionCacheManager.expungeCollectedCaches()
    
    * Make NamespaceExtractionCacheManagersTest.testRacyCreation() to have similar load to what it be before the refactoring
    
    * Unwrap NamespaceExtractionCacheManager.scheduledExecutorService from unneeded MoreExecutors.listeningDecorator() and specify that this is ScheduledThreadPoolExecutor, which ensures happens-before between periodic runs of the tasks
    
    * More comments on MapDbCacheDisposer.disposed
    
    * Replace concat with Long.toString()
    
    * Comment on why NamespaceExtractionCacheManager.scheduledExecutorService() returns ScheduledThreadPoolExecutor
    
    * Place logging statements in VersionedCache.close() and CacheHandler.close() after actual closing logic, because logging may fail
    
    * Make JDBCExtractionNamespaceCacheFactory and StaticMapExtractionNamespaceCacheFactory to try to close newly created VersionedCache if population has failed, as it is done already in URIExtractionNamespaceCacheFactory
    
    * Don't close the whole CacheScheduler.Entry, if the cache update task failed
    
    * Replace AtomicLong updateCounter and firstRunLatch with Phaser-based UpdateCounter in CacheScheduler.EntryImpl
    leventov committed with b-slim Dec 24, 2016
Commits on Dec 22, 2016
  1. Add dimension type-based interface for query processing (#3570)

    * Add dimension type-based interface for query processing
    
    * PR comment changes
    
    * Address PR comments
    
    * Use getters for QueryDimensionInfo
    
    * Split DimensionQueryHelper into base interface and query-specific interfaces
    
    * Treat empty rows as nulls in v2 groupby
    
    * Reduce boxing in SearchQueryRunner
    
    * Add GroupBy empty row handling to MultiValuedDimensionTest
    
    * Address PR comments
    
    * PR comments and refactoring
    
    * More PR comments
    
    * PR comments
    jon-wei committed with dclim Dec 22, 2016
Commits on Dec 21, 2016
  1. overlord helpers framework and tasklog auto cleanup (#3677)

    * overlord helpers framework and tasklog auto cleanup
    
    * review comment changes
    
    * further review comments addressed
    himanshug committed with drcrallen Dec 21, 2016
  2. Fix #3795 (Java 7 compatibility). (#3796)

    * Fix #3795 (Java 7 compatibility).
    
    Also introduce Animal Sniffer checks during build, which would
    have caught the original problems.
    
    * Add Animal Sniffer on caffeine-cache for JDK8.
    gianm committed with fjy Dec 21, 2016
  3. Fix reindexing of segments in Google Cloud Storage (#3788)

    Google Cloud Storage allows `:` in paths. For this reason `google` was
    not added to https://github.com/druid-io/druid/blob/da007ca3c2a7acec4b86186de95bc370d6867f6a/indexing-hadoop/src/main/java/io/druid/indexer/JobHelper.java#L585
    Normally this is not an issue but when reindexing segments the Hadoop
    code for `getSplits` trips up on the `:` and returns:
    `Relative path in absolute URI`
    This patch URL encodes the `:` character making it work in Hadoop as
    well.
    erikdubbelboer committed with fjy Dec 21, 2016
Commits on Dec 20, 2016
  1. fix JodaUtils.condenseIntervals(..) to correctly take end or current/…

    …next interval on overlap (#3793)
    
    * remove unused duplicate JodaUtils.java
    
    * fix JodaUtils.condenseIntervals(..) to correctly take end or current/next interval on overlap
    himanshug committed with gianm Dec 20, 2016
  2. fix MetadataStorage binding so that it is always Noop except for coor…

    …dinator iff derby is configured (#3789)
    himanshug committed with fjy Dec 20, 2016