Skip to content

feature: Add hosting package for apache-tika #714

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

Meir017
Copy link

@Meir017 Meir017 commented Jun 6, 2025

**Closes #578 **

Adds Apache Tika hosting support to solution

Integrates Apache Tika document processing capabilities into the Aspire hosting toolkit by adding the main project, test project, and example application to the solution structure.

Enables developers to easily host and configure Apache Tika servers within their Aspire applications for document text extraction and metadata analysis.

follow-ups:

PR Checklist

  • Created a feature/dev branch in your fork (vs. submitting directly from a commit on main)
  • Based off latest main branch of toolkit
  • PR doesn't include merge commits (always rebase on top of our main, if needed)
  • New integration
    • Docs are written
    • Added description of major feature to project description for NuGet package (4000 total character limit, so don't push entire description over that)
  • Tests for the changes have been added (for bug fixes / features) (if applicable)
  • Contains NO breaking changes
  • Every new API (including internal ones) has full XML docs
  • Code follows all style conventions

Other information

Meir017 added 2 commits June 6, 2025 14:24
Replaces empty XML documentation comments with detailed descriptions explaining the purpose, parameters, return values, and configuration details of the Apache Tika server resource.

Improves developer experience by providing clear information about the toolkit's content analysis capabilities and default server configuration including HTTP endpoint, health check, and container image settings.
Copy link
Member

@aaronpowell aaronpowell left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it possible to create an example that shows how you could use this within an application?

Should the resource expose a "connection string" which has the endpoint on it so that a client app can consume that easily?

<Project Sdk="Microsoft.NET.Sdk">

<PropertyGroup>
<AdditionalPackageTags>hosting adminer</AdditionalPackageTags>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
<AdditionalPackageTags>hosting adminer</AdditionalPackageTags>
<AdditionalPackageTags>hosting tika</AdditionalPackageTags>

I assume that's a better set of tags

@aaronpowell aaronpowell added the awaiting response Waiting for the author of the issue to provide more information or answer a question label Jun 12, 2025
@github-actions github-actions bot added the Stale label Jun 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
awaiting response Waiting for the author of the issue to provide more information or answer a question Stale
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add hosting library for apache tika
2 participants