Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Review #2412

Closed
wants to merge 80 commits into from
Closed

Review #2412

wants to merge 80 commits into from

Commits on Mar 6, 2024

  1. Update links [main] (#2358)

    - fix 404s due to openvino link structure change
    - 2023.3 -> 2024 where neccessary
    - spelling fixes
    dkalinowski committed Mar 6, 2024
    Configuration menu
    Copy the full SHA
    c5a2a67 View commit details
    Browse the repository at this point in the history

Commits on Mar 11, 2024

  1. Configuration menu
    Copy the full SHA
    630f3b2 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    8da2890 View commit details
    Browse the repository at this point in the history

Commits on Mar 13, 2024

  1. LLM Text Gen demo - fixes & adjusting to notebook changes (#2367)

    CVS-135106
    ---------
    
    Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>
    dkalinowski and dtrawins committed Mar 13, 2024
    Configuration menu
    Copy the full SHA
    818b4fa View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f1e403d View commit details
    Browse the repository at this point in the history
  3. Python improvements and bugfixes (#2366)

    * validate class and execute method existence, extend pyovms.Tensor constructor, fix finalize not called issue, print with flush in demos
    mzegla committed Mar 13, 2024
    Configuration menu
    Copy the full SHA
    6c4a09a View commit details
    Browse the repository at this point in the history
  4. Documentation for capi benchmark app (#2329)

    Fixed bugs in capi benchmark app, documented and created demo showcasing benchmark app features
    bstrzele committed Mar 13, 2024
    Configuration menu
    Copy the full SHA
    33d8d9f View commit details
    Browse the repository at this point in the history

Commits on Mar 14, 2024

  1. RAG Chatbot demo (#2328)

    ---------
    
    Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>
    dkalinowski and dtrawins committed Mar 14, 2024
    Configuration menu
    Copy the full SHA
    60234af View commit details
    Browse the repository at this point in the history

Commits on Mar 18, 2024

  1. Configuration menu
    Copy the full SHA
    95336a8 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c21f7bb View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    59690c1 View commit details
    Browse the repository at this point in the history

Commits on Mar 19, 2024

  1. Resolve python node todo's (#2344)

    * Resolve python node todo's
    michalkulakowski committed Mar 19, 2024
    Configuration menu
    Copy the full SHA
    ef32570 View commit details
    Browse the repository at this point in the history

Commits on Mar 20, 2024

  1. Configuration menu
    Copy the full SHA
    e9b17ea View commit details
    Browse the repository at this point in the history

Commits on Mar 22, 2024

  1. Update to new calculators validation (#2335)

    * Dump to file flags
    rasapala committed Mar 22, 2024
    Configuration menu
    Copy the full SHA
    34a7551 View commit details
    Browse the repository at this point in the history

Commits on Mar 25, 2024

  1. Configuration menu
    Copy the full SHA
    2af021b View commit details
    Browse the repository at this point in the history

Commits on Mar 26, 2024

  1. PR testing changes (#2323)

    smart building depending on the content
    parallel tests execution
    build performance optimization
    dtrawins committed Mar 26, 2024
    Configuration menu
    Copy the full SHA
    a4aa7f3 View commit details
    Browse the repository at this point in the history

Commits on Mar 29, 2024

  1. Quieten boost (#2379)

    By having a verbose flag, it creates ~67,000 lines of messages in the build
    logs just for unpacking the boost tar file. This makes it challenging to
    audit the build process.
    stevegrubb committed Mar 29, 2024
    Configuration menu
    Copy the full SHA
    ca5934e View commit details
    Browse the repository at this point in the history

Commits on Apr 2, 2024

  1. Integrate openvino_tokenizers repo (#2381)

    CVS-135755
    dkalinowski committed Apr 2, 2024
    Configuration menu
    Copy the full SHA
    f76dedc View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    602500d View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    296f53a View commit details
    Browse the repository at this point in the history

Commits on Apr 3, 2024

  1. Configuration menu
    Copy the full SHA
    42de1f3 View commit details
    Browse the repository at this point in the history

Commits on Apr 4, 2024

  1. Configuration menu
    Copy the full SHA
    2cee117 View commit details
    Browse the repository at this point in the history

Commits on Apr 8, 2024

  1. Configuration menu
    Copy the full SHA
    8286d31 View commit details
    Browse the repository at this point in the history
  2. Allow flag injection to pugixml (#2390)

    * Allow flag injection to pugixml
    
    This commit contains a patch that adds the variables for the CXX and linker
    flags to the CMakeLists.txt file. It then uses the patch during build so that
    later we can inject build flags on the cmake command.
    
    * exclude header check
    * fix dockerfile sequence
    * set ubi as the default base image
    
    ---------
    
    Co-authored-by: Steve Grubb <ausearch.1@gmail.com>
    dtrawins and stevegrubb committed Apr 8, 2024
    Configuration menu
    Copy the full SHA
    7af4177 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e5dd40c View commit details
    Browse the repository at this point in the history

Commits on Apr 9, 2024

  1. Add string output demo (#2378)

    * Add string output demo
    michalkulakowski committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    a959b8e View commit details
    Browse the repository at this point in the history
  2. Fix row named format for binary inputs in REST API & fix logging for …

    …MediaPipe stream info (#2395)
    mzegla committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    10667af View commit details
    Browse the repository at this point in the history
  3. Upgraded gradio (#2392)

    bstrzele committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    067d241 View commit details
    Browse the repository at this point in the history
  4. Full support of KServe Api for Mediapipe

    * Add support of _contents fields in KServe request input for mediapipe for all deserialization paths
    
    ---------
    
    Co-authored-by: atobisze <adrian.tobiszewski@intel.com>
    michalkulakowski and atobiszei committed Apr 9, 2024
    Configuration menu
    Copy the full SHA
    6058b37 View commit details
    Browse the repository at this point in the history

Commits on Apr 11, 2024

  1. [DOCS] Fixing references for docs (#2402)

    * Fixing references
    
    * Fix internal link
    sgolebiewski-intel committed Apr 11, 2024
    Configuration menu
    Copy the full SHA
    ba6e878 View commit details
    Browse the repository at this point in the history
  2. Expose compiler flags (#2380)

    * Expose compiler flags
    
    By default, cmake hides the build process. For auditing purposes, the build
    flags need to be exposed. This adds VERBOSE=1 to make commands to get the
    desired verbosity.
    
    * Use -DCMAKE_VERBOSE_MAKEFILE=ON
    
    Previous patch was using VERBOSE=1 which does the trick. But for consistency,
    let's use -DCMAKE_VERBOSE_MAKEFILE=ON wherever possible.
    
    ---------
    
    Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>
    stevegrubb and dtrawins committed Apr 11, 2024
    Configuration menu
    Copy the full SHA
    d7fca36 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    18543aa View commit details
    Browse the repository at this point in the history

Commits on Apr 12, 2024

  1. Fix cpprestsdk linker flag injection bug (#2383)

    cpprestsdk has a bug where it tries to override LD_FLAGS. With cmake, this is
    not the right thing to do. You have to use a targeted CMAKE_XXX_LINKER_FLAGS
    variable where XXX is what you want to affect. Fortunately this repo patches
    the same line and we can correct this design mistake. The patch changes
    LD_FLAGS to CMAKE_SHARED_LINKER_FLAGS since we are building a shared object.
    You can now see that the flag originally trying to be patched in is visible
    in the linking of this part of the build.
    stevegrubb committed Apr 12, 2024
    Configuration menu
    Copy the full SHA
    5c2455d View commit details
    Browse the repository at this point in the history
  2. Added handling of relative paths (#2384)

    Co-authored-by: Dariusz Trawinski <Dariusz.Trawinski@intel.com>
    bstrzele and dtrawins committed Apr 12, 2024
    Configuration menu
    Copy the full SHA
    73c3387 View commit details
    Browse the repository at this point in the history

Commits on Apr 15, 2024

  1. Configuration menu
    Copy the full SHA
    5c754fe View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    bc334b6 View commit details
    Browse the repository at this point in the history

Commits on Apr 16, 2024

  1. Configuration menu
    Copy the full SHA
    6b33569 View commit details
    Browse the repository at this point in the history
  2. Updated llm demo to report number of generated tokens in the last res…

    …ponse (#2373)
    
    Co-authored-by: Dariusz Trawinski <Dariusz.Trawinski@intel.com>
    bstrzele and dtrawins committed Apr 16, 2024
    Configuration menu
    Copy the full SHA
    8435706 View commit details
    Browse the repository at this point in the history

Commits on Apr 17, 2024

  1. Mkulakow/mediapipe kfs rest test (#2377)

    * Add unit tests for KFS REST Mediapipe support
    
    * Add deserialization of json string input to OVMS pytensor
    michalkulakowski committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    a300728 View commit details
    Browse the repository at this point in the history
  2. Coverity fixes (#2411)

    *Fix uninitialized value issue in pytensor
    *Fix minor leak in Mediapipe fork
    
    Changes in Mediapipe fork:
    openvinotoolkit/mediapipe#72
    
    ID:CVS-136076
    atobiszei committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    0c2fe0e View commit details
    Browse the repository at this point in the history

Commits on Apr 18, 2024

  1. Configuration menu
    Copy the full SHA
    5f62713 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    bb43de2 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    5f79324 View commit details
    Browse the repository at this point in the history

Commits on Apr 22, 2024

  1. Templatize graph execution (both unary and stream) (#2407)

    CVS-138030
    
    The amount of LOC changed is significantly smaller than reported by GitHub due to the fact that a lot of the code is moved from mediapipegraphexecutor.cpp to kfs_graph_executor_impl.cpp.
    
    What really has changed:
    
    * MediapipeGraphExecutor:: infer & inferStream are templatized and do not rely on gRPC KServe types at all
    * Added gRPC KServe specialization for MP graph execution (in kfs_graph_executor_impl.h/cpp)
    * Two types of packet holders are no longer needed - if pasing the ownership to packet is not needed, just simply passing shared_ptr with no-op destructor.
    * Previous unit tests are re-used.
    
    In next PRs: specialization for HTTP types will be added
    dkalinowski committed Apr 22, 2024
    Configuration menu
    Copy the full SHA
    0e872ad View commit details
    Browse the repository at this point in the history

Commits on Apr 24, 2024

  1. print_summary option to benchmark_client to parse benchmark results. (#…

    …2397)
    
    * Add option to summarize benchmark results
    
    * Add sample results summary with option -ps
    
    Command argument to view summary of benchmarking results
    
    * Additional metrics to results summary
    
    * print_summary -ps usage and sample output
    
    * Apply suggestions from code review
    
    * Apply suggestions from code review
    
    Co-authored-by: Katarzyna Papis <katarzyna.papis@intel.com>
    
    ---------
    
    Co-authored-by: Ravi Panchumarthy <ravi.panchumarthy@intel.com>
    Co-authored-by: Katarzyna Papis <katarzyna.papis@intel.com>
    3 people committed Apr 24, 2024
    Configuration menu
    Copy the full SHA
    de7be1e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    dfd1316 View commit details
    Browse the repository at this point in the history
  3. [main] Remove GPT-J demo and blingfire tokenizer custom node (#2421)

    Deprecated in previous release, removing now. Use python based LLM demos.
    
    + removing example tokenizer custom node, since we have tokenizer OV models now.
    
    CVS-136786
    dkalinowski committed Apr 24, 2024
    Configuration menu
    Copy the full SHA
    e52d12c View commit details
    Browse the repository at this point in the history
  4. TFHub->Kaggle (#2427)

    Changing the demos and docs since the models were removed from TFHub and added to Kaggle.
    
    CVS-139265
    dkalinowski committed Apr 24, 2024
    Configuration menu
    Copy the full SHA
    a5fb58a View commit details
    Browse the repository at this point in the history

Commits on Apr 26, 2024

  1. rag demo with online scope changes (#2413)

    Co-authored-by: ngrozae <104074686+ngrozae@users.noreply.github.com>
    Co-authored-by: Damian Kalinowski <damian.kalinowski@intel.com>
    3 people committed Apr 26, 2024
    Configuration menu
    Copy the full SHA
    1cb8a17 View commit details
    Browse the repository at this point in the history

Commits on Apr 29, 2024

  1. Configuration menu
    Copy the full SHA
    e0972e1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f8b06ec View commit details
    Browse the repository at this point in the history

Commits on May 6, 2024

  1. Init PR checkbox (#2375)

    rasapala committed May 6, 2024
    Configuration menu
    Copy the full SHA
    6a7b271 View commit details
    Browse the repository at this point in the history

Commits on May 15, 2024

  1. universal_and_benchmark_documentation_updates (#2433)

    * universal_and_benchmark_documentation_updates
    
    * no proxy update
    
    * update benchmark proxy
    
    * add version to ubuntu tag
    
    * revert ubuntu changes
    
    * added localhost
    
    * review
    pgladkows committed May 15, 2024
    Configuration menu
    Copy the full SHA
    ff46b1d View commit details
    Browse the repository at this point in the history

Commits on May 16, 2024

  1. RAG python demo with online scope changes (#2434)

    * dockerfile for gradio
    * monitoring changes in the documents scope
    * preinstall nltk modules
    * default security context set to ovms account
    * improvements in rag demo
    dtrawins committed May 16, 2024
    Configuration menu
    Copy the full SHA
    0b95dd5 View commit details
    Browse the repository at this point in the history

Commits on May 17, 2024

  1. Implement /v3/chat/completions endpoint - forward to MP graph (#2436)

    CVS-138032
    
    Implementation of /v3/chat/completions endpoint and forwarding the HTTP message to MediaPipe graph.
    The data is std::string now, to be adjusted in following tasks (CVS-139240/CVS-140684).
    dkalinowski committed May 17, 2024
    Configuration menu
    Copy the full SHA
    37d69bf View commit details
    Browse the repository at this point in the history
  2. CVS-137992_fix_tests_deadline_exceeded_dg2 (#2440)

    * CVS-137992_fix_deadline_exceeded_dg2
    
    * add retry for get_model_metadata_request
    
    * add get_model_metadata function
    
    * fix test names
    
    * increase timeout for GetModelStatus
    ngrozae committed May 17, 2024
    Configuration menu
    Copy the full SHA
    3d34b77 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    7947c57 View commit details
    Browse the repository at this point in the history

Commits on May 21, 2024

  1. Configuration menu
    Copy the full SHA
    19872ee View commit details
    Browse the repository at this point in the history

Commits on May 22, 2024

  1. Configuration menu
    Copy the full SHA
    787dddc View commit details
    Browse the repository at this point in the history
  2. Mkulakow/chat completion request conversion (#2443)

    https://jira.devtools.intel.com/browse/CVS-139240
    Implementation of chat completion request conversion to HttpPayload struct.
    michalkulakowski committed May 22, 2024
    Configuration menu
    Copy the full SHA
    b9ae161 View commit details
    Browse the repository at this point in the history

Commits on May 23, 2024

  1. Configuration menu
    Copy the full SHA
    916bf33 View commit details
    Browse the repository at this point in the history

Commits on May 24, 2024

  1. Configuration menu
    Copy the full SHA
    57735ef View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    39a0aba View commit details
    Browse the repository at this point in the history

Commits on May 27, 2024

  1. Fix ovms status to http status conversion (#2410)

    * Fix ovms status to http status conversion
    michalkulakowski committed May 27, 2024
    Configuration menu
    Copy the full SHA
    48f6e55 View commit details
    Browse the repository at this point in the history
  2. add-version-to-ubuntu-os (#2438)

    * add-version-to-ubuntu-os
    
    * fix ovms_pkg link
    
    * BASE_OS_DISTRO
    
    * ovms_pkg os
    
    * updates
    
    * DIST_OS added
    
    * adjust nginx build
    
    * fix nginx
    
    * Update Makefile
    
    Co-authored-by: ngrozae <104074686+ngrozae@users.noreply.github.com>
    
    * Update Makefile
    
    Co-authored-by: ngrozae <104074686+ngrozae@users.noreply.github.com>
    
    ---------
    
    Co-authored-by: ngrozae <104074686+ngrozae@users.noreply.github.com>
    pgladkows and ngrozae committed May 27, 2024
    Configuration menu
    Copy the full SHA
    c7bf496 View commit details
    Browse the repository at this point in the history

Commits on May 29, 2024

  1. Configuration menu
    Copy the full SHA
    59c5886 View commit details
    Browse the repository at this point in the history
  2. LLM calculator compliant with /v3/chat/completions (#2452)

    CVS-139231/CVS-139233
    
    This introduces LLM calculator that accepts HTTP OpenAI /v3/chat/completion requests and produces compliant responses.
    Working in both - unary and streaming modes.
    Bunch of parameters are still marked as TODO, but should be enough to perform benchmarks.
    
    Minimal demo description how to run.
    dkalinowski committed May 29, 2024
    Configuration menu
    Copy the full SHA
    1390189 View commit details
    Browse the repository at this point in the history

Commits on May 31, 2024

  1. Configuration menu
    Copy the full SHA
    f08caa1 View commit details
    Browse the repository at this point in the history
  2. update ov (#2465)

    dtrawins committed May 31, 2024
    Configuration menu
    Copy the full SHA
    f3be238 View commit details
    Browse the repository at this point in the history

Commits on Jun 3, 2024

  1. Configuration menu
    Copy the full SHA
    746ed14 View commit details
    Browse the repository at this point in the history

Commits on Jun 4, 2024

  1. Adding LLM scheduler config from graph options. (#2455)

    * Add scheduler config in graph options
    * Fix centos stream-8
    rasapala committed Jun 4, 2024
    Configuration menu
    Copy the full SHA
    b991f9b View commit details
    Browse the repository at this point in the history

Commits on Jun 6, 2024

  1. added documentation about openai api (#2460)

    ---------
    
    Co-authored-by: Miłosz Żeglarski <milosz.zeglarski@intel.com>
    Co-authored-by: ngrozae <104074686+ngrozae@users.noreply.github.com>
    3 people committed Jun 6, 2024
    Configuration menu
    Copy the full SHA
    1637183 View commit details
    Browse the repository at this point in the history

Commits on Jun 7, 2024

  1. Add handling for missing OpenAI parameters + profiling traces (#2468)

    CVS-142768
    
    Forwards beam search and multinomial sampling parameters to CB library - this enables returning more than 1 completion for beam search (only for unary)
    Adds profiling traces (minitrace)
    dkalinowski committed Jun 7, 2024
    Configuration menu
    Copy the full SHA
    0c23278 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    122a163 View commit details
    Browse the repository at this point in the history

Commits on Jun 11, 2024

  1. Add http payload conversion UTs (#2461)

    * Add UTs for llm request conversion
    michalkulakowski committed Jun 11, 2024
    Configuration menu
    Copy the full SHA
    019e35d View commit details
    Browse the repository at this point in the history
  2. update ov to 2024.2 rc2 (#2481)

    * fix tbb handling for ubuntu20
    dtrawins committed Jun 11, 2024
    Configuration menu
    Copy the full SHA
    762ea3f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    90dc2dd View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    5c73d1b View commit details
    Browse the repository at this point in the history
  5. Text streamer fix (#2483)

    There is an issue (or feature?) that adding generated token to the token cache produces shorter message than previous without newly generated one.
    
    TextStreamer did not expect such behavior.
    
    The fix ignores such event and makes the generation wait for the next tokens.
    
    + reducing number of response chunks by adding requirement so that chunk needs to include space in order to send cache to the client
    dkalinowski committed Jun 11, 2024
    Configuration menu
    Copy the full SHA
    a0bca87 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    bdd225f View commit details
    Browse the repository at this point in the history