
chore(llm): using nuitka to provide a binary/perf way for the service #242

Merged
imbajin merged 8 commits into apache:main from weijinglin:nk
May 21, 2025
Conversation

@weijinglin
Collaborator

weijinglin commented May 19, 2025

Follow-up to #199: build a new Docker image using Nuitka via Dockerfile.nk.

  • Only simple tests have been conducted so far
  • More complete tests are still needed

close #200

@dosubot dosubot Bot added the size:L This PR changes 100-499 lines, ignoring generated files. label May 19, 2025
@github-actions github-actions Bot added the llm label May 19, 2025
@dosubot dosubot Bot added the dependencies Pull requests that update a dependency file label May 19, 2025
@imbajin imbajin requested a review from Copilot May 20, 2025 03:18
Contributor

Copilot AI left a comment


Pull Request Overview

This PR introduces a new Docker image build process using Nuitka for a binary/embed release along with enhancements to batch text embedding methods across multiple embedding providers.

  • Removed threaded processing in favor of a batched embedding API call in build_semantic_index.py
  • Added new get_texts_embeddings methods with documentation in embedding provider implementations (qianfan, openai, ollama)
  • Updated dependency versions for Ollama in requirements.txt and pyproject.toml, and added a new Dockerfile.nk for the build process
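The summary mentions a new abstract batched-embedding method in base.py but does not show it. As a hedged sketch (the method names come from this PR's file summary; the exact signatures and class name are assumptions), the interface might look like:

```python
from abc import ABC, abstractmethod
from typing import List

class BaseEmbedding(ABC):
    """Common interface for embedding providers (illustrative sketch;
    the real class lives in hugegraph_llm/models/embeddings/base.py)."""

    @abstractmethod
    def get_text_embedding(self, text: str) -> List[float]:
        """Embed a single text into one vector."""

    @abstractmethod
    def get_texts_embeddings(self, texts: List[str]) -> List[List[float]]:
        """Embed a batch of texts in one provider call.

        Must return exactly one vector per input text, in input order,
        so callers can zip the results back to their sources.
        """
```

Each provider (qianfan, openai, ollama) then implements `get_texts_embeddings` against its own batch endpoint, which is what keeps the implementations consistent.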

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 1 comment.

Summary per file:
  • hugegraph_llm/operators/index_op/build_semantic_index.py: Removed ThreadPoolExecutor and switched to the batched embedding API
  • hugegraph_llm/models/embeddings/qianfan.py: Added batched embedding method with new docstring
  • hugegraph_llm/models/embeddings/openai.py: Added batched embedding method with comprehensive docstring
  • hugegraph_llm/models/embeddings/ollama.py: Updated API usage for embeddings and added batched embedding method
  • hugegraph_llm/models/embeddings/base.py: Added abstract method for batched embedding to maintain consistency
  • hugegraph-llm/requirements.txt, pyproject.toml: Updated Ollama dependency version
  • docker/Dockerfile.nk: Added new Dockerfile using Nuitka for a binary/embed release
Comments suppressed due to low confidence (3)

hugegraph_llm/models/embeddings/qianfan.py:55

  • [nitpick] Consider revising the docstring phrasing for clarity, e.g., changing 'Usage refer:' to 'See:' or 'Usage:' to align with standard documentation practices.
        """ Usage refer: https://cloud.baidu.com/doc/WENXINWORKSHOP/s/hlmokk9qn"""

hugegraph_llm/operators/index_op/build_semantic_index.py:40

  • The change from using ThreadPoolExecutor to a direct batched API call assumes that get_texts_embeddings scales well with larger lists. Please verify that this approach performs adequately for high-volume inputs.
embeddings = self.embedding.get_texts_embeddings(vids)
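The reviewer's concern contrasts two shapes of the same work. A minimal before/after sketch (all names except `get_texts_embeddings` are hypothetical; the threaded variant mirrors what ThreadPoolExecutor-based code typically did):

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List

def embed_one(text: str) -> List[float]:
    # Stand-in for a per-text embedding API call (hypothetical).
    return [float(len(text))]

def embed_threaded(vids: List[str]) -> List[List[float]]:
    # Old style: one round-trip per vid, parallelized with threads.
    with ThreadPoolExecutor(max_workers=8) as pool:
        return list(pool.map(embed_one, vids))

def embed_batched(vids: List[str]) -> List[List[float]]:
    # New style: a single batched call amortizes the round-trip, but the
    # whole list is sent at once, so very large inputs may need chunking
    # to stay within provider payload limits.
    return [embed_one(v) for v in vids]  # stand-in for get_texts_embeddings(vids)
```

Both return one vector per vid in the same order; what the reviewer asks to verify is that the single batched request stays fast and within limits for high-volume inputs.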

hugegraph_llm/models/embeddings/ollama.py:43

  • The update from self.client.embeddings to self.client.embed reflects the new API version. Please ensure that the parameters and expected return structure are fully aligned with the updated Ollama documentation.
return list(self.client.embed(model=self.model, input=text)["embeddings"][0])
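The older `client.embeddings(...)` call returned one vector under an `"embedding"` key, while the newer `client.embed(...)` accepts a string or a list and returns a list of vectors under `"embeddings"`, which is why the line above indexes `[0]`. A hedged adapter sketch with an offline stub standing in for the real Ollama client (the response shape is assumed from the line above):

```python
from typing import Any, Dict, List, Union

def get_text_embedding(client: Any, model: str, text: str) -> List[float]:
    # New API: even a single input string comes back as a one-element
    # "embeddings" list, hence the [0].
    return list(client.embed(model=model, input=text)["embeddings"][0])

def get_texts_embeddings(client: Any, model: str, texts: List[str]) -> List[List[float]]:
    # The same endpoint handles batches when input is a list of strings.
    return [list(e) for e in client.embed(model=model, input=texts)["embeddings"]]

class StubClient:
    """Offline stand-in mimicking the assumed ollama.Client response shape."""
    def embed(self, model: str, input: Union[str, List[str]]) -> Dict[str, Any]:
        items = [input] if isinstance(input, str) else input
        return {"embeddings": [[float(len(t))] for t in items]}
```

This is the alignment the reviewer asks to double-check: that the real client's parameter names and return structure match what the new code assumes.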

Comment thread docker/Dockerfile.nk
@weijinglin weijinglin changed the title chore(production): using nk to provide a binary/embed way to release the service build(llm): using nk to provide a binary/embed way to release the service May 21, 2025
@dosubot dosubot Bot added size:M This PR changes 30-99 lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels May 21, 2025
@imbajin imbajin changed the title build(llm): using nk to provide a binary/embed way to release the service chore(llm): using nuitka to provide a binary/perf way for the service May 21, 2025
@github-actions github-actions Bot removed the llm label May 21, 2025
@weijinglin
Collaborator (Author)

[image]
The image produced by Dockerfile.nk

Member

@imbajin imbajin left a comment


better to attach a tree structure for it

merge it & add docs for it later

@dosubot dosubot Bot added the lgtm This PR has been approved by a maintainer label May 21, 2025
@imbajin imbajin merged commit 8887a3b into apache:main May 21, 2025
11 checks passed

Labels

  • dependencies: Pull requests that update a dependency file
  • lgtm: This PR has been approved by a maintainer
  • size:M: This PR changes 30-99 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

provide a binary/embed way to release the service

3 participants