feature: Embed Service #182

JNygaard-Skylight · 2025-12-10T16:54:06Z

Description

Creates a simple "embed" service with a single function.

Related Issues

Closes #157

codecov-commenter · 2025-12-10T16:57:47Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 93.61%. Comparing base (da2d612) to head (69d000b).

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #182   +/-   ##
=======================================
  Coverage   93.61%   93.61%           
=======================================
  Files           9        9           
  Lines         407      407           
=======================================
  Hits          381      381           
  Misses         26       26

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

src/dibbs_text_to_code/configs.py

BradySkylight · 2025-12-10T19:19:38Z

src/dibbs_text_to_code/services/embed.py

+
+def embed(input_text: str) -> Tensor:
+    """Embed text."""
+    return model.encode(input_text)


This is awesome and simple! Should we update the doc string to be a bit more descriptive? This may only be relevant for maybe our actual APIs though....but just checking.

Definitely want to add more to the doc string. It's fairly standard to include the parameters and expected output.

m-goggins · 2025-12-10T19:35:29Z

src/dibbs_text_to_code/services/embed.py

+
+def embed(input_text: str) -> Tensor:
+    """Embed text."""
+    return model.encode(input_text)


Definitely want to add more to the doc string. It's fairly standard to include the parameters and expected output.

m-goggins · 2025-12-10T19:37:35Z

src/dibbs_text_to_code/services/embed.py

+
+from dibbs_text_to_code.configs import MODEL_NAME
+
+model = SentenceTransformer(MODEL_NAME)


We likely want to refactor this into a lazy load to make the API and lambda cold start a little faster. Up to you if you want to do that in this PR or another.

I can tackle this in a different PR with an update to the doc strings (see comment below).

src/dibbs_text_to_code/services/text_processor.py

m-goggins · 2025-12-10T19:50:03Z

src/dibbs_text_to_code/services/text_processor.py

@@ -0,0 +1,40 @@
+from dibbs_text_to_code import configs
+


I do think you should add at least one test to show we've returned a tensor object that is the same dimensions as the model expects (768 for Qwen). I could be wrong, but I don't think you'll have to mock anything as long as you load this in the test file from embeddings import embed

I like this idea as well!

…s docstrings to be on a single continuous line with no returns

BradySkylight and others added 14 commits December 8, 2025 16:51

Initial Check-in

689790f

Added some tests for the Config

a0f8f25

Update extraction.py

7cd219a

some refactoring to make things more clear

67ba85c

remove print statements

90f8937

Update extraction.py

b6acabe

Merge branch 'main' into brady/156/non-standard-text-quality-check

7049f99

Merge branch 'main' into brady/156/non-standard-text-quality-check

46a5d8f

moved things around and did some fixes

5e1cfa8

Update extraction.py

b536b6d

Resolve imports

1053e01

fixed a few issues based upon review comments

0f0ee1b

embedding service

8569150

Merge branch 'main' into josh/173/embed-text-1

ebcb498

add missing init file

7842e29

JNygaard-Skylight marked this pull request as ready for review December 10, 2025 19:11

JNygaard-Skylight requested review from BradySkylight, bamader, m-goggins and robertandremitchell as code owners December 10, 2025 19:11

BradySkylight reviewed Dec 10, 2025

View reviewed changes

src/dibbs_text_to_code/configs.py Show resolved Hide resolved

BradySkylight reviewed Dec 10, 2025

View reviewed changes

m-goggins reviewed Dec 10, 2025

View reviewed changes

JNygaard-Skylight added 2 commits December 10, 2025 14:48

add parameter and returns

4f0277d

add additional docs rules and conform to Google's style.

69d000b

m-goggins reviewed Dec 10, 2025

View reviewed changes

BradySkylight added 2 commits December 11, 2025 19:15

some refactoring and adding doc strings

54f5162

reverting some of the linting changes that broke everything and force…

0b678c3

…s docstrings to be on a single continuous line with no returns

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feature: Embed Service #182

feature: Embed Service #182

Uh oh!

JNygaard-Skylight commented Dec 10, 2025

Uh oh!

codecov-commenter commented Dec 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

BradySkylight Dec 10, 2025

Uh oh!

m-goggins Dec 10, 2025

Uh oh!

m-goggins Dec 10, 2025

Uh oh!

m-goggins Dec 10, 2025

Uh oh!

BradySkylight Dec 10, 2025

Uh oh!

Uh oh!

m-goggins Dec 10, 2025

Uh oh!

BradySkylight Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants


		from dibbs_text_to_code.configs import MODEL_NAME

		model = SentenceTransformer(MODEL_NAME)

feature: Embed Service #182

Are you sure you want to change the base?

feature: Embed Service #182

Uh oh!

Conversation

JNygaard-Skylight commented Dec 10, 2025

Description

Related Issues

Uh oh!

codecov-commenter commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

BradySkylight Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

m-goggins Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

m-goggins Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

m-goggins Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

BradySkylight Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

m-goggins Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

BradySkylight Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

codecov-commenter commented Dec 10, 2025 •

edited

Loading