pre-refactor: Add orchestrator.types module with common types #326

Open · wants to merge 13 commits into main from types

Conversation

@declark1 declark1 (Collaborator) commented Mar 4, 2025

No. 2 of a few small pre-refactor PRs. This PR adds a new orchestrator.types module, including new abstractions used in the refactored code.

types.chunk

  • Chunk
    • An internal chunk, used for both unary and streaming
  • Chunks
    • A newtype wrapping a Vec of Chunks (rough sketch below)
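
As a rough sketch of the shape (the fields shown here are illustrative assumptions, not the actual definitions; only the Chunk/Chunks names and the newtype-over-Vec idea come from this PR):

```rust
/// An internal chunk, used for both unary and streaming (fields assumed).
pub struct Chunk {
    /// Index of the message this chunk was produced from.
    pub message_index: u32,
    /// Start offset of the chunk within the original text.
    pub start: usize,
    /// End offset of the chunk within the original text.
    pub end: usize,
    /// The chunk text itself.
    pub text: String,
}

/// A newtype wrapping a Vec of Chunks.
pub struct Chunks(pub Vec<Chunk>);
```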

types.detection

  • Detection
    • An internal detection, used for all detector types
  • Detections
    • A newtype wrapping a Vec of Detections (rough sketch below)
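
The same pattern applies here; a minimal sketch (field names are assumptions), including the kind of Deref impl a newtype like this would typically provide:

```rust
use std::ops::Deref;

/// An internal detection, used for all detector types (fields assumed).
pub struct Detection {
    /// ID of the detector that produced this detection.
    pub detector_id: String,
    /// Confidence score reported by the detector.
    pub score: f64,
}

/// A newtype wrapping a Vec of Detections.
pub struct Detections(pub Vec<Detection>);

impl Deref for Detections {
    type Target = Vec<Detection>;

    fn deref(&self) -> &Self::Target {
        &self.0
    }
}
```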

types.detection_batch_stream

  • DetectionBatchStream
    • A stream adapter that wraps multiple detection streams and produces a stream of batches using one of the pluggable DetectionBatcher implementations below.
    • More details to be added to this PR explaining the rationale and how this works; a rough shape sketch follows below.
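
A shape sketch only (the generics and fields here are assumptions; the real adapter likely differs):

```rust
use futures::stream::{SelectAll, Stream};

/// Wraps multiple detection streams and produces a stream of batches
/// using a pluggable batcher (see the DetectionBatcher trait below).
pub struct DetectionBatchStream<S, B>
where
    S: Stream + Unpin,
{
    /// All inner per-detector streams, merged into a single stream.
    streams: SelectAll<S>,
    /// Pluggable logic that decides when a batch is ready.
    batcher: B,
}
```

The Stream impl would poll the merged inner streams, push each incoming detection into the batcher, and yield a batch whenever the batcher reports one is ready.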

types.detection_batcher

  • DetectionBatcher
    • A trait to implement pluggable batching logic for DetectionBatchStream (sketched after this list)
  • MaxProcessedIndexBatcher
    • A DetectionBatcher implementation based on the existing "max processed index" aggregator
  • ChatCompletionBatcher
    • A DetectionBatcher implementation for chat completions (placeholder, not yet implemented)
  • NoopBatcher
    • A DetectionBatcher implementation that doesn't actually batch (no-op, for testing purposes)
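
A sketch of the trait, inferred from the description above and the review snippets below; only `push` taking an `input_id` is visible in this PR, so the rest of the signatures are assumptions:

```rust
/// From the review thread below: input_id is a u32 with a purposely
/// generic name.
pub type InputId = u32;

/// Pluggable batching logic for DetectionBatchStream.
pub trait DetectionBatcher {
    /// The batch type this batcher produces.
    type Batch;

    /// Pushes new detections into the batcher's internal state
    /// (parameters after input_id are assumed).
    fn push(&mut self, input_id: InputId, detections: Detections);

    /// Removes and returns a batch, if one is ready (assumed method).
    fn pop_batch(&mut self) -> Option<Self::Batch>;
}
```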

types.chat_message

  • ChatMessage
    • An internal representation of an OpenAI chat message, which can be a request message or a completion choice. This is essentially the same idea as the existing ChatMessageInternal, except it is text-only for now (keeping it simple, as we don't currently support images or audio) and it holds references instead of owned values to avoid copying the original text, hence the lifetimes.
  • ChatMessageIterator
    • An iterator over ChatMessages (see the sketch after this list)
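
A sketch of the borrowed design (the openai stub below stands in for the real client types; the field sets and the iterator's internals are assumptions):

```rust
/// Stub standing in for the real OpenAI client types (assumed shapes).
mod openai {
    pub enum Role {
        System,
        User,
        Assistant,
    }

    pub struct Message {
        pub role: Role,
        pub content: Option<String>,
    }
}

/// An internal OpenAI chat message holding references, not owned values.
pub struct ChatMessage<'a> {
    /// The role of the message author.
    pub role: Option<&'a openai::Role>,
    /// The text contents of the message.
    pub text: Option<&'a str>,
}

/// An iterator over ChatMessages, borrowing from the underlying messages.
pub struct ChatMessageIterator<'a> {
    messages: std::slice::Iter<'a, openai::Message>,
}

impl<'a> Iterator for ChatMessageIterator<'a> {
    type Item = ChatMessage<'a>;

    fn next(&mut self) -> Option<Self::Item> {
        // Borrow role and text; no text is copied.
        self.messages.next().map(|message| ChatMessage {
            role: Some(&message.role),
            text: message.content.as_deref(),
        })
    }
}
```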

… plus several type aliases.

@declark1 declark1 marked this pull request as draft March 4, 2025 01:31
@declark1 declark1 marked this pull request as ready for review March 4, 2025 19:21
@declark1 declark1 force-pushed the types branch 4 times, most recently from ba01ed7 to 24a125c on March 5, 2025 19:08
@evaline-ju evaline-ju (Collaborator) left a comment

Mostly looks good - a couple questions

pub struct ChatMessage<'a> {
    pub role: Option<&'a openai::Role>,
    /// The text contents of the message.
    pub text: Option<&'a str>,
}
Collaborator

Did we want to track a potential assistant refusal? (Otherwise, what happens on a refusal?)

Collaborator Author

We aren't doing anything with refusal, so I didn't see a reason to include it here, although I can add it if needed.

Collaborator

yeah, refusal isn't getting used currently, so it's fine for it not to be in this object. And whenever it gets more widely used, we might need to take different "actions" on it (speculating).

The intended use of this internal object is only for internal processing, like calling out to detectors, etc., and it doesn't affect the output a user will see. So refusal not being in here doesn't mean it can't be in the final output to the user. Is that accurate @declark1?

@declark1 declark1 (Collaborator Author) commented Mar 6, 2025

Correct @gkumbhat. Since this ChatMessage type is just references, I'll go ahead and add the field even though we don't need it now.

Collaborator

I see, yes, my main concern had been whether this would still be reflected in the final result if a refusal were to occur on completions.


The primary issue with these components is that they were designed specifically for the *Streaming Classification With Generation* task and lack the flexibility to be extended to additional streaming use cases that require batching detections, e.g.
- A use case may require different batching logic
- A use case may need to use different containers to implement its batching logic
Collaborator

I'm not sure I understand the use case described here - when would separate or different containers be used for batching?

@declark1 declark1 (Collaborator Author) commented Mar 5, 2025

e.g. the chat completions batching logic will likely not use a BTreeMap as we do for the MaxProcessedIndexBatcher. The idea here is that the implementation details of the batcher, such as how it stores state and the types it uses internally, should be flexible.
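
For illustration, the internal state might look something like this (hypothetical fields, just to show the point about flexibility):

```rust
use std::collections::BTreeMap;

pub struct MaxProcessedIndexBatcher {
    /// Pending detections, ordered by chunk index (assumed layout).
    pending: BTreeMap<usize, Detections>,
    /// Highest index that every detector has fully processed (assumed).
    max_processed_index: usize,
}
```

A ChatCompletionBatcher would be free to store its state in whatever form suits choice-based batching instead.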

Collaborator

I think the use of containers was a bit confusing here then, more like struct / data structure usage?

/// Pushes new detections.
fn push(
    &mut self,
    input_id: InputId,
Collaborator

I'm forgetting a bit what 'input' refers to for the id here - could we document this?

@declark1 declark1 (Collaborator Author) commented Mar 5, 2025

input_id: u32 was added to support new requirements for chat completion streaming and purposely has a generic name. Its usage here isn't clear because the code where it's relevant isn't added yet in this PR.

For each chat completion chunk received, we have a message_index indicating the message position in the stream (which is passed through to the chunker). The completion can contain multiple choices, so we also have a secondary choice_index. Each choice's content is handled independently with its own chunk->detection pipeline, so we need to track both throughout the pipeline. Previously, we only needed to track message_index (although I believe the TGIS API is able to produce multiple generations too).

So, the question becomes: how do we name this additional index that needs to be tracked but not sent through to the chunker, and therefore isn't one of the indices in Chunk? I didn't want to name it choice_index, as that is specific to chat completions and this is common code. I figured "input" would work, but the term "index" is a bit overloaded throughout the code, so I went with input_id. For generation streaming, it will always be 0. For chat completion streaming, it will correspond to the choice_index, which will also often be 0, except when the user explicitly requests n > 1.

This is primarily needed for the batching logic in ChatCompletionBatcher and I have a placeholder comment there noting that it will map to choice_index.
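
To make the mapping concrete, a sketch (StreamKind and its variants are hypothetical names, used only for illustration):

```rust
enum StreamKind {
    Generation,
    ChatCompletion { choice_index: u32 },
}

fn input_id(kind: &StreamKind) -> u32 {
    match kind {
        // Generation streaming tracks a single generation.
        StreamKind::Generation => 0,
        // Each chat completion choice gets its own chunk->detection
        // pipeline; usually 0 unless the user requests n > 1.
        StreamKind::ChatCompletion { choice_index } => *choice_index,
    }
}
```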

Collaborator

I think a comment just to indicate that it refers to choice_index / message_index would be helpful
