refine plan

2025-08-16 16:11:02 +00:00 · 2025-07-29 14:17:00 -04:00 · 2025-07-29 14:17:00 -04:00 · 174b3d6537
commit 174b3d6537
parent 735f264654
1 changed files with 173 additions and 1 deletions
--- a/v1_implementation_plan_v2.md
+++ b/v1_implementation_plan_v2.md
@ -366,7 +366,7 @@ text_block_with_extras: TextContentBlock = {
 **Extra Item Types:** Standard content blocks support extra keys via PEP 728 with `extra_items=Any`, meaning provider-specific fields can be of any type. This provides:
 - **Core fields** (like `type`, `text`, `id`) are **strongly typed** according to the TypedDict definition
- **Extra fields** (provider-specific extensions) are typed as `Any`, allowing complete flexibility  
+- **Extra fields** (provider-specific extensions) are typed as `Any`, allowing complete flexibility
 - **Type safety** is maintained for the standard fields while allowing arbitrary extensions
 This is the most flexible approach - providers can add any kind of metadata, configuration, or custom data they need without breaking the type system or requiring changes to the core LangChain types.
@ -633,3 +633,175 @@ When implementing v1 support for your specific provider:
 - Ensure streaming chunks are properly typed as `AIMessageChunkV1`
 - Implement proper chunk merging using the `+` operator
 - Handle provider-specific streaming features (like reasoning) appropriately
 ```
 ## Common Implementation Mistakes & Solutions
 ### Message Structure Assumptions
 **❌ Mistake**: Assuming `ChatMessage` exists in v1 messages
 ```python
 from langchain_core.messages.v1 import ChatMessage as ChatMessageV1  # DOES NOT EXIST
 ```
 **✅ Solution**: V1 only has `AIMessage`, `HumanMessage`, `SystemMessage`, `ToolMessage`, `AIMessageChunk`
 ```python
 from langchain_core.messages.v1 import (
    AIMessage as AIMessageV1,
    HumanMessage as HumanMessageV1,
    SystemMessage as SystemMessageV1,
    ToolMessage as ToolMessageV1,
    AIMessageChunk as AIMessageChunkV1,
    MessageV1,  # Union type for all message types
 )
 ```
 ### Reasoning Content Handling
 **❌ Mistake**: Storing reasoning content in `additional_kwargs` or `response_metadata`
 V1 messages don't have `additional_kwargs`
 **✅ Solution**: Use `ReasoningContentBlock` in the content list
 ```python
 from langchain_core.messages.content_blocks import ReasoningContentBlock
 if reasoning_content:
    content.append(ReasoningContentBlock(
        type="reasoning",
        reasoning=reasoning_content,
    ))
 ```
 Any other reasoning-related metadata should be stored as extra items in the content block. For instance, thought signatures or effort.
 ### ResponseMetadata Type Safety
 **❌ Mistake**: Directly assigning provider-specific fields to ResponseMetadata
 ```python
 response_metadata["created_at"] = response["created_at"]  # Type error
 ```
 **✅ Solution**: Use type ignores for extra fields (allowed by PEP 728)
 ```python
 response_metadata["created_at"] = response["created_at"]  # type: ignore[typeddict-unknown-key]
 ```
 ### Streaming Implementation
 **❌ Mistake**: Returning wrong message type from streaming methods
 ```python
 def _generate_stream(self) -> Iterator[AIMessageChunkV1]:
    # ...
    ai_message = _convert_to_v1_from_ollama_format(response)  # Returns AIMessageV1
    yield ai_message  # Type error: expected AIMessageChunkV1
 ```
 **✅ Solution**: Convert AIMessageV1 to AIMessageChunkV1 for streaming
 ```python
 ai_message = _convert_to_v1_from_ollama_format(response)
 chunk = AIMessageChunkV1(
    content=ai_message.content,
    response_metadata=ai_message.response_metadata,
    usage_metadata=ai_message.usage_metadata,
 )
 yield chunk
 ```
 ### Generator Type Issues
 **❌ Mistake**: Using `.get()` on ContentBlock without type conversion
 **Why is this a problem?** While ContentBlock is a Union of TypedDict classes (like `TextContentBlock`, `ToolCall`, etc.), mypy treats accessing fields on the union as returning `Any` type because it can't guarantee which specific TypedDict variant you have. When you pass this to `"".join()`, mypy expects `Iterable[str]` but gets `Iterable[Any]`, causing a type error.
 ```python
 text_content = "".join(
    block.get("text", "")  # Returns Any, treated as object in strict mode
    for block in chunk.content
    if block.get("type") == "text"
 )
 ```
 **✅ Solution**: Explicitly convert to string
 ```python
 text_content = "".join(
    str(block.get("text", ""))
    for block in chunk.content
    if block.get("type") == "text"
 )
 ```
 ### Test Type Annotations
 **❌ Mistake**: Missing return type annotations in tests
 ```python
 def test_something(self):  # mypy error: no return type
    pass
 ```
 **✅ Solution**: Add `-> None` to all test methods
 ```python
 def test_something(self) -> None:
    pass
 ```
 ### Performance Optimizations
 **❌ Mistake**: Using append() in loops for list construction
 ```python
 for tool_call in tool_calls:
    content.append(ToolCall(...))  # Flagged by PERF401
 ```
 **✅ Solution**: Use list comprehension with extend()
 ```python
 content.extend([
    ToolCall(
        type="tool_call",
        id=tool_call.get("id", str(uuid4())),
        name=tool_call["function"]["name"],
        args=tool_call["function"]["arguments"],
    )
    for tool_call in tool_calls
 ])
 ```
 ## Implementation Debugging Tips
 ### Type Checking Issues
 1. Run `mypy` early and often - type issues compound quickly
 2. Use `# type: ignore[specific-error]` comments for intentional violations
 3. Check `ResponseMetadata` usage - it allows extra keys but mypy doesn't know this
 ### Import Issues
 1. V1 message imports are different - no `ChatMessage`, limited to core message types
 2. Always use grouped imports from the same module to avoid linter issues
 3. Remove unused imports that can be auto-detected
 ### Testing Strategy
 1. Test message conversion utilities separately from the main chat model
 2. Use concrete test data rather than mocking for content block testing
 3. Include both streaming and non-streaming test cases
 4. Test error conditions (empty content, missing fields, etc.)
 ### Common Error Patterns
 - **"AIMessage has no attribute 'additional_kwargs'"** → Use ReasoningContentBlock instead
 - **"ChatMessage not found"** → ChatMessage doesn't exist in V1, remove references
 - **Type errors with ResponseMetadata** → Add `# type: ignore[typeddict-unknown-key]` for extra fields
 - **Generator type mismatches** → Use `str()` conversion for text extraction from content blocks