updated design

2025-07-23 13:39:50 +03:00
parent df17a99247
commit f759eac04b
1 changed files with 127 additions and 17 deletions
--- a/.kiro/specs/multi-modal-trading-system/design.md
+++ b/.kiro/specs/multi-modal-trading-system/design.md
@@ -58,9 +58,11 @@ The DataProvider class will:
 - Identify pivot points
 - Normalize data
 - Distribute data to subscribers
 - Calculate any other algoritmic manipulations/calculations on the data
 - Cache up to 3x the model inputs (300 ticks OHLCV, etc) data so we can do a proper backtesting in up to 2x time in the future
 Based on the existing implementation in `core/data_provider.py`, we'll enhance it to:
- Improve pivot point calculation using Williams Market Structure
+- Improve pivot point calculation using reccursive Williams Market Structure 
 - Optimize data caching for better performance
 - Enhance real-time data streaming
 - Implement better error handling and fallback mechanisms
@@ -129,33 +131,141 @@ Training:
 ### 4. Orchestrator
-The Orchestrator is responsible for making final trading decisions based on inputs from both CNN and RL models.
+The Orchestrator serves as the central coordination hub of the multi-modal trading system, responsible for data subscription management, model inference coordination, output storage, and training pipeline orchestration.
 #### Key Classes and Interfaces
 - **Orchestrator**: Main class for the orchestrator.
 - **DataSubscriptionManager**: Manages subscriptions to multiple data streams with different refresh rates.
 - **ModelInferenceCoordinator**: Coordinates inference across all models.
 - **ModelOutputStore**: Stores and manages model outputs for cross-model feeding.
 - **TrainingPipelineManager**: Manages training pipelines for all models.
 - **DecisionMaker**: Interface for making trading decisions.
 - **MoEGateway**: Mixture of Experts gateway for model integration.
 #### Core Responsibilities
 ##### 1. Data Subscription and Management
 The Orchestrator subscribes to the Data Provider and manages multiple data streams with varying refresh rates:
 - **10Hz COB (Cumulative Order Book) Data**: High-frequency order book updates for real-time market depth analysis
 - **OHLCV Data**: Traditional candlestick data at multiple timeframes (1s, 1m, 1h, 1d)
 - **Market Tick Data**: Individual trade executions and price movements
 - **Technical Indicators**: Calculated indicators that update at different frequencies
 - **Pivot Points**: Market structure analysis data
 **Data Stream Management**:
 - Maintains separate buffers for each data type with appropriate retention policies
 - Ensures thread-safe access to data streams from multiple models
 - Implements intelligent caching to serve "last updated" data efficiently
 - Maintains full base dataframe that stays current for any model requesting data
 - Handles data synchronization across different refresh rates
 ##### 2. Model Inference Coordination
 The Orchestrator coordinates inference across all models in the system:
 **Inference Pipeline**:
 - Triggers model inference when relevant data updates occur
 - Manages inference scheduling based on data availability and model requirements
 - Coordinates parallel inference execution for independent models
 - Handles model dependencies (e.g., RL model waiting for CNN hidden states)
 **Model Input Management**:
 - Assembles appropriate input data for each model based on their requirements
 - Ensures models receive the most current data available at inference time
 - Manages feature engineering and data preprocessing for each model
 - Handles different input formats and requirements across models
 ##### 3. Model Output Storage and Cross-Feeding
 The Orchestrator maintains a centralized store for all model outputs and manages cross-model data feeding:
 **Output Storage**:
 - Stores CNN predictions, confidence scores, and hidden layer states
 - Stores RL action recommendations and value estimates
 - Maintains historical output sequences for temporal analysis
 - Implements efficient retrieval mechanisms for real-time access
 **Cross-Model Feeding**:
 - Feeds CNN hidden layer states into RL model inputs
 - Provides CNN predictions as context for RL decision-making
 - Stores model outputs that become inputs for subsequent inference cycles
 - Manages circular dependencies and feedback loops between models
 ##### 4. Training Pipeline Management
 The Orchestrator coordinates training for all models by managing the prediction-result feedback loop:
 **Training Coordination**:
 - Calls each model's training pipeline when new inference results are available
 - Provides previous predictions alongside new results for supervised learning
 - Manages training data collection and labeling
 - Coordinates online learning updates based on real-time performance
 **Training Data Management**:
 - Maintains training datasets with prediction-result pairs
 - Implements data quality checks and filtering
 - Manages training data retention and archival policies
 - Provides training data statistics and monitoring
 **Performance Tracking**:
 - Tracks prediction accuracy for each model over time
 - Monitors model performance degradation and triggers retraining
 - Maintains performance metrics for model comparison and selection
 ##### 5. Decision Making and Trading Actions
 Beyond coordination, the Orchestrator makes final trading decisions:
 **Decision Integration**:
 - Combines outputs from CNN and RL models using Mixture of Experts approach
 - Applies confidence-based filtering to avoid uncertain trades
 - Implements configurable thresholds for buy/sell decisions
 - Considers market conditions and risk parameters
 #### Implementation Details
-The Orchestrator will:
+**Architecture**:
- Accept inputs from both CNN and RL models
+```python
- Output final trading actions (buy/sell)
+class Orchestrator:
- Consider confidence levels of both models
+    def __init__(self):
- Learn to avoid entering positions when uncertain
+        self.data_subscription_manager = DataSubscriptionManager()
- Allow for configurable thresholds for entering and exiting positions
+        self.model_inference_coordinator = ModelInferenceCoordinator()
        self.model_output_store = ModelOutputStore()
        self.training_pipeline_manager = TrainingPipelineManager()
        self.decision_maker = DecisionMaker()
        self.moe_gateway = MoEGateway()
    async def run(self):
        # Subscribe to data streams
        await self.data_subscription_manager.subscribe_to_data_provider()
        # Start inference coordination loop
        await self.model_inference_coordinator.start()
        # Start training pipeline management
        await self.training_pipeline_manager.start()
 ```
-Architecture:
+**Data Flow Management**:
- Mixture of Experts (MoE) approach
+- Implements event-driven architecture for data updates
- Gating network: Determine which expert to trust
+- Uses async/await patterns for non-blocking operations
- Expert models: CNN, RL, and potentially others
+- Maintains data freshness timestamps for each stream
- Decision network: Combine expert outputs
+- Implements backpressure handling for high-frequency data
-Training:
+**Model Coordination**:
- Train on historical data
+- Manages model lifecycle (loading, inference, training, updating)
- Update model based on trading outcomes
+- Implements model versioning and rollback capabilities
- Use reinforcement learning to optimize decision-making
+- Handles model failures and fallback mechanisms
 - Provides model performance monitoring and alerting
 **Training Integration**:
 - Implements incremental learning strategies
 - Manages training batch composition and scheduling
 - Provides training progress monitoring and control
 - Handles training failures and recovery
 ### 5. Trading Executor