babylon.rag.pre_embeddings.manager
Pre-embeddings management for the RAG system.
This module provides the PreEmbeddingsManager, which integrates preprocessing, chunking, and caching components to prepare content for embedding.
Classes
| PreEmbeddingsConfig | Configuration for the pre-embeddings system. |
| PreEmbeddingsManager | Manages the pre-embeddings pipeline for the RAG system. |
- class babylon.rag.pre_embeddings.manager.PreEmbeddingsConfig(**data)
Bases: BaseModel
Configuration for the pre-embeddings system.
- Parameters:
preprocessing_config (PreprocessingConfig | None)
chunking_config (ChunkingConfig | None)
cache_config (CacheConfig | None)
- preprocessing_config
Configuration for content preprocessing
- chunking_config
Configuration for content chunking
- cache_config
Configuration for embedding cache management
- model_config: ClassVar[ConfigDict] = {'frozen': True}
Configuration for the model; should be a dictionary conforming to pydantic.config.ConfigDict.
- preprocessing_config: PreprocessingConfig | None
- chunking_config: ChunkingConfig | None
- cache_config: CacheConfig | None
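Since `model_config` sets `frozen` to `True`, instances are presumably immutable once constructed. The sketch below imitates that behaviour with a standard-library frozen dataclass instead of the real pydantic model, so it runs without the package installed. `SketchConfig`, the plain `dict` sub-configs, and the `chunk_size` and `max_size` keys are hypothetical stand-ins, not part of the library.

```python
from dataclasses import FrozenInstanceError, dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class SketchConfig:
    """Hypothetical stand-in for PreEmbeddingsConfig (not the real class)."""

    preprocessing_config: Optional[dict] = None
    chunking_config: Optional[dict] = None
    cache_config: Optional[dict] = None


# Fields that are not supplied stay None, mirroring the optional parameters.
config = SketchConfig(chunking_config={"chunk_size": 200})

try:
    # Assigning to a field of a frozen instance is rejected.
    config.cache_config = {"max_size": 10}
    mutated = True
except FrozenInstanceError:
    mutated = False

# To change a setting, build a new instance instead of mutating the old one.
updated = replace(config, cache_config={"max_size": 10})
```

With a pydantic v2 model, `model_copy(update=...)` plays roughly the role that `dataclasses.replace` plays here.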
- class babylon.rag.pre_embeddings.manager.PreEmbeddingsManager(config=None, preprocessor=None, chunker=None, cache_manager=None, lifecycle_manager=None)
Bases: object
Manages the pre-embeddings pipeline for the RAG system.
This class integrates preprocessing, chunking, and caching components to prepare content for embedding generation.
- Parameters:
config (PreEmbeddingsConfig | None)
preprocessor (ContentPreprocessor | None)
chunker (ChunkingStrategy | None)
cache_manager (EmbeddingCacheManager | None)
lifecycle_manager (Any | None)
- __init__(config=None, preprocessor=None, chunker=None, cache_manager=None, lifecycle_manager=None)
Initialize with configuration and optional component instances.
- Parameters:
config (PreEmbeddingsConfig | None) – Configuration for the pre-embeddings system
preprocessor (ContentPreprocessor | None) – Custom preprocessor instance
chunker (ChunkingStrategy | None) – Custom chunker instance
cache_manager (EmbeddingCacheManager | None) – Custom cache manager instance
lifecycle_manager (Any | None) – Lifecycle manager for object state tracking
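Every component argument defaults to `None`, which suggests the manager falls back to a default component when no custom instance is passed. The sketch below shows that pattern in isolation. `SketchManager` and `default_preprocessor` are hypothetical names, and the fallback behaviour itself is an assumption about the real class.

```python
from typing import Callable, Optional


def default_preprocessor(text: str) -> str:
    """Hypothetical default component: trim surrounding whitespace."""
    return text.strip()


class SketchManager:
    """Hypothetical stand-in illustrating optional component injection."""

    def __init__(self, preprocessor: Optional[Callable[[str], str]] = None) -> None:
        # Use the caller's component if given, otherwise the default one.
        if preprocessor is None:
            preprocessor = default_preprocessor
        self.preprocessor = preprocessor


default_manager = SketchManager()
custom_manager = SketchManager(preprocessor=str.upper)

default_result = default_manager.preprocessor("  hello  ")
custom_result = custom_manager.preprocessor("hello")
```

Passing components in this way makes it easy to swap in a test double for the cache or chunker without touching the manager.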
- process_content(content)
Process a single content item through the pre-embeddings pipeline.
- prepare_for_embedding(obj)
Prepare an object for embedding by processing its content.
This method is designed to work with objects that follow the Embeddable protocol from the embedding system.
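The two methods above can be pictured as one preprocess, cache-lookup, chunk pipeline plus a thin wrapper that pulls the content out of an object. The following is a minimal sketch under stated assumptions, not the library's implementation: `SketchPipeline`, `HasContent`, `Note`, the word-based chunking, and the list-of-strings return type are all hypothetical, and the source does not say which attributes the Embeddable protocol requires.

```python
import hashlib
from dataclasses import dataclass
from typing import Dict, List, Protocol


class HasContent(Protocol):
    """Assumed shape of an embeddable object: it exposes text content."""

    content: str


@dataclass
class Note:
    """Hypothetical object that satisfies HasContent."""

    content: str


class SketchPipeline:
    """Hypothetical stand-in for the preprocess -> cache -> chunk flow."""

    def __init__(self, chunk_size: int = 5) -> None:
        self.chunk_size = chunk_size  # words per chunk (assumed setting)
        self._cache: Dict[str, List[str]] = {}
        self.cache_hits = 0

    def process_content(self, content: str) -> List[str]:
        # 1. Preprocess: collapse runs of whitespace.
        words = content.split()
        cleaned = " ".join(words)
        # 2. Cache lookup, keyed on a hash of the cleaned text.
        key = hashlib.sha256(cleaned.encode("utf-8")).hexdigest()
        if key in self._cache:
            self.cache_hits += 1
            return self._cache[key]
        # 3. Chunk: fixed-size groups of words.
        chunks = [
            " ".join(words[i:i + self.chunk_size])
            for i in range(0, len(words), self.chunk_size)
        ]
        self._cache[key] = chunks
        return chunks

    def prepare_for_embedding(self, obj: HasContent) -> List[str]:
        # Delegate to the single-item pipeline using the object's content.
        return self.process_content(obj.content)


pipeline = SketchPipeline(chunk_size=3)
note = Note(content="  one two   three four five ")
first = pipeline.prepare_for_embedding(note)
second = pipeline.prepare_for_embedding(note)  # served from the cache
```

Keying the cache on the cleaned text rather than the raw input means inputs that differ only in whitespace share one cache entry.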