Transforming Manual SOP Search into Instant Access with NLP and Vector Search

Component | Technology / Tool |
---|---|
Language | Python |
Frameworks | PyTorch, LangChain |
Models | BERT (fine-tuned for Japanese language understanding) |
Embedding Model | Multilingual embedding model (evaluated for Japanese compatibility) |
Vector Database | Weaviate (used for semantic search and structured metadata filtering) |
Cloud Platform | Google Cloud Platform (GCP) |
Deployment | Self-hosted on GCP with 2 GPUs; Weaviate hosted in a separate Docker container |
Model Strategy | Fine-tuned transformer models for Japanese text, integrated with LangChain for retrieval-augmented generation (RAG) |
Prompt Orchestration | LangChain-based wrapper for dynamic prompt engineering and response generation |
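
To make the retrieval flow in the table more concrete, the sketch below shows the general pattern of semantic search combined with structured metadata filtering, followed by prompt assembly for RAG. It is a plain-Python illustration only: it does not use the Weaviate or LangChain APIs, the three-dimensional vectors stand in for real embedding model output, and every name in it (`SopChunk`, `search`, `build_prompt`, the `department` field, the sample texts) is a hypothetical chosen for this example rather than taken from the actual system.

```python
from __future__ import annotations

import math
from dataclasses import dataclass, field


@dataclass
class SopChunk:
    """Hypothetical SOP passage: text, embedding vector, structured metadata."""
    text: str
    vector: list[float]
    metadata: dict[str, str] = field(default_factory=dict)


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(chunks, query_vector, filters, top_k=2):
    """Keep chunks whose metadata matches every filter, then rank by similarity."""
    candidates = [
        c for c in chunks
        if all(c.metadata.get(key) == value for key, value in filters.items())
    ]
    ranked = sorted(
        candidates,
        key=lambda c: cosine_similarity(c.vector, query_vector),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question, retrieved):
    """Assemble a prompt that restricts the answer to the retrieved excerpts."""
    context = "\n".join(f"- {c.text}" for c in retrieved)
    return (
        "Answer using only the SOP excerpts below.\n\n"
        f"Excerpts:\n{context}\n\n"
        f"Question: {question}"
    )


# Toy data: the vectors are made up; a real system would use an embedding model.
chunks = [
    SopChunk("Lock out the press before clearing a jam.",
             [0.9, 0.1, 0.0], {"department": "production"}),
    SopChunk("Record the torque reading after each shift.",
             [0.1, 0.9, 0.0], {"department": "production"}),
    SopChunk("Submit expense reports by the fifth business day.",
             [0.8, 0.2, 0.1], {"department": "finance"}),
]

query_vector = [1.0, 0.0, 0.0]  # stand-in for the embedded user question
hits = search(chunks, query_vector, {"department": "production"}, top_k=1)
prompt = build_prompt("How do I clear a jam safely?", hits)
print(prompt)
```

The filter is applied before ranking, so the finance passage is excluded even though its vector is close to the query. In a deployment like the one described, the vector database would be expected to handle both the filtering and the similarity ranking, with the orchestration layer taking care of prompt construction.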