{
    "title": "Guide to Building OpenSearch Embeddings",
    "turns": [{
        "turn": 1,
        "user": "Tell me how to set up OpenSearch Embedding locally.",
        "assistant": "Hello! I'll walk you through, step by step, how to build text embeddings and implement semantic search using OpenSearch in your local environment.\n\nHere's the overall process:\n\n1.  **Preparation**: Install the necessary software.\n2.  **Run OpenSearch Cluster**: Use Docker to run OpenSearch and OpenSearch Dashboards locally.\n3.  **Prepare Embedding Model**: Get a machine learning model to convert text into vectors.\n4.  **Create OpenSearch Index**: Set up a dedicated index for vector search (k-NN).\n5.  **Embed and Index Data**: Convert sample data into vectors and store them in OpenSearch.\n6.  **Run Semantic Search**: Convert your search query into a vector and find similar documents.\n\n* * *\n\n### 1. Preparation\n\nMake sure you have the following software installed in your local development environment:\n\n*   **Docker & Docker Compose**: Needed to easily run the OpenSearch cluster. \n    [Docker Desktop](https://www.docker.com/products/docker-desktop/)\n    includes both.\n*   **Python**: Used to write scripts for data embedding and communicating with OpenSearch. (Python 3.8 or higher recommended)\n*   **Required Python libraries**:\n    ```\n    pip install opensearch-py sentence-transformers\n    ```\n    *   `opensearch-py`: The official Python client for communicating with OpenSearch\n    *   `sentence-transformers`: A library that makes it easy to use text embedding models\n\n* * *\n\n### 2. Running the OpenSearch Cluster (Using Docker)\n\nThe easiest way to run OpenSearch locally is with Docker Compose.\n\n1.  Create a project folder and add a `docker-compose.yml` file inside it.\n2.  Copy and paste the following into your `docker-compose.yml` file:\n    ```\n    version: '3'\n    services:\n      opensearch-node1:\n        image: opensearchproject/opensearch:2.15.0\n        container_name: opensearch-node1\n        environment:\n          - cluster.name=opensearch-cluster\n          - discovery.type=single-node\n          - bootstrap.memory_lock=true # a long with the ulimits setting below\n          - \"OPENSEARCH_JAVA_OPTS=-Xms512m -Xmx512m\" # minimum and maximum Java heap size, recommend setting both to 50% of system RAM\n          - \"plugins.security.disabled=true\" # Disables security plugin for easy local development\n        ulimits:\n          memlock:\n            soft: -1\n            hard: -1\n          nofile:\n            soft: 65536 # Maximum number of open files\n            hard: 65536\n        volumes:\n          - opensearch-data1:/usr/share/opensearch/data\n        ports:\n          - 9200:9200\n          - 9600:9600 # required for Performance Analyzer\n        networks:\n          - opensearch-net\n      opensearch-dashboards:\n        image: opensearchproject/opensearch-dashboards:2.15.0\n        container_name: opensearch-dashboards\n        ports:\n          - 5601:5601\n        expose:\n          - \"5601\"\n        environment:\n          OPENSEARCH_HOSTS: '[\"http://opensearch-node1:9200\"]'\n          \"plugins.security.disabled=true\" # Disables security plugin for easy local development\n        networks:\n          - opensearch-net\n    volumes:\n      opensearch-data1:\n    networks:\n      opensearch-net:\n    ```\n    *   **`plugins.security.disabled=true`**: This setting disables OpenSearch's security features for easier local development. **Never use this in a production environment.**\n3.  In your terminal, navigate to the folder containing your `docker-compose.yml` file and run the following command to start the OpenSearch cluster:\n    ```\n    docker-compose up -d\n    ```\n4.  **Check if it's running**:\n    *   Open `http://localhost:9200` in your web browser to check OpenSearch info.\n    *   Open `http://localhost:5601` in your web browser to see if OpenSearch Dashboards is running.\n\n* * *\n\n### 3. Preparing the Embedding Model\n\nTo convert text into vectors, we'll use the `sentence-transformers` library from Hugging Face. It's important to choose a model that handles Korean sentences well. Here, we'll use the multilingual model `paraphrase-multilingual-mpnet-base-v2`.\n\nLoad the model in your Python script or Jupyter Notebook:\n\n```\nfrom sentence_transformers import SentenceTransformer"
    }]
}