书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

随笔8个月前发布枕樎

76 0 0

基础任务

按照教程，将 MindSearch 部署到 HuggingFace 并美化 Gradio 的界面，并提供截图和 Hugging Face 的Space的链接。

MindSearch 部署到Github Codespace 和 Hugging Face Space

1. 创建开发机 & 环境配置

打开codespace主页，选择blank template。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

浏览器会自动在新的页面打开一个web版的vscode。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

接下来的操作就和我们使用vscode基本没差别了。

然后我们新建一个目录用于存放 MindSearch 的相关代码，并把 MindSearch 仓库 clone 下来。在终端中运行下面的命令：

mkdir -p /workspaces/mindsearch cd /workspaces/mindsearch git clone https://github.com/InternLM/MindSearch.git cd MindSearch && git checkout b832275 && cd ..

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

接下来，我们创建一个 conda 环境来安装相关依赖。

# 创建环境 conda create -n hjl python=3.10 -y # 激活环境 conda activate hjl # 安装依赖 pip install -r /workspaces/mindsearch/MindSearch/requirements.txt

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

2. 获取硅基流动 API Key

因为要使用硅基流动的 API Key，所以接下来便是注册并获取 API Key 了。

首先，我们打开硅基流动统一登录来注册硅基流动的账号（如果注册过，则直接登录即可）。

在完成注册后，打开硅基流动统一登录来准备 API Key。首先创建新 API 密钥，然后点击密钥进行复制，以备后续使用。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

3. 启动 MindSearch

3.1 启动后端

由于硅基流动 API 的相关配置已经集成在了 MindSearch 中，所以我们可以直接执行下面的代码来启动 MindSearch 的后端。

export SILICON_API_KEY=第二步中复制的密钥 conda activate hjl cd /workspaces/mindsearch/MindSearch python -m mindsearch.app --lang cn --model_format internlm_silicon --search_engine DuckDuckGoSearch

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

3.2 启动前端

在后端启动完成后，我们打开新终端运行如下命令来启动 MindSearch 的前端。

conda activate hjl cd /workspaces/mindsearch/MindSearch python frontend/mindsearch_gradio.py

前后端都启动后，我们应该可以看到github自动为这两个进程做端口转发。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

部署完成！

4. 部署到 HuggingFace Space

最后，我们来将 MindSearch 部署到 HuggingFace Space。

我们首先打开 https://huggingface.co/spaces ，并点击 Create new Space，如下图所示。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

在输入 Space name 并选择 License 后，选择配置如下所示。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

然后，我们进入 Settings，配置硅基流动的 API Key。如下图所示。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

选择 New secrets，name 一栏输入 SILICON_API_KEY，value 一栏输入你的 API Key 的内容。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

最后，我们先新建一个目录，准备提交到 HuggingFace Space 的全部文件。

# 创建新目录 mkdir -p /workspaces/mindsearch/mindsearch_deploy # 准备复制文件 cd /workspaces/mindsearch cp -r /workspaces/mindsearch/MindSearch/mindsearch /workspaces/mindsearch/mindsearch_deploy cp /workspaces/mindsearch/MindSearch/requirements.txt /workspaces/mindsearch/mindsearch_deploy # 创建 app.py 作为程序入口 touch /workspaces/mindsearch/mindsearch_deploy/app.py

其中，app.py 的内容如下：


import json
import os
 
import gradio as gr
import requests
from lagent.schema import AgentStatusCode
 
os.system("python -m mindsearch.app --lang cn --model_format internlm_silicon &")
 
PLANNER_HISTORY = []
SEARCHER_HISTORY = []
 
 
def rst_mem(history_planner: list, history_searcher: list):
    '''
    Reset the chatbot memory.
    '''
    history_planner = []
    history_searcher = []
    if PLANNER_HISTORY:
        PLANNER_HISTORY.clear()
    return history_planner, history_searcher
 
 
def format_response(gr_history, agent_return):
    if agent_return['state'] in [
            AgentStatusCode.STREAM_ING, AgentStatusCode.ANSWER_ING
    ]:
        gr_history[-1][1] = agent_return['response']
    elif agent_return['state'] == AgentStatusCode.PLUGIN_START:
        thought = gr_history[-1][1].split('```')[0]
        if agent_return['response'].startswith('```'):
            gr_history[-1][1] = thought + '
' + agent_return['response']
    elif agent_return['state'] == AgentStatusCode.PLUGIN_END:
        thought = gr_history[-1][1].split('```')[0]
        if isinstance(agent_return['response'], dict):
            gr_history[-1][
                1] = thought + '
' + f'```json
{json.dumps(agent_return["response"], ensure_ascii=False, indent=4)}
```'  # noqa: E501
    elif agent_return['state'] == AgentStatusCode.PLUGIN_RETURN:
        assert agent_return['inner_steps'][-1]['role'] == 'environment'
        item = agent_return['inner_steps'][-1]
        gr_history.append([
            None,
            f"```json
{json.dumps(item['content'], ensure_ascii=False, indent=4)}
```"
        ])
        gr_history.append([None, ''])
    return
 
 
def predict(history_planner, history_searcher):
 
    def streaming(raw_response):
        for chunk in raw_response.iter_lines(chunk_size=8192,
                                             decode_unicode=False,
                                             delimiter=b'
'):
            if chunk:
                decoded = chunk.decode('utf-8')
                if decoded == '
':
                    continue
                if decoded[:6] == 'data: ':
                    decoded = decoded[6:]
                elif decoded.startswith(': ping - '):
                    continue
                response = json.loads(decoded)
                yield (response['response'], response['current_node'])
 
    global PLANNER_HISTORY
    PLANNER_HISTORY.append(dict(role='user', content=history_planner[-1][0]))
    new_search_turn = True
 
    url = 'http://localhost:8002/solve'
    headers = {'Content-Type': 'application/json'}
    data = {'inputs': PLANNER_HISTORY}
    raw_response = requests.post(url,
                                 headers=headers,
                                 data=json.dumps(data),
                                 timeout=20,
                                 stream=True)
 
    for resp in streaming(raw_response):
        agent_return, node_name = resp
        if node_name:
            if node_name in ['root', 'response']:
                continue
            agent_return = agent_return['nodes'][node_name]['detail']
            if new_search_turn:
                history_searcher.append([agent_return['content'], ''])
                new_search_turn = False
            format_response(history_searcher, agent_return)
            if agent_return['state'] == AgentStatusCode.END:
                new_search_turn = True
            yield history_planner, history_searcher
        else:
            new_search_turn = True
            format_response(history_planner, agent_return)
            if agent_return['state'] == AgentStatusCode.END:
                PLANNER_HISTORY = agent_return['inner_steps']
            yield history_planner, history_searcher
    return history_planner, history_searcher
 
 
with gr.Blocks() as demo:
    gr.HTML("""<h1 align="center">MindSearch Gradio Demo</h1>""")
    gr.HTML("""<p style="text-align: center; font-family: Arial, sans-serif;">MindSearch is an open-source AI Search Engine Framework with Perplexity.ai Pro performance. You can deploy your own Perplexity.ai-style search engine using either closed-source LLMs (GPT, Claude) or open-source LLMs (InternLM2.5-7b-chat).</p>""")
    gr.HTML("""
    <div style="text-align: center; font-size: 16px;">
        <a href="https://github.com/InternLM/MindSearch" style="margin-right: 15px; text-decoration: none; color: #4A90E2;">🔗 GitHub</a>
        <a href="https://arxiv.org/abs/2407.20183" style="margin-right: 15px; text-decoration: none; color: #4A90E2;">📄 Arxiv</a>
        <a href="https://huggingface.co/papers/2407.20183" style="margin-right: 15px; text-decoration: none; color: #4A90E2;">📚 Hugging Face Papers</a>
        <a href="https://huggingface.co/spaces/internlm/MindSearch" style="text-decoration: none; color: #4A90E2;">🤗 Hugging Face Demo</a>
    </div>
    """)
    with gr.Row():
        with gr.Column(scale=10):
            with gr.Row():
                with gr.Column():
                    planner = gr.Chatbot(label='planner',
                                         height=700,
                                         show_label=True,
                                         show_copy_button=True,
                                         bubble_full_width=False,
                                         render_markdown=True)
                with gr.Column():
                    searcher = gr.Chatbot(label='searcher',
                                          height=700,
                                          show_label=True,
                                          show_copy_button=True,
                                          bubble_full_width=False,
                                          render_markdown=True)
            with gr.Row():
                user_input = gr.Textbox(show_label=False,
                                        placeholder='帮我搜索一下 InternLM 开源体系',
                                        lines=5,
                                        container=False)
            with gr.Row():
                with gr.Column(scale=2):
                    submitBtn = gr.Button('Submit')
                with gr.Column(scale=1, min_width=20):
                    emptyBtn = gr.Button('Clear History')
 
    def user(query, history):
        return '', history + [[query, '']]
 
    submitBtn.click(user, [user_input, planner], [user_input, planner],
                    queue=False).then(predict, [planner, searcher],
                                      [planner, searcher])
    emptyBtn.click(rst_mem, [planner, searcher], [planner, searcher],
                   queue=False)
 
demo.queue()
demo.launch(server_name='0.0.0.0',
            server_port=7860,
            inbrowser=True,
            share=True)

在最后，将 /root/mindsearch/mindsearch_deploy 目录下的文件（使用 git）提交到 HuggingFace Space 即可完成部署了。将代码提交到huggingface space的流程如下：

首先创建一个有写权限的token。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

然后从huggingface把空的代码仓库clone到codespace。


cd /workspaces/codespaces-blank
git clone https://huggingface.co/spaces/<你的名字>/<仓库名称>
# 把token挂到仓库上，让自己有写权限
git remote set-url space https://<你的名字>:<上面创建的token>@huggingface.co/spaces/<你的名字>/<仓库名称>

如果报error: No such remote ‘space’，就git remote add space <远程仓库地址>

现在codespace就是本地仓库，huggingface space是远程仓库，接下来使用方法就和常规的git一样了。

cd <仓库名称> # 把刚才准备的文件都copy进来 cp -r /workspaces/mindsearch/mindsearch_deploy/* .

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

注意mindsearch文件夹其实是Mindsearch项目中的一个子文件夹，如果把这个Mindsearch的整个目录copy进来会有很多问题（git submodule无法提交代码，space中项目启动失败等）。

最后把代码提交到huggingface space会自动启动项目。

git add . git commit -m "update" git push

总结下git push 遇到的错误：

1.一直说没有给token令牌

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

报错原因：运行 git remote -v，可以看到有一个origin分支，git push默认使用这个的token，然而这个并没有设置token,如图：

解决办法：把space设置成主main,执行git push –set-upstream space main，然后可以git push了。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

然后就可以测试啦。

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

成功！

5.美化 Gradio 的界面

5.1 打开mindsearch_gradio.py文件

对以下代码进行修改：

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

添加以下代码

gr.Blocks(css=".gradio-container {background: url('file=https://img.zcool.cn/community/011db3571600c432f8758c9b65191d.jpg@2o.jpg')}")

5.2 重新运行mindsearch_gradio.py文件

conda activate hjl cd /workspaces/mindsearch/MindSearch python frontend/mindsearch_gradio.py

运行效果：

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

随笔

特别提醒: 内容为用户自行发布,如有侵权,请联系我们管理员删除,邮箱:mail@xieniao.com ,在收到您的邮件后我们会在3个工作日内处理。

markdown教程(持续更新中)

随笔

8个月前

0720

发布的商品显示商品过期不存在是怎么回事？ – 淘宝天猫

随笔

1年前

01380

产品推广选错产品了，能退款或更改吗？在线等？ – 淘宝天猫

随笔

12个月前

01040

Donut 项目教程

随笔

7个月前

0620

暂无评论

您必须登录才能参与评论！

立即登录

暂无评论...

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

基础任务

1. 创建开发机 & 环境配置

2. 获取硅基流动 API Key

3. 启动 MindSearch

3.1 启动后端

3.2 启动前端

4. 部署到 HuggingFace Space

5.美化 Gradio 的界面

5.1 打开mindsearch_gradio.py文件

5.2 重新运行mindsearch_gradio.py文件

华为开源自研AI框架昇思MindSpore应用案例：ICT实现图像修复

从零到一，全面掌握Apache DolphinScheduler发版流程，实战派经验分享！

相关文章

markdown教程(持续更新中)

发布的商品显示商品过期不存在是怎么回事？ – 淘宝天猫

产品推广选错产品了，能退款或更改吗？在线等？ – 淘宝天猫

Donut 项目教程

暂无评论

热门网站

Wicked Backgrounds

好看的韩国漫画_韩漫在线免费阅读-汗汗漫画网

爱奇艺

Claude

留学世界

黑料正能量

热门文章

Mybatis学习笔记

Mediapipe Python 示例教程

Vue+Express全栈开发项目实战技能：‌从0到1打造完整电商项目

H5随机短视频滑动版带打赏源码

他趣邀请码获取方法(他趣app邀请码2024最新汇总及填写步骤)有效可用

做礼盒的。没有七天无理由。包邮产品。商品寄出，还没收到，客户申请仅退款。来回运费应该谁承担？ – 淘宝天猫

淘宝有没有权力扣我支付宝里面的钱？ – 淘宝天猫

商品链接被投诉下架提交资质证明后重新上架？ – 淘宝天猫

书生大模型实战营（暑假场）进阶岛关卡六——MindSearch 快速部署

基础任务

1. 创建开发机 & 环境配置

2. 获取硅基流动 API Key

3. 启动 MindSearch

3.1 启动后端

3.2 启动前端

4. 部署到 HuggingFace Space

5.美化 Gradio 的界面

5.1 打开mindsearch_gradio.py文件

5.2 重新运行mindsearch_gradio.py文件

华为开源自研AI框架昇思MindSpore应用案例：ICT实现图像修复

从零到一，全面掌握Apache DolphinScheduler发版流程，实战派经验分享！

相关文章

热门网站

Wicked Backgrounds

好看的韩国漫画_韩漫在线免费阅读-汗汗漫画网

爱奇艺

Claude

留学世界

黑料正能量

热门文章

标签云