Ref

 llama.cpp
Ref
https://github.com/ggerganov/llama.cpp
https://note.com/npaka/n/n0ad63134fbe2



 Verifying operation
Download the model and place it under the models directory

https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main
llama-2-7b-chat.Q5_K_S.gguf

Build llama.cpp and start the server
make llama-server
./llama-server -m models/llama-2-7b-chat.Q5_K_S.gguf -c 2048
Tried the OpenAI quickstart example (curl) ✅
https://platform.openai.com/docs/quickstart
curl "http://localhost:8080/chat/completions" \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer $OPENAI_API_KEY" \
    -d '{
        "model": "gpt-4o-mini",
        "messages": [
            {
                "role": "system",
                "content": "You are a helpful assistant."
            },
            {
                "role": "user",
                "content": "Write a haiku that explains the concept of recursion."
            }
        ]
    }'
Tried it from the UI ✅
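
The curl request above can also be sent from Python. This is a minimal sketch using only the standard library, assuming llama-server is still listening on localhost:8080 and returns an OpenAI-style response body. The helper names `build_chat_request` and `chat` are made up for this note, and the model name is simply copied from the curl example.

```python
import json
import urllib.request

# Assumption: same host/port as in the curl example above.
BASE_URL = "http://localhost:8080"


def build_chat_request(user_prompt,
                       system_prompt="You are a helpful assistant.",
                       model="gpt-4o-mini",
                       base_url=BASE_URL):
    """Build (without sending) the same POST request as the curl example."""
    payload = {
        "model": model,  # value copied from the curl example
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_prompt},
        ],
    }
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Placeholder value, standing in for $OPENAI_API_KEY in the curl call.
            "Authorization": "Bearer sk-no-key-required",
        },
        method="POST",
    )


def chat(user_prompt):
    """Send the request and return the assistant's reply text."""
    request = build_chat_request(user_prompt)
    with urllib.request.urlopen(request, timeout=120) as resp:
        body = json.load(resp)
    # Assumes an OpenAI-style response: choices[0].message.content
    return body["choices"][0]["message"]["content"]
```

With the server running, `print(chat("Write a haiku that explains the concept of recursion."))` should print the model's reply.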

輝

 gpt-researcher

 Ref
https://github.com/assafelovic/gpt-researcher
https://github.com/ggerganov/llama.cpp
https://app.tavily.com/my-plan


 memo
set up llama.cpp as the LLM server
manual
https://docs.gptr.dev/docs/gpt-researcher/llms/llms#custom-openai

settings
# set the custom OpenAI API url
OPENAI_BASE_URL="http://localhost:8080"
# set the custom OpenAI API key
OPENAI_API_KEY="sk-no-key-required"
# set the TAVILY_API_KEY
TAVILY_API_KEY="my-key"

# specify the custom OpenAI API llm model  
FAST_LLM="openai:gpt-4o-mini"
# specify the custom OpenAI API llm model  
SMART_LLM="openai:gpt-4o"

DOC_PATH=./my-docs
OPENAI_API_KEY: a real key is not needed
TAVILY_API_KEY: get a free key from https://app.tavily.com/
others: the defaults are OK
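
A quick way to check the settings before starting gpt-researcher is to parse them and list anything missing. This is a rough sketch, assuming the settings are kept in a `.env`-style text file. `parse_env`, `missing_keys`, and `REQUIRED_KEYS` are hypothetical helpers written for this note, not part of gpt-researcher.

```python
# Hypothetical helpers for this note; not part of gpt-researcher.
REQUIRED_KEYS = ("OPENAI_BASE_URL", "OPENAI_API_KEY", "TAVILY_API_KEY")


def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and quotes from the value.
        settings[key.strip()] = value.strip().strip('"').strip("'")
    return settings


def missing_keys(settings, required=REQUIRED_KEYS):
    """Return the required keys that are absent or empty."""
    return [key for key in required if not settings.get(key)]
```

For example, `missing_keys(parse_env(open(".env").read()))` returns an empty list when all three keys are set (the `.env` path is an assumption).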



 issues
issues reported by me
https://github.com/assafelovic/gpt-researcher/issues/954
https://github.com/assafelovic/gpt-researcher/issues/957

It does not seem to be stable yet; the following errors occurred
  File "/Users/xxx/study/AI/tools/gpt-researcher/gpt_researcher/actions/agent_creator.py", line 59, in handle_json_error
    return json_data["server"], json_data["agent_role_prompt"]
           ~~~~~~~~~^^^^^^^^^^
KeyError: 'server'
  File "/Users/xxx/study/AI/tools/gpt-researcher/gpt_researcher/skills/researcher.py", line 149, in __get_context_by_search
    sub_queries.append(query)
    ^^^^^^^^^^^^^^^^^^
AttributeError: 'str' object has no attribute 'append'
INFO:     connection closed
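
Both errors look consistent with the model reply arriving in a shape the code did not expect: a JSON object without a "server" key, and a string where a list was expected. That is only a guess from the traceback, not a confirmed diagnosis. The sketch below shows one defensive way to handle such replies. `extract_agent` and `as_query_list` are hypothetical helpers written for this note and are not gpt-researcher's code; only the key names are taken from the traceback.

```python
import json


def extract_agent(raw_response):
    """Return (server, agent_role_prompt), or None if the reply is unusable."""
    try:
        data = json.loads(raw_response)
    except (TypeError, ValueError):
        # Not valid JSON (json.JSONDecodeError is a ValueError) or not a string.
        return None
    if not isinstance(data, dict):
        return None
    server = data.get("server")
    prompt = data.get("agent_role_prompt")
    if server is None or prompt is None:
        # Avoids the KeyError seen above when a key is missing.
        return None
    return server, prompt


def as_query_list(sub_queries):
    """Coerce a reply that may be a bare string (or None) into a list."""
    if isinstance(sub_queries, str):
        return [sub_queries]
    return list(sub_queries or [])
```

With `as_query_list`, appending the original query works even when the model returned a single string instead of a list.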

輝

cursor

Ref
- https://www.cursor.com/
task
- create a chrome plugin

輝

HuggingChat

Ref
- https://huggingface.co/chat/
- https://www.sbbit.jp/article/cont1/113786

輝

LM Studio

Ref
- https://lmstudio.ai/
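Impressions
- Convenient: multiple LLM models can be downloaded, selected, and run from one place
- However, when I tried an OSS LLM model, it sometimes gave a wrong answer (ChatGPT answered correctly)
  - Q: please create a mermaid class diagram
  - A: the generated mermaid diagram was in an invalid format
  - Model: Llama-3.2-1B-Instruct-Q8_0-GGUF/llama-3.2-1b-instruct-q8_0.gguf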

輝

huggingface

Ref
- https://huggingface.co/
- https://www.sbbit.jp/article/cont1/122042

輝

summarize youtube

https://monica.im/zh_CN/features/youtube-summary-with-chatgpt