🤖

GraphAI Agentにおけるユーザ入力の課題と解決策

Isamu

2024/12/13に公開

!GraphAI記事の一覧はこちら

 GraphAIにおけるユーザ入力の課題と解決策GraphAIにとってユーザからの入力は、とても難しい課題です。

というのも、GraphDataのワークフローはブラウザ、CLI、バッチのいずれの環境でも動作することを前提としているため、特定の入力方法を指定すると動作環境が固定されてしまうからです。

現状では、Agentを通じてユーザから入力を受け付けると、動作環境が固定されてしまうことが課題として認識されています。
入力を受け取る方法は以下の2つがあります。
Agentを通じて入力を受け取る方法
GraphDataのワークフロー実行前に入力を受け付け、Static nodeに値をinjectionする方法

 Agentで入力値を受け取るAgentを使用して入力値を受け取る場合、CLIでは@inquirer/inputを使った対話型の方法が、ブラウザでは特別な方法でformから入力値を受け取る方法が用いられます。

 CLIでの入力CLIでは@graphai/input_agentsを使用します。
実装例は以下のリンクをご参照ください。

サンプル実装（chat.ts）

 ブラウザでの入力ブラウザではAgent単体で入力を受け付けることが難しいため、AgentのGeneratorを用意し、Agentと入力を紐づける必要があります。

具体的には、Agentが入力状態になるとブラウザに通知し、フォームでの入力値がSubmitされるとAgentのPromiseが解決される仕組みです。
@receptron/event_agent_generatorというパッケージを利用することで、ブラウザ上で簡単に入力値を受け取ることが可能です。
vueのサンプルはこちら
これを使用することで、ワークフローの任意のタイミングでブラウザを介してユーザからの入力を受け取ることができます。

 ワークフローの初期値として渡す方法Static nodeはTypeScriptのGraphAIインスタンスでinjectValueを呼び出すことで値を更新できます。

これにより、ユーザの入力値をGraphのNodeに初期値として設定することが可能です。
ただし、この方法では初期値として値を渡すのみで、ワークフローの途中で値を更新することはできません。

 手動ループを活用した入力値の適用GraphAI 0.6.6からmanual loopをサポートしました。

通常、ループはワークフローの1回目の実行結果をStatic nodeの更新として受け取り、その値を更新して次回のワークフローを実行します。
initializeGraphAIとsetPreviousResultsを使用することで、プログラム側でこの仕組みを制御することが可能です。

v以下のコード例では、ワークフローの途中でユーザ入力値を反映させる方法を示しています。

 実装例const result = await graph.run();

graph.initializeGraphAI();  // 必要に応じてグラフを初期化
graph.setPreviousResults(result); // 前回の結果を設定（これによりStatic nodeの更新が適用される）
graph.injectValue("userInput", someValue); // ユーザ入力値を設定

const result2 = await graph.run();  // 再度runすることで実質的なループが可能
この方法はブラウザでのワークフロー動作時だけでなく、Expressサーバ上での動作にも適用可能であり、幅広い応用が期待できます。

 Challenges and Solutions for User Input in GraphAIHandling user input in GraphAI presents significant challenges.

This is because GraphData workflows are designed to operate in various environments, including browsers, CLI, and batch processes. Specifying a particular input method risks locking the workflow to a specific environment.

Currently, when user input is handled through an Agent, the operating environment becomes fixed, which is recognized as an issue.
There are two main methods for receiving input:
Using an Agent to handle input
Receiving input before executing a GraphData workflow and injecting it into a Static node

 Receiving Input via an AgentWhen receiving input through an Agent, CLI environments use an interactive approach with @inquirer/input, while browsers employ a special mechanism to await input via forms.

 Input in the CLIIn the CLI, you can use @graphai/input_agents.
Refer to the following implementation example:

Sample Implementation (chat.ts)

 Input in the BrowserIn the browser, it is difficult to handle input with the Agent alone.

You need to set up an Agent Generator that links the Agent with the input. Specifically, when the Agent enters an input state, it notifies the browser. When the input value is submitted, the Agent's promise resolves, allowing the input to be processed.
The @receptron/event_agent_generator package simplifies this process, enabling seamless input handling in browsers.
Refer to the Vue example here.
By using this package, you can receive user input via the browser at any point in the workflow.

 Passing Input as Workflow Initial ValuesStatic nodes can be updated by calling injectValue on the GraphAI instance in TypeScript.

This allows user input values to be set as initial values for Graph nodes.
However, this approach only applies to initial values and does not allow updating values during workflow execution.

 Applying Input Values with Manual LoopsGraphAI introduced support for manual loops in version 0.6.6.

Typically, loops work by taking the results of the first workflow execution, updating the Static node with these results, and then running the next workflow iteration.
Using initializeGraphAI and setPreviousResults, you can programmatically control this mechanism.

The following example demonstrates how to apply user input during workflow execution:

 Example Implementationconst result = await graph.run();

graph.initializeGraphAI();  // Initialize the graph if needed
graph.setPreviousResults(result); // Set the previous results (updates the Static node)
graph.injectValue("userInput", someValue); // Inject the user input value

const result2 = await graph.run();  // Re-run to achieve a functional loop
This approach is not limited to workflows running in the browser. It can also be used in workflows executed on an Express server, greatly expanding its potential applications.

シンギュラリティ・ソサエティPublication

人工知能を活用したアプリケーションやサービスを活用し、内発的動機付けで行動するエンジニア、起業家、社会起業家をサポートするコミュニティーです。 singularitysociety.org

Discussion

ログインするとコメントできます