GradBot - Mô Hình Theo Dõi Trạng Thái và Phản Hồi Đối Thoại Đa Nhiệm

Khám phá Gradbot, mô hình theo dõi trạng thái và phản hồi cho cả đối thoại theo nhiệm vụ và đối thoại chung, nâng cao trải nghiệm giao tiếp.

Trường đại học

FPT University

Chuyên ngành

Bachelor of Artificial Intelligence in Computer Science

Người đăng

Ẩn danh

Thể loại

final capstone project

2024

Phí lưu trữ

30 Point

Mục lục chi tiết

ACKNOWLEDGMENTS

AUTHOR CONTRIBUTIONS

ABSTRACT

1. INTRODUCTION

1.1. Dialogue State Tracking

1.2. Enhance Reading Comprehension

1.3. Dialogue Response

2. PROJECT MANAGEMENT PLAN

3. Overall Project Objective

3.1. Set up the system’s architecture

4. System Design and Optimization

4.1. Consistency and Compatibility Between Modules

4.2. Flexibility to Change and Upgrade the System

4.3. Optimal training time

4.4. Suitability for Research or Production Needs

4.5. Optimization in training

4.6. Combine Model Parallel and Data Parallel

4.7. Choosing checkpoint model

4.8. Multitask pre-training

4.9. Instruction training and Chain of Thought training

4.10. Complexity of building Schema-guided Definition

4.11. Predefined structure model

4.12. Schema-guided representation

4.13. Dialogue context representation

4.14. Action enhances constraint

4.15. DST as guided Reading Conversation

5. Evaluation and Dataset Analysis

5.1. Schema-Guided Dialogue (SGD)

5.2. Overview Schema-Guided

5.3. Schema-Guided Approach

5.4. Comparison With Other Datasets

5.5. Annotation Error Types

6. RESULTS AND DISCUSSION

6.1. Joint Goal Accuracy

6.2. Performance on FusedChat

6.3. Performance on SGD

6.4. Performance on MultiWoz

7. CONCLUSIONS AND PERSPECTIVES

APPENDIX

Tóm tắt

I. GradBot Tổng Quan Mô Hình Đối Thoại Đa Nhiệm Hiện Đại

Thế giới chatbot và hệ thống đối thoại đang chứng kiến sự phát triển vượt bậc, đặc biệt trong lĩnh vực đối thoại hướng mục tiêu (TOD). Tuy nhiên, một thách thức lớn là làm sao để hệ thống không chỉ thực hiện các tác vụ cụ thể mà còn có thể tham gia vào đối thoại mở (ODD) một cách tự nhiên. GradBot xuất hiện như một giải pháp, một mô hình đối thoại đa nhiệm có khả năng theo dõi trạng thái đối thoại và cung cấp phản hồi đối thoại phù hợp trong cả hai bối cảnh TOD và ODD. Mô hình này khai thác sức mạnh của xử lý ngôn ngữ tự nhiên (NLP) và học máy (Machine Learning) để mang lại trải nghiệm tương tác người-máy mượt mà và hiệu quả. Theo nghiên cứu của Sinh Nguyen và Truc Nguyen, GradBot được xây dựng dựa trên những tiến bộ gần đây trong prompt engineering và conditional generation, cho phép thực hiện zero-shot learning, một khả năng quan trọng để thích ứng với các miền mới. Từ đó, GradBot được phát triển để có thể hỗ trợ các doanh nghiệp xây dựng ứng dụng chatbot thông minh, có khả năng phục vụ khách hàng một cách toàn diện.

1.1. Tầm Quan Trọng của Mô Hình Đối Thoại Đa Nhiệm GradBot

Trong bối cảnh mà người dùng mong đợi sự linh hoạt và thông minh từ các chatbot, khả năng xử lý cả tác vụ cụ thể và đối thoại tự do trở nên thiết yếu. GradBot đáp ứng nhu cầu này bằng cách kết hợp hai khả năng trong một mô hình đối thoại duy nhất. Điều này không chỉ nâng cao trải nghiệm người dùng (UX) mà còn giúp doanh nghiệp tiết kiệm chi phí và nguồn lực bằng cách loại bỏ sự cần thiết phải phát triển và duy trì nhiều hệ thống riêng biệt.

1.2. Ưu Điểm Vượt Trội Của GradBot So Với Các Mô Hình Khác

GradBot nổi bật nhờ khả năng zero-shot learning, cho phép nó thích ứng với các miền mới mà không cần phải được đào tạo lại từ đầu. Điều này đạt được thông qua việc sử dụng prompt engineering và conditional generation. So với các mô hình chỉ tập trung vào TOD hoặc ODD, GradBot cung cấp một giải pháp toàn diện hơn, phù hợp với nhiều tình huống tương tác người-máy khác nhau. Thêm vào đó, GradBot sở hữu tốc độ phản hồi nhanh hơn so với các mô hình ngôn ngữ lớn khác.

II. Thách Thức Khó Khăn Khi Xây Dựng Mô Hình Đối Thoại Đa Nhiệm

Việc xây dựng một mô hình đối thoại đa nhiệm như GradBot không phải là một nhiệm vụ đơn giản. Một trong những thách thức lớn nhất là đảm bảo tính nhất quán đối thoại và độ chính xác phản hồi trong cả hai miền TOD và ODD. Bài toán theo dõi trạng thái trở nên phức tạp hơn khi hệ thống phải theo dõi nhiều mục tiêu và ngữ cảnh khác nhau. Ngoài ra, việc tạo ra phản hồi đối thoại tự nhiên và phù hợp đòi hỏi một sự hiểu biết sâu sắc về ngôn ngữ và ngữ cảnh. Theo tài liệu nghiên cứu, các mô hình DST hiện tại thường bị giới hạn bởi việc tập trung chủ yếu vào đối thoại hướng mục tiêu, dẫn đến hiệu suất kém khi được triển khai trong các tình huống đa dạng. Bên cạnh đó, các mô hình phản hồi đối thoại trong các nghiên cứu trước đây thường cho kết quả không tốt do phản hồi thiếu tự nhiên và trôi chảy.

2.1. Bài Toán Theo Dõi Trạng Thái Đối Thoại Trong Môi Trường Đa Nhiệm

Trong môi trường đa nhiệm, hệ thống đối thoại phải đồng thời theo dõi nhiều luồng thông tin khác nhau, từ mục tiêu của người dùng đến ngữ cảnh của cuộc trò chuyện. Điều này đòi hỏi một cơ chế theo dõi trạng thái đối thoại mạnh mẽ, có khả năng xử lý sự phức tạp và không chắc chắn. GradBot giải quyết vấn đề này bằng cách sử dụng các kỹ thuật học máy (Machine Learning) tiên tiến để học cách phân biệt và ưu tiên các thông tin quan trọng nhất.

2.2. Đảm Bảo Tính Tự Nhiên Và Chính Xác Của Phản Hồi Đối Thoại

Một chatbot thành công không chỉ phải trả lời đúng mà còn phải trả lời một cách tự nhiên và hấp dẫn. Điều này đòi hỏi mô hình ngôn ngữ (Language Model) phải có khả năng tạo ra các phản hồi đối thoại đa dạng và phù hợp với ngữ cảnh. GradBot sử dụng các kỹ thuật fine-tuning và transfer learning để cải thiện khả năng sinh phản hồi đối thoại tự nhiên và chính xác.

III. Phương Pháp GradBot Kết Hợp Prompt Engineering và Conditional Generation

GradBot sử dụng một phương pháp tiếp cận độc đáo kết hợp prompt engineering và conditional generation để giải quyết các thách thức của đối thoại đa nhiệm. Prompt engineering cho phép định hướng mô hình ngôn ngữ bằng cách cung cấp các hướng dẫn và ví dụ rõ ràng, trong khi conditional generation cho phép tạo ra phản hồi đối thoại dựa trên các điều kiện cụ thể, chẳng hạn như mục tiêu của người dùng hoặc trạng thái của cuộc trò chuyện. Theo tác giả Sinh Nguyen và Truc Nguyen, GradBot có thể điều khiển luồng hội thoại và dễ dàng mở rộng các miền (dịch vụ) kinh doanh mới. Sử dụng các mô tả ngôn ngữ tự nhiên với các hướng dẫn chi tiết để hỗ trợ tốt hơn cả đối thoại hướng mục tiêu và đối thoại mở.

3.1. Prompt Engineering Hướng Dẫn Mô Hình Bằng Ngôn Ngữ Tự Nhiên

Prompt engineering là một kỹ thuật mạnh mẽ cho phép kiểm soát hành vi của mô hình ngôn ngữ mà không cần phải đào tạo lại từ đầu. Bằng cách thiết kế các prompt (mồi) một cách cẩn thận, chúng ta có thể hướng dẫn GradBot thực hiện các tác vụ cụ thể, chẳng hạn như trả lời câu hỏi, đưa ra khuyến nghị hoặc tham gia vào cuộc trò chuyện. Điều này đặc biệt hữu ích trong môi trường đa nhiệm, nơi hệ thống phải có khả năng thích ứng với nhiều loại yêu cầu khác nhau.

3.2. Conditional Generation Tạo Phản Hồi Dựa Trên Ngữ Cảnh

Conditional generation cho phép GradBot tạo ra các phản hồi đối thoại phù hợp với ngữ cảnh hiện tại của cuộc trò chuyện. Điều này có nghĩa là hệ thống có thể xem xét mục tiêu của người dùng, lịch sử trò chuyện và các yếu tố khác để tạo ra một phản hồi có ý nghĩa và hữu ích. Điều này giúp cải thiện tính nhất quán đối thoại và trải nghiệm người dùng (UX).

3.3. Zero Shot Learning Khả Năng Thích Ứng Với Các Miền Mới

Zero-shot learning là một khả năng quan trọng cho phép GradBot hoạt động hiệu quả trong các miền mới mà không cần phải được đào tạo lại. Điều này đạt được thông qua việc sử dụng prompt engineering và conditional generation để khái quát hóa kiến thức từ các miền đã biết sang các miền chưa biết. Khả năng này giúp GradBot trở thành một giải pháp linh hoạt và hiệu quả cho nhiều ứng dụng khác nhau.

IV. GradBot Ứng Dụng Thực Tế Và Kết Quả Nghiên Cứu Nổi Bật

GradBot có tiềm năng ứng dụng rộng rãi trong nhiều lĩnh vực, từ chăm sóc khách hàng đến giáo dục và thương mại điện tử. Trong chăm sóc khách hàng, GradBot có thể được sử dụng để trả lời câu hỏi, giải quyết vấn đề và cung cấp hỗ trợ kỹ thuật. Trong giáo dục, GradBot có thể được sử dụng để cung cấp hướng dẫn học tập, trả lời câu hỏi và cung cấp phản hồi. Trong thương mại điện tử, GradBot có thể được sử dụng để giới thiệu sản phẩm, xử lý đơn hàng và cung cấp hỗ trợ khách hàng. Nghiên cứu của Sinh Nguyen và Truc Nguyen đã thử nghiệm tính hiệu quả của phương pháp GradBot trên bộ dữ liệu FusedChat, SGD và MultiWoz2.4, đạt được hiệu suất đáng chú ý trên một số điểm chuẩn và đánh giá của con người.

4.1. GradBot Trong Chăm Sóc Khách Hàng Tối Ưu Hóa Tương Tác

GradBot có thể tự động hóa nhiều tác vụ chăm sóc khách hàng khác nhau, từ trả lời câu hỏi đơn giản đến giải quyết các vấn đề phức tạp. Điều này giúp giảm tải cho nhân viên chăm sóc khách hàng và cải thiện trải nghiệm người dùng (UX). Khả năng đa nhiệm của GradBot cho phép nó xử lý nhiều yêu cầu khác nhau một cách hiệu quả, từ đặt hàng đến cung cấp thông tin sản phẩm.

4.2. GradBot Trong Giáo Dục Nâng Cao Trải Nghiệm Học Tập Cá Nhân Hóa

GradBot có thể cung cấp hướng dẫn học tập cá nhân hóa cho sinh viên, giúp họ hiểu rõ hơn về các khái niệm và cải thiện kết quả học tập. Hệ thống có thể trả lời câu hỏi, cung cấp phản hồi và tạo ra các bài tập tùy chỉnh dựa trên nhu cầu của từng sinh viên. Khả năng theo dõi trạng thái đối thoại của GradBot cho phép nó theo dõi tiến trình học tập của sinh viên và điều chỉnh nội dung cho phù hợp.

V. Kết Luận Tương Lai Của Mô Hình Đối Thoại GradBot Đa Nhiệm

GradBot đánh dấu một bước tiến quan trọng trong lĩnh vực hệ thống đối thoại. Bằng cách kết hợp prompt engineering và conditional generation, GradBot mang lại một giải pháp linh hoạt và hiệu quả cho đối thoại đa nhiệm. Khả năng zero-shot learning của GradBot cho phép nó thích ứng với các miền mới một cách nhanh chóng và dễ dàng, mở ra nhiều cơ hội ứng dụng tiềm năng trong nhiều lĩnh vực khác nhau. Trong tương lai, GradBot có thể được cải thiện hơn nữa bằng cách tích hợp các kỹ thuật mạng nơ-ron (Neural Network) tiên tiến hơn và mở rộng phạm vi của tập dữ liệu đối thoại.

5.1. Hướng Phát Triển Tiếp Theo Của Mô Hình GradBot

Các hướng phát triển tiềm năng cho GradBot bao gồm việc tích hợp các kỹ thuật mạng nơ-ron (Neural Network) tiên tiến hơn, chẳng hạn như Transformer và Attention Mechanism. Ngoài ra, việc mở rộng phạm vi của tập dữ liệu đối thoại và cải thiện khả năng hiểu ngôn ngữ tự nhiên của hệ thống cũng là những ưu tiên quan trọng. Điều này sẽ giúp GradBot trở nên thông minh hơn, linh hoạt hơn và hữu ích hơn.

5.2. Tác Động Của GradBot Đến Tương Lai Của Tương Tác Người Máy

GradBot có tiềm năng thay đổi cách chúng ta tương tác với máy móc. Bằng cách cung cấp một giao diện đối thoại tự nhiên và hiệu quả, GradBot có thể giúp giảm bớt sự phức tạp của công nghệ và làm cho nó dễ tiếp cận hơn với mọi người. Điều này có thể dẫn đến một tương lai mà máy móc trở thành đối tác thực sự của chúng ta, giúp chúng ta hoàn thành công việc, học hỏi và kết nối với nhau.

18/04/2025

Bạn đang xem trước tài liệu:

Gradbot mô hình theo dõi trạng thái đối thoại và phản hồi đối thoại thống nhất cho cả đối thoại theo định hướng nhiệm vụ tod và đối thoại chung mọi miền odd

Tải đầy đủ

Trích đoạn nội dung tài liệu

AIP490 Project GradBot - A unified dialogue state tracking and dialogue response model for Task Oriented Dialogues (TOD) and Open Domain Dialogues (ODD) by Sinh Nguyen, Truc Nguyen THE FPT UNIVERSITY HO CHI MINH CITY ITS Faculty FPT University HCMC Final Capstone Project 2 of 75 GradBot - A unified dialogue state tracking and dialogue response model for Task Oriented Dialogues (TOD) and Open Domain Dialogues (ODD) by Sinh Nguyen, Truc Nguyen Supervisor: Dr. Nguyen Quoc Trung, Dr. Truong Hoang Vinh A final year capstone project submitted in partial fulfillment of the requirement for the Degree of Bachelor of Artificial Intelligence in Computer Science DEPARTMENT OF ITS THE FPT UNIVERSITY HO CHI MINH CITY April 2024 (Month year) Final Capstone Project 3 of 75 ACKNOWLEDGMENTS The project we are working on is under the ownership of Gradients Technologies. During our internship with the company, we were granted permission by the Director to utilize this project for our academic requirements, specifically for our graduation project.

The conceptualization and execution of the project plan were primarily our contributions. Prior to initiating the research, we were equipped with fundamental knowledge by our mentors and project managers. They also provided us with substantial support in refining our methodologies to optimize results. Furthermore, they assisted us in reviewing our code after each task.

For the training of models across all modules in this system, we utilized the company's GPU resources. Upon the conclusion of our internship, we, along with our instructors, were given the opportunity to continue our research and further development on the project. This professional experience has been instrumental in our academic and career growth. Final Capstone Project 4 of 75 AUTHOR CONTRIBUTIONS Conceptualization, Sinh Nguyen and Truc Nguyen and Gradients Technologies.; methodology, Sinh Nguyen and Truc Nguyen and Gradients Technologies; software, Sinh Nguyen and Truc Nguyen; validation, Sinh Nguyen and Gradients Technologies; formal analysis, Sinh Nguyen and Truc Nguyen; investigation, Gradients Technologies; resources, Gradients Technologies; data curation, Sinh Nguyen and Truc Nguyen; writing—original draft preparation, Sinh Nguyen and Truc Nguyen; writing—review and editing, Sinh Nguyen and Truc Nguyen; visualization, Sinh Nguyen and Truc Nguyen; supervision, Dr.Nguyen Quoc Trung and Dr.

Truong Hoang Vinh; project administration, Gradients Technologies; funding acquisition, Sinh Nguyen and Truc Nguyen. All authors have read and agreed to the Final Capstone Project document. Final Capstone Project 5 of 75 ABSTRACT The system for task-oriented dialogue domain requires classifying user intent and replying to a specific goal domain. Within the task-oriented sub-module, the Dialogue State Tracker (DST) is well-known as a variety processing tracker.

Nonetheless, current DST models tend to focus solely on task-oriented domains (ToD), resulting in constrained performance when deployed in varied scenarios. Besides, current dialogue response models of previous studies achieved quite poor results because the responses were not natural and fluent. In this paper, we propose GradBot, a unified system including DST model vs response model that predicts both types of tasks, task-oriented dialogue (TOD) and open domain dialogue (ODD). Our model leverages the recent advances in prompt engineering and conditional generation to perform zero-shot learning.

After experiments, GradBot has achieved an 88.5% score on Joint Goal Accuracy metrics when evaluating the Scheme-Guided Dialogue (SGD) and FusedChat test sets correspondingly, demonstrating the adaptation ability for multi-domains. Keywords: Task Oriented Dialogue, Open Domain Dialogue, Dialogue State Tracking, Conversational AI Final Capstone Project 6 of 75 CONTENTS ACKNOWLEDGMENTS 3 AUTHOR CONTRIBUTIONS 4 ABSTRACT 5 CONTENTS 6 List of Figures 9 List of Tables 10 1.1 Dialogue State Tracking 16 2.2 Enhance Reading Comprehension 16 2. PROJECT MANAGEMENT PLAN 20 3.1 Overall Project Objective 20 3.1 Set up the system’s architecture 23 4.1 Consistency and Compatibility Between Modules 25 4.2 Flexibility to Change and Upgrade the System 26 4.3 Optimal training time 27 4.4 Suitability for Research or Production Needs 28 Final Capstone Project 7 of 75 4.3 Optimization in training 29 4.3 Combine Model Parallel and Data Parallel 34 4.4 Choosing checkpoint model 36 4.1 Multitask pre-training 36 4.2 Instruction training and Chain of Thought training 38 4.5 Complexity of building Schema-guided Definition 40 4.6 Predefined structure model 42 4.1 Schema-guided representation 42 4.2 Dialogue context representation 42 4.3 Action enhances constraint 43 4.7 DST as guided Reading Conversation 44 5.2 Schema-Guided Dialogue (SGD) 49 5.1 Overview Schema-Guided 49 5.2 Schema-Guided Approach 50 5.4 Comparison With Other Datasets 52 5.1 Overview 53 Final Capstone Project 8 of 75 5.2 Annotation Error Types 53 6.1 Joint Goal Accuracy 56 6. RESULTS AND DISCUSSION 61 7.1 Performance on FusedChat 61 7.2 Performance on SGD 62 7.3 Performance on MultiWoz 2.

CONCLUSIONS AND PERSPECTIVES 66 9. APPENDIX 72 Final Capstone Project 9 of 75 List of Figures INTRODUCTION Figure 1. An example of Attraction Domain on Fusedchat datasets. The conversation builds from MutiWoz2.4 by rewriting the existing Task-oriented domain turns and adding new Open Dialogue domain turns.

Pytorch Fully Sharded Data Parallel (FSDP) 32 4. Decomposing All-Reduce Operations in Distributed Data Parallel Training: A Path to Full Parameter Sharding. Efficient Machine Learning with Hybrid Parallelism: The diagram shows a hybrid approach using model and data parallelism for efficient machine learning. It optimizes resource utilization and closely matches the speed of data parallelism, ideal for smaller research teams.

Our fine tuning data comprises 473 datasets, and 1,836 total tasks. Results of Flan T5 and T5 on MMLU, BBH, TyDiQA, MGSM. Compare using chain-of-thought training and not using CoT training. Compare using instruction training and not using instruction training.

Overview of GradBot approach for schema-guided multi-domains dialogues. The bottom figure includes specific examples for dialogue context, user action, ontology and current query while the top figure stimulates predictions. Their dialogue system allows a user and a digital assistant to switch between (TOD) and (ODD) modes. An example includes a query about college fees (TOD) and a chat about personal growth and finance (ODD).

An TOD + ODD instance from FusedChat 48 5. An ODD + TOD instance from FusedChat 49 5. Example schema for a digital wallet service 50 5. In the context of two distinct flight services, dialogue state tracking labels are applied after each user statement.

With the schema-guided method, these annotations depend on the service’s schema, located at the extreme left/right. Examples of each error type. Overview simultaneously enhances the construct meaning of the input and target value. 57 Final Capstone Project 10 of 75 List of Tables 3.

Above are our main tasks assignments. Source data 22 5 Table 3. Statistics for SGD, FusedChat, and MultiWoz2.4, computed across train, validation, and test sets. FusedChat incorporates MultiWoz2.4, with the addition of ODD to its TOD part.

In SGD, ”unique” slots are represented in italics, and the number of slot values includes those for ”categorical” slots. Experimental results on the FUSEDCHAT test set with Join Goal Accuracy (JGA), Slot Accuracy (SA), F1-score performance. Two models from FUSEDCHAT [23] are cited to compare for JGA and SA metrics, their parameters are not referred to in the original paper so that we hide it. Addition F1 column is reported to ensure proper tracking of the dialogue’s type (ODD or TOD).

Our GradTOD model’s performance is written in italics. Experimental results on the SGD test set with Join Goal Accuracy (JGA) performance on seen and unseen domains, the value with Large Language Model (params more than 1B) and our GradTOD model written in bold and in italics, respectively. Comparison of performance between state-of-the-art research on MultiWoz 2. The result of SOM-DST on MultiWoz 2.4 is referred to on [34].

The highest score with Encoder only and Large Language Model (Seq2Seq) are written in bold while our GradTOD model is written in italics, respectively. Example of Metadata miss slots 67 Appendix Table 8. Example of Dialog act miss slots 68 Appendix Table 9. Example of Metadata and dialog act miss slots 68 Appendix Table 10.

Example of Dialog act miss slots and values 69 Appendix Table 11. Example of Inconsistent values 69 Final Capstone Project 11 of 75 1. INTRODUCTION The task-oriented domain has attracted a lot of attention not only in academics but also in industry. This objective is to achieve specific strategies, such as providing information or performing an action that satisfies the user’s request.

Specifically, the task-oriented system will replace most product consultants, reservation staff and customer service staff. This system can interact with users and allow them to carry out intentions such as: buying products, searching or booking hotel rooms, restaurant tables, making medical appointments, buying music tickets, buying tickets flight, train, taxi, etc. and actions include: providing characteristics of the hotel or restaurant that the user wants to search or book, request the system about the place (does it has internet? How much does it cost ? etc), accept orders, confirm orders, etc. After receiving actions and intentions from the user, the system will respond to the user with system actions such as: providing information about hotels and restaurants that the user requests, offering hotel, restaurant in the user’s destination, performing order, etc.

One of the crucial components of the task-oriented domain is Dialogue State Tracking (DST), which tries to predict appropriate actions to resolve the goals. At every turn, DST has to look up the dialogue history (whole or sliding window) to the current user query to determine user intentions, actions with specific values in the slot list [1, 2]. In our observation, there are two kinds of DST designed: ● The traditional method uses an Encoder module exploiting multihead layers to build classified data intent prediction, slots prediction, and slot filling [1, 3]; ● Seq2Seq module uses prompting to show semantics between turns and ontology through conversation to predict a required value [4–6]. Final Capstone Project 12 of 75 In industrial applications, DST is required to adapt flexibly new domains (services) without prior training for a specific task.

For this purpose, the role of zeroshot prediction on unseen domains becomes important in DST. Some previous work [4–6] uses guided schema as a description to show the semantics of slots with input sentences (user query). With recent advances in pre-trained language models [7–9], augmented language techniques are gaining more and more attention. These methods have demonstrated impressive improvement and zero-shot adaptability [3, 10, 11].

Moreover, the in-context learning framework (ICL) shows efficient methods and techniques in DST without the re-training stage by combining prompting and examples for a task (few-shot) [12, 13]. Often other studies only focus on the ability to predict user actions and intentions using the DST model, without researching the ability to respond to users appropriately and fluently. Therefore, our system incorporates a Dialogue Response model designed to interact with users in a way that depends on the predictions made by the DST model (including the user’s actions and intentions). Historically, dialogue response models were primarily developed for conversational AI systems, with a particular focus on question answering systems.

The input model mainly consists of historical context and the current user query. In cases where the system is required to respond to untrained queries, documents retrieved from the internet or a database are added to the input model to provide knowledge to help the model answer user queries. However, to satisfy the user’s task-oriented requests, the Dialogue Response model needs to include a main component, called ‘system action’. This component represents the action that the system will take in response to the user, after receiving the user’s actions and intentions.

Final Capstone Project 13 of 75 Fig 1. An example of Attraction Domain on Fusedchat datasets. The conversation builds from MutiWoz2.4 by rewriting the existing Task-oriented domain turns and adding new Open Dialogue domain turns. More specifically, Figure 1 shows an example conversation with the associated dialogue state of the attraction domain.

The user wants to find information about a specified name and request more data about the phone number, address, and area. At the same time, they ask if going to the museum is useful or not (general domain)?

Nội dung được bảo vệ bản quyền — Tải xuống đầy đủ

Tài liệu "GradBot: Mô Hình Theo Dõi Trạng Thái Đối Thoại và Phản Hồi Đối Thoại Đa Nhiệm" giới thiệu một mô hình tiên tiến trong việc theo dõi trạng thái và phản hồi trong các cuộc đối thoại đa nhiệm. Mô hình này không chỉ giúp cải thiện khả năng tương tác giữa người và máy mà còn tối ưu hóa trải nghiệm người dùng thông qua việc hiểu và phản ứng linh hoạt với ngữ cảnh của cuộc trò chuyện. Độc giả sẽ tìm thấy những lợi ích rõ ràng từ việc áp dụng mô hình này trong các ứng dụng thực tiễn, từ chatbot đến hệ thống hỗ trợ khách hàng.

Để mở rộng kiến thức về các khía cạnh liên quan đến phân tích cảm xúc và xử lý ngôn ngữ tự nhiên, bạn có thể tham khảo thêm tài liệu Luận án tiến sĩ khoa học máy tính phân tích cảm xúc trên cơ sở trị cảm xúc chuyển dịch theo ngữ cảnh cho tiếng việt, nơi cung cấp cái nhìn sâu sắc về cách cảm xúc được phân tích trong ngữ cảnh tiếng Việt. Bên cạnh đó, tài liệu Luận án tiến sĩ khoa học máy tính phân giải đồng tham chiếu đối tượng cho phân tích cảm xúc sẽ giúp bạn hiểu rõ hơn về các phương pháp phân giải trong phân tích cảm xúc. Cuối cùng, tài liệu Luận văn thạc sĩ khoa học máy tính mô hình markov ẩn bậc cao cho việc xác định cảm xúc từ văn bản sẽ cung cấp thêm thông tin về các mô hình thống kê trong việc xác định cảm xúc từ văn bản. Những tài liệu này sẽ là nguồn tài nguyên quý giá để bạn khám phá sâu hơn về lĩnh vực này.

#xử lý ngôn ngữ tự nhiên

#tương tác người-máy

#Mô hình theo dõi trạng thái

#Phản hồi đối thoại đa nhiệm

#Công nghệ AI trong đối thoại

#Hệ thống đối thoại thông minh

Chủ đề

Xử Lý Ngôn Ngữ Tự Nhiên

Công nghệ AI và ứng dụng

Mô hình đối thoại trong AI

Tương lai của giao tiếp máy tính