Use new prompt for QuestionAnswerTool #645

cecheta · 2024-04-09T17:08:41Z

Required by #322 , closes #648

Purpose

This PR includes changes the prompt used in the QuestionAnswerTool to the main_prompt used in On Your Data.
The new prompt includes a mix of system, user and AI messages, instead of one single user message
The new prompt also includes an example to help the LLM, which can be configured on or off
For backwards compatibility, if the old prompt has been set in the config, then this will continue to be used, with no changes
Increase unit test coverage (not for streamlit)

The new prompt + few-shot example are configurable in the Admin app. There is JSON schema validation on the retrieved documents to warn against uploading invalid JSON.

Does this introduce a breaking change?

[ ] Yes
[x] No

Pull Request Type

What kind of change does this Pull Request introduce?

[ ] Bugfix
[x] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[ ] Other... Please describe:

How to Test

Run app and admin app locally
Ask questions from the frontend
Change the prompt in the Admin app

What to Check

Check new prompt is being used
Configure prompt in Admin app
Configure few-shot example

github-actions · 2024-04-09T17:10:01Z

Coverage Report •

File	Stmts	Miss	Cover	Missing
code
app.py	14	14	0%	1–4, 6–7, 9, 12–14, 16, 18, 20–21
create_app.py	148	3	97%	199, 204, 327
code/backend
Admin.py	23	23	0%	1–6, 9, 11, 13–14, 16, 19–20, 22–23, 26, 33, 40, 43–45, 47, 49
code/backend/batch
function_app.py	16	16	0%	1–8, 10, 12–13, 15, 18–21
code/backend/batch/utilities/helpers
ConfigHelper.py	79	1	98%	113
EnvHelper.py	120	9	92%	194, 199–200, 203–205, 215–217
code/backend/batch/utilities/tools
QuestionAnswerTool.py	65	2	96%	44–45
code/backend/pages
04_Configuration.py	118	118	0%	1–8, 10, 12, 14, 21, 28, 30, 35–40, 43–48, 51–52, 55–66, 68–69, 79–81, 83–86, 89–91, 94–95, 100–101, 105–107, 110–111, 114–115, 118–119, 142, 144–145, 147–151, 153–156, 159–163, 170–171, 174–177, 179, 199–200, 202, 204, 210, 217, 224, 232, 239–240, 247, 249–250, 254, 261, 266, 272, 284–285, 302–303, 307, 309–310, 326, 358–359, 361–362
TOTAL	1827	830	54%

Tests	Skipped	Failures	Errors	Time
89	0 💤	0 ❌	0 🔥	10.469s ⏱️

code/backend/batch/utilities/tools/QuestionAnswerTool.py

cecheta · 2024-04-09T17:12:34Z

Get review from Project Wednesday before merging

cecheta · 2024-04-10T09:15:52Z

~~Putting in draft for now, to attempt to add few-shot example to the config~~ Added to config

adamdougal

This is great! Only a couple of minor comments!

(I'll hold off approving until the review next week)

code/backend/batch/utilities/tools/QuestionAnswerTool.py

superhindupur · 2024-04-15T08:37:03Z

code/backend/batch/utilities/helpers/ConfigHelper.py

+- **You cannot list the citation at the end of response.
+- Every claim statement you generated must have at least one citation.**""",
+                "answering_user_prompt": """## Retrieved Documents
+{documents}


I understand that the prompt has been copied from Azure OpenAI On Your Data, but this is a very large prompt, and we've been told by data scientists in the past that long prompts don't work very well since the LLM tends to only "remember" the last few sentences and forget what was said to it in the beginning (attention bias, I think it is called). Should we get this reviewed by a data scientist?

My concern is that we don't have a standard process for DS evaluation - what if we make the performance worse.

I agree that we should get as many reviews as possible, do you know who could review this for us?

We could check with Malvina/Eran if they can take a look at this. It doesn't have to be a blocker for this PR though, since users are allowed to change the prompt if they want to. Can totally be done as a follow-up.

cecheta · 2024-04-15T09:56:31Z

code/backend/batch/utilities/helpers/ConfigHelper.py

+                config = json.loads(config_file)
+
+                # These properties may not exist in the config file as they are newer
+                config["prompts"]["answering_system_prompt"] = config["prompts"].get(


I am planning to add tests to cover these scenarios once the core of the PR has been reviewed

code/backend/batch/utilities/helpers/ConfigHelper.py

cecheta · 2024-04-17T16:23:35Z

Will create new pull request, due to vast number of changes since opening

cecheta commented Apr 9, 2024

View reviewed changes

code/backend/batch/utilities/tools/QuestionAnswerTool.py Outdated Show resolved Hide resolved

cecheta requested review from ross-p-smith, adamdougal, superhindupur, komalg1 and tanya-borisova April 9, 2024 17:11

cecheta marked this pull request as ready for review April 9, 2024 17:11

cecheta changed the title ~~Cecheta/main prompt~~ Use new prompt for QuestionAnswerTool Apr 9, 2024

ross-p-smith requested review from gmndrg and ruoccofabrizio April 9, 2024 17:21

cecheta marked this pull request as draft April 10, 2024 08:21

cecheta force-pushed the cecheta/main-prompt branch from 24ce9d5 to 8aaf006 Compare April 10, 2024 14:52

cecheta marked this pull request as ready for review April 10, 2024 15:26

adamdougal reviewed Apr 11, 2024

View reviewed changes

code/backend/batch/utilities/tools/QuestionAnswerTool.py Outdated Show resolved Hide resolved

code/backend/batch/utilities/tools/QuestionAnswerTool.py Outdated Show resolved Hide resolved

cecheta added 10 commits April 11, 2024 14:36

Add new prompt for QuestionAnswerTool

40487cb

Add poetry install to deploy scripts

8d729af

Move config to self

5583999

Add unit tests

3b88ecd

Remove comments

c93a9ec

Move few-shot example to config

434595d

Fixes + Update tests

cc8dd10

Format

e3bd20d

Remove poetry install

205efb4

Add warnings, log warnings, fix tests

3a7f1e4

cecheta force-pushed the cecheta/main-prompt branch from bc8d5eb to 3a7f1e4 Compare April 11, 2024 15:39

Merge remote-tracking branch 'origin/main' into cecheta/main-prompt

c33d41b

cecheta force-pushed the cecheta/main-prompt branch from ae6b9bf to dbe6c36 Compare April 12, 2024 13:05

Update + simplify tests

a39dbd5

cecheta force-pushed the cecheta/main-prompt branch from dbe6c36 to a39dbd5 Compare April 12, 2024 13:40

Merge remote-tracking branch 'origin/main' into cecheta/main-prompt

1152ed7

superhindupur reviewed Apr 15, 2024

View reviewed changes

cecheta added 3 commits April 15, 2024 09:45

Fix config when old config file has been saved

451711b

Merge remote-tracking branch 'origin/main' into cecheta/main-prompt

90605fc

Fix tests

4ec1ca8

cecheta commented Apr 15, 2024

View reviewed changes

superhindupur reviewed Apr 16, 2024

View reviewed changes

code/backend/batch/utilities/helpers/ConfigHelper.py Show resolved Hide resolved

cecheta added 2 commits April 16, 2024 14:47

Merge remote-tracking branch 'origin/main' into cecheta/main-prompt

1745198

Add language to prompt

952eb73

superhindupur mentioned this pull request Apr 17, 2024

Enable CWYD to accept user input as speech in multiple languages. #317

Closed

4 tasks

cecheta added 2 commits April 17, 2024 09:17

Catch JSON decode error

2717643

Merge remote-tracking branch 'origin/main' into cecheta/main-prompt

82f487b

cecheta closed this Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use new prompt for QuestionAnswerTool #645

Use new prompt for QuestionAnswerTool #645

Uh oh!

cecheta commented Apr 9, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Apr 9, 2024 •

edited

Loading

Uh oh!

Uh oh!

cecheta commented Apr 9, 2024

Uh oh!

cecheta commented Apr 10, 2024 •

edited

Loading

Uh oh!

adamdougal left a comment

Uh oh!

Uh oh!

Uh oh!

superhindupur Apr 15, 2024

Uh oh!

superhindupur Apr 15, 2024

Uh oh!

cecheta Apr 15, 2024

Uh oh!

superhindupur Apr 15, 2024

Uh oh!

cecheta Apr 15, 2024 •

edited

Loading

Uh oh!

Uh oh!

cecheta commented Apr 17, 2024

Uh oh!

Uh oh!

Use new prompt for QuestionAnswerTool #645

Use new prompt for QuestionAnswerTool #645

Uh oh!

Conversation

cecheta commented Apr 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Does this introduce a breaking change?

Pull Request Type

How to Test

What to Check

Uh oh!

github-actions bot commented Apr 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

cecheta commented Apr 9, 2024

Uh oh!

cecheta commented Apr 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adamdougal left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

superhindupur Apr 15, 2024

Choose a reason for hiding this comment

Uh oh!

superhindupur Apr 15, 2024

Choose a reason for hiding this comment

Uh oh!

cecheta Apr 15, 2024

Choose a reason for hiding this comment

Uh oh!

superhindupur Apr 15, 2024

Choose a reason for hiding this comment

Uh oh!

cecheta Apr 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cecheta commented Apr 17, 2024

Uh oh!

Uh oh!

cecheta commented Apr 9, 2024 •

edited

Loading

github-actions bot commented Apr 9, 2024 •

edited

Loading

cecheta commented Apr 10, 2024 •

edited

Loading

cecheta Apr 15, 2024 •

edited

Loading