Add support for proxying OpenAI chat completion through cloud #148
Conversation
wavesrv/pkg/remote/openai/openai.go
Outdated
if opts.Model == "" {
	return nil, fmt.Errorf("no openai model specified")
}
conn, _, err := websocket.DefaultDialer.Dial(AWSLambdaCentralWSAddr, nil)
Use DialContext and we should set some sensible timeout value for connecting
Addressed, set a 20 second timeout
I'm thinking we should completely remove the non-streaming paths in the code. Not sure they are really useful. Can't think of a time when we wouldn't want to stream the results back?
High-latency environments might want non-streaming completion? Web security considerations? I think removing it makes sense.
Force-pushed 8abde7e to 5d0dd32: …ouod completion, added capability to unset token, other small fixes
Force-pushed 2175ae9 to 037c4b8
}
cloudCompletionRequestConfig := sstore.OpenAICloudCompletionRequest{
	Prompt:    prompt,
	MaxTokens: opts.MaxTokens,
I still let the user specify max tokens and max choices for the cloud. Should I use the defaults instead, or is this OK?
I think we shouldn't let the user specify these. We should just set them in the cloud.
@@ -66,6 +67,10 @@ const TermFontSizeMax = 24

const TsFormatStr = "2006-01-02 15:04:05"

const OpenAIPacketTimeout = 10 * time.Second

const OpenAICloudCompletionTelemetryOffErrorMsg = "In order to protect against abuse, you must have telemetry turned on in order to use Wave's free AI features. If you do not want to turn telemetry on, you can still use Wave's AI features by adding your own OpenAI key in Settings. Note that when you use your own key, requests are not proxied through Wave's servers and will be sent directly to the OpenAI API."
nit: there are a few double spaces between sentences in this message
… and error). render cmd status 'error' with red x as well. show exitcode in tooltip of 'x'
If a token is not set, chat completion will now be proxied through the cloud. You can unset your token through the UI.
The cloud uses the default model, currently "gpt-3.5-turbo".
It allows you to set the max tokens and max choices parameters.
If you don't have telemetry turned on, you cannot use this feature.
-- CONSIDERATIONS --
For now, this is set to only hit our dev lambda server. When we deploy the completion code to our prod lambda server, we should add a check for whether we are running in dev or prod.
-- TESTING --
I tested with telemetry on and off, and confirmed that the correct error shows. I tested the channel timeout and confirmed that if the socket hangs, the channel times out and sends an error, and the terminal can then continue as normal.
I have tested all of the standard code paths: when a token is set we still do completion locally, and when a token is not set we do cloud completion. I confirmed that the token can now be set to an empty string, and it looks good in the UI (it goes back to "not set"). I have also checked the socket failed-to-connect error.
There are some more unlikely error paths that I haven't checked because I couldn't reproduce those errors (JSON decode, etc.). I also haven't tested the socket connect timeout error, but I expect these paths to work as described because the behavior is fairly standard.
What do we think about the way I do testing? I can write unit tests in the future.