Refactor llm vii #358

mschwoer · 2024-10-23T15:25:03Z

make link between LLM function definitions and actual functions transparent
add general dimensionality reduction to DataSet
simplify some helpers
slightly refactor llm_integration
add unit tests for whole LLM module

…e transparent

JuliaS92 · 2024-11-08T14:59:21Z

alphastats/gui/pages/05_LLM.py

+            models,
+            index=models.index(st.session_state.get(StateKeys.MODEL_NAME))
+            if current_model is not None
+            else 0,


key = StateKeys.MODEL_NAME?

JuliaS92 · 2024-11-08T15:03:03Z

alphastats/gui/pages/05_LLM.py


-        if model_before != st.session_state[StateKeys.MODEL_NAME]:
+        if current_model != st.session_state[StateKeys.MODEL_NAME]:


If this is the intendend behaviour, then why is the config a st.fragment?

JuliaS92 · 2024-11-08T15:13:14Z

alphastats/DataSet.py

+    def perform_dimensionality_reduction(
+        self, method: str, group: Optional[str] = None, circle: bool = False
+    ):
+        """Generic wrapper for dimensionality reduction methods to be used by LLM.
+
+        Args:
+            method (str): "pca", "tsne", "umap"
+            group (str, optional): column in metadata that should be used for coloring. Defaults to None.
+            circle (bool, optional): draw circle around each group. Defaults to False.
+        """
+
+        result = {
+            "pca": self.plot_pca,
+            "tsne": self.plot_tsne,
+            "umap": self.plot_umap,
+        }.get(method)
+        if result is None:
+            raise ValueError(f"Invalid method: {method}")
+
+        return result(group=group, circle=circle)


It would be great to have something similarly simple for the differential analysis longterm.

JuliaS92 · 2024-11-08T15:33:23Z

tests/llm/test_llm_utils.py

+def test_get_protein_id_multiple_matches(gene_to_prot_map):
+    """Test with a gene that appears in multiple compound keys."""
+    result = get_protein_id_for_gene_name("MULTI", gene_to_prot_map)
+    assert result == "PROT1;PROT2;PROT3"


I think this is the same as test_get_protein_id_compound_key. Actually VCL would be the one matching multiple protein ids.

JuliaS92

I think we need to have a conversation about genes vs proteins with everyone and decide on one to use (i am in favor of protein ids) in the backend. Display can be different, but should match all cases. Otherwise kudos :)

mschwoer added 11 commits October 23, 2024 12:19

make link between LLM function definitions and actual definitions mor…

559ce85

…e transparent

use generic perform_dimensionality_reduction

500c60e

add tests for assistant functions

14baaa5

add tests for utils functions

278a314

add tests for llm_helper and refactor

457f7b1

first bunch of llm_integration unit tests

4f3f919

next bunch of llm_integration unit tests and refactoring

48b8dd1

next bunch of llm_integration unit tests

ced836d

last bunch of llm_integration unit tests

3979e07

simplify tests

396a2e0

introduce _chat_completion_create

548ae85

mschwoer requested review from JuliaS92 and boopthesnoot October 23, 2024 15:26

mschwoer added 7 commits October 23, 2024 17:28

fix imports

d5e9d84

fix some functionality

721245f

rename api_type -> model_name

1386b24

preserve selected model in selectbox

bc56430

make connection test more reliable

e8ff6c2

add option to reset LLM analysis

2e04da8

fix tests

4d1670c

JuliaS92 reviewed Nov 8, 2024

View reviewed changes

JuliaS92 approved these changes Nov 8, 2024

View reviewed changes

Base automatically changed from refactor_llm_VI to development November 8, 2024 16:38

mschwoer merged commit 85a9409 into development Nov 8, 2024
4 of 5 checks passed

mschwoer deleted the refactor_llm_VII branch November 8, 2024 16:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor llm vii #358

Refactor llm vii #358

mschwoer commented Oct 23, 2024 •

edited

Loading

JuliaS92 Nov 8, 2024

JuliaS92 Nov 8, 2024

JuliaS92 Nov 8, 2024

JuliaS92 Nov 8, 2024

JuliaS92 left a comment


		if model_before != st.session_state[StateKeys.MODEL_NAME]:
		if current_model != st.session_state[StateKeys.MODEL_NAME]:

Refactor llm vii #358

Refactor llm vii #358

Conversation

mschwoer commented Oct 23, 2024 • edited Loading

JuliaS92 Nov 8, 2024

Choose a reason for hiding this comment

JuliaS92 Nov 8, 2024

Choose a reason for hiding this comment

JuliaS92 Nov 8, 2024

Choose a reason for hiding this comment

JuliaS92 Nov 8, 2024

Choose a reason for hiding this comment

JuliaS92 left a comment

Choose a reason for hiding this comment

mschwoer commented Oct 23, 2024 •

edited

Loading