Releases: ITM-Kitware/align-system
0.3.2
0.3.1
0.3.1
Added
- Added new Random ADM (action based; chooses random action and action parameters)
- Added additional metrics evaluation candidate ADM configs
- Added logging for final scenario state (alignment scores are provided there in the unstructured field)
Changed
- Changed the TA3ActionBased interface class to accept a list of scenario IDs to work through (rather than an individual scenario ID)
- No longer restricting the SITREP action based on unvisited and conscious characters
Fixed
- Fixed issue where Llama2SingleKDMAADM tagging selection could choose an invalid tag
- Not allowing actions that require a character ID to be taken when no characters exist
- Handling rare corner case where generic APPLY_TREATMENT action could be repeated forever
- Fixed mentions of "continuation of care" in maximization prompts
0.3.0
0.3.0
Added
- Added new driver script for TA3 interactions that uses a new YAML config format for ADMs
- Added several ADM config files for new driver script
- Added a new ADM HybridKaleidoADM which defers to a Llama2SingleKDMAADM instance to fill out action parameters
- Added new abstract class for action based ADMs (called ActionBasedADM), requires a
choose_action
method - Implemented ActionBasedADM
choose_action
method on the KaleidoADM, Llama2SingleKDMAADM, and a new ADM HybridKaleidoADM - Added alignment accuracy metric in self-evaluation framework
- Added re-usable methods for filling out action parameters to Llama2SingleKDMAADM
- Added short KDMA descriptions for moral deservingness and maximization for Kaleido
- Added new prompt template for selecting the target character of an action
- Added high and low alignment system prompts for SoarTech's maximization KDMA
Changed
- Replaced instances of "casualties" with "characters" as per the new new TA3 scenario data format
- Changed TA3 interface component over to using TA3 client module (rather than raw HTTP requests)
- Moved the previous
run_align_system.py
script torun_simplified_align_system.py
, replacing it with the new primary CLI script - Updated README with respect to new CLI script
- Changed some prompts to not display vitals with a value of None
Fixed
- Fixed issue with logging of choice scores after multiple-sampling with voting
- Fixed issue where per-sample LLM outputs weren't being logged correctly
0.2.4
Version 0.2.4
Added
-
Added Kaleido ADM and dedicated Kaleido CLI script
-
Added
partial
option toformat_template
function for partial template completion -
Added
allow_extraneous
option toformat_template
function to ignore extraneous kwargs
Fixed
- Fixed setting the
loglevel
in CLI scripts
0.2.3
Version 0.2.3
Added
-
Added --loglevel CLI argument for
run_action_based_chat_baseline.py
script -
Added LanguageModel, ChatLanguageModel classes for ADMs to inherit from
-
Added AlignedDecisionMaker interface for ADMs to implement
-
Added template system for ADMs to use
-
Added evaluation library code to measure ADM performance
-
Added ChatKDMAPredictingADM ADM
-
Added a few tests for LanguageModel and ChatLanguageModel classes
Changed
Fixed
-
Fixed issue where TA3 training session flag wasn't being passed to the TA3 API
-
Removing training session data info from "action to take" passed to TA3 API
0.2.2
Version 0.2.2
Added
-
Added capability to loop over several scenarios in one system run for
run_chat_baseline.py
CLI script -
Added alignment capabilities to
run_chat_baseline.py
CLI script -
Added rich logging capability with the help of the
rich
library
Changed
Fixed
-
Fixed iteration over scenarios / alignment targets with TA1 APIs
-
Fixed
--precision
argument inrun_chat_baseline.py
CLI script
0.2.1
Version 0.2.1
Added
-
Added aligned decision making capabilities to
llm_chat_baseline.py
algorithm -
Added multiple sampling along with a voting scheme for aligned decision making with the
llm_chat_baseline.py
algorithm -
Added several alignment prompts for MVP2 KDMAs
Changed
-
Updated action-based chat baseline CLI to use new alignment capabilities
-
Changed simple alignment prompt engineering approach to consider a heavy emphasis on a given KDMA when the value is
> 5
(rather than>= 3
). This is consistent with how to consider KDMAs with the more sophisticated prompt engineering approach
Fixed
0.2.0
Version 0.2.0
Added
-
Added llama 2 chat action-based ADM (via new CLI script
run_action_based_chat_baseline
) -
Added llama-index falcon action-based ADM (via new CLI script
run_action_based_align_system
) -
Added support for CACI's new action-based TA3 interface; along with new action-based template CLI script
-
Added support for new probe types "PatientOrdering", "SelectTag", and "SelectTreatment"
Changed
-
Environment now expects Python version >=3.9 (rather than exactly 3.8)
-
Deprecated support for old TA3 interface (code not fully removed yet)
-
Updated several depedency versions
-
Changed BERT implementation to
bert_score
package
0.1.1
0.1.0
Version 0.1.0
Added
-
Now using poetry to manage dependencies and added
pyproject.toml
andpoetry.lock
in support of this -
Added example template CLI script for custom system interface development along with associated documentation
Changed
-
Collapsed main CLI scripts into a single script
run_align_system
-
Re-arranged codebase to be pip installable
-
Factored out interfaces, for TA3 and local files, into re-usable components