For all new customers of Sage, please fill out the following link: https://airtable.com/appn1MXE52b4GKKbp/pagSyV3tDuaHDl4hL/form
Due to performance issues, Sage no longer supports csv
as an output file. All files must not be in the form of parquet
.
Because of this change, a large number of columns had to be reworked to handle the additional of a true schema being saved with the parquet files.
In a decision to keep things easier to maintain in the future, column names have been reworked to use more of the natural naming conventions.
- Actions to take migrating >1.6 or higher (if previously installed).
- Patch to v1.6 or higher
- Patch/Update the local code-environment for Sage
- Rebuild the new Streamlit Code-Studio Template
- Rename both the original
partitioned_data
andbase_data
folders (if wanting to preserve historical data)- If you do not want to preserve, drop and delete folders with
Drop data
selected
- If you do not want to preserve, drop and delete folders with
- Run both
Init Dashboard
andInit Worker
Macros - Double check that all code-environments properly rebuilt in the extra worker nodes
- Let scenarios run at scheduled time or manually kick off first time
- Author - Stephen Mazzei
- Email - Stephen.Mazzei@dataiku.com
- Version - 1.6.2
- Special Thanks
- Development
- Jordan Burke
- Ben Bourgeois
- Jonathan Sill
- Documentation
- Rob Harris
- Project Management
- Arjun Srivatsa
- Development
This dashboard is designed to give Dataiku Admins insights into the DSS instance.
- DSS at a glance
- Individual objects, statistics, graphs
- Maintenance and performance reviews
- v14.1.0 (v1.6 and below)
- v14.0.0 (v1.5 and below)
- v13.5.5 (v1.5 and below)
Due to the web application being built on Streamlit, installation requires a bit of dedicated code use. Hoping this changes in later DSS versions.
- Plugin
- Login as an admin account
- Migrate to
Waffle::Plugins
and install from GIT: https://github.com/dataiku/dss-plugin-sage-insights - Build the code-environment, no containers needed
- After the plugin is installed, switch to the plugin settings page and fill in the information ("EXAMPLE BELOW")
- Sage Github Repo
- Sage Dashboard Information
SAGE_DASHBOARD
- Hostname or IP:Port
- Admin Level Api Key
SAGE_WORKER
- Ignore certs if needed, Default is
False
- Sage Worker Information
- Fill out each host including the local host if you want to track the local host. Need both Hostname or IP:Port and Admin level API Key
- Code Studios
- Create the template name
sage
# this name is important - Setup K8s to run on
- Add the
Sage Dashboard - Streamlit
block - Disable permissions for users
- Build
- Create the template name
- Create the Sage Dashboard project based off 1.4.2 information
- Go to Macros
- Filter on
Sage Insights: Initialize
- Run
Initialize Dashboard
- Run
Initialize Workers
- Switch to Code Studios page under the Code tab
- Click the checkbox and publish as a Web Application (No API for this)
- Start the Web Application (Auto-Start)
- Nothing may be available at first while the first day cycle needs to run to gather data