Expand cookbook #104

jacobvjk · 2024-09-04T11:17:14Z

jacobvjk · 2024-09-04T11:19:21Z

@cjyetman @jdhoffa I think in the end a cookbook that can be used as the main information hub for users to run P4S by themselves could have a structure like this. Feel free to comment any additions or aspects that you think do not belong here. I realize these are a lot of bullets, but in some cases multiple of those be may combined into a single page.

jdhoffa · 2024-09-09T09:34:22Z

I think the structure looks great! I have some comments (might be things you are already thinking of, but good to have them explicit!). Also not necessarily saying each of these comments is strictly necessary, just brainstorming 😄

I'll start with General suggestions, and then more specific suggestions per section.

General suggestions

Modularity
Consider making each section as modular as possible, so users can jump to the section they need without following the entire document in order.

Troubleshooting and FAQs
Each top-level section (e.g., setup, running analysis, outputs) could end with a “Common Issues” or “FAQ” section to anticipate user difficulties.

Glossary
A glossary of technical terms might be useful for less-experienced users.

Resources
Provide links to any other external resources/ documentation/ the main PACTA website.

Specific suggestions

Title page/Intro/Context

Should clearly communicate the purpose of the tool and its overall capabilities.
Consider briefly outlining the audience (who is this for?) and why the tool is valuable for them.
A high-level diagram or flowchart showing how different stages in the analysis interact might help contextualize the later sections.

Preparatory steps

It would be helpful to provide a concise "checklist" of the required software (e.g., R, RStudio, dependencies) before diving into detailed instructions.
Minimum versions of software dependencies should be defined/noted to avoid compatibility issues.

Data Input

Clarify the format of the input data (e.g., .csv, .xlsx) and any specific formatting requirements.
For external inputs, adding instructions on how to validate or clean the data before using it in the tool could prevent user errors later on.
The distinction between required and optional datasets should be visually obvious, perhaps using different sections, icons, or a color-coded table.

Installing relevant software

A step-by-step approach is essential here. Users appreciate precise details, so specifying terminal commands (if applicable) and any known issues/troubleshooting steps could make this section more comprehensive.
Always better to be linking or referring to any pre-existing documentation here (installing RStudio etc.), if available, to avoid duplicating information.

Setting up the project

Clarity is key in this critical section. If the configuration file or folder structure is complex, providing examples or templates will help the user.
Consider including a visual representation of the folder structure (e.g. can use the tree cli to do this), so the user can clearly see where different files should be located.

Running the analysis

"Basic flow of analysis" sounds general. Consider breaking this down into smaller sub-tasks and using numbered steps or a flowchart to show dependencies between tasks (e.g., Matching -> Data Prep -> Main Analysis).
Including a “Troubleshooting” section here would help users who encounter issues, especially in matching data, loan book misclassification, and iteration processes.

Expected Outputs

In the "Expected PACTA outputs" and "Net Aggregate Alignment outputs" sections, example outputs (with explanations of each column/graph) will be crucial. Users will benefit from screenshots or generated output samples.
Cross-reference the data dictionary as much as possible.

Basic interpretation of outputs:

This section should provide simple, understandable interpretations of the results for both non-technical and technical audiences. Ensure that common questions or confusions around the outputs are addressed here.
If possible, include practical examples of what decisions or next steps might be taken based on different output scenarios.

Advanced use cases:

I’d recommend starting each use case with a problem statement or scenario and showing step-by-step how to adjust the config/inputs to achieve the desired result.

cjyetman mentioned this issue Sep 19, 2024

update README #139

Open

This was referenced Sep 25, 2024

arrange sections of cookbook and add some introductory content #158

Merged

Draft cookbook preparatory steps #161

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expand cookbook #104

Expand cookbook #104

jacobvjk commented Sep 4, 2024 •

edited

Loading

jacobvjk commented Sep 4, 2024

jdhoffa commented Sep 9, 2024 •

edited

Loading

Expand cookbook #104

Expand cookbook #104

Comments

jacobvjk commented Sep 4, 2024 • edited Loading

jacobvjk commented Sep 4, 2024

jdhoffa commented Sep 9, 2024 • edited Loading

General suggestions

Specific suggestions

Title page/Intro/Context

Preparatory steps

Data Input

Installing relevant software

Setting up the project

Running the analysis

Expected Outputs

Basic interpretation of outputs:

Advanced use cases:

jacobvjk commented Sep 4, 2024 •

edited

Loading

jdhoffa commented Sep 9, 2024 •

edited

Loading