Core guidelines that govern how things work inside ELGarash.
-
An experiment should be lightweight, with a clear end goal, take at max a week, and be easily disposable.
-
The end goal of the experiment should be clear, if the end goal is a UI for example, we provide a mockup.
a. the mockup should be simple, preferably done in tldraw.
b. If the experiment is data transformation (e.g., removing tashkeel), we provide a few examples.
-
If the experiment is concerned with generating a new dataset, we document the new dataset.
a. Columns names.
b. Naming schemes.
c. etc...
-
We keep a file called
terminology.md
and keep updating it with new terminology. We stick to single terminology (e.g., we don't usesheik
andreciter
interchangeably stick to one). -
We stick to a single notation
a. In mathematics.
b. Also for data annotation (e.g., the playback speed should be
x1.5
, don't use1.5x
or(1.5x)
interchangeably). -
Figures should have titles below them with the following format:
<p align="center"> <span style="color: #888888;"><em>Figure <experiment name><figure number></em>: <description>.</span> </p> <p align="center"> <img src="./media/<img_path>" alt="<alt>"> </p>
a. The
img_path
should match<experiment name><figure number>
. -
Tables should have titles above them with the following format:
<span style="color: #888888;">_Table <experiment name><table number>_: <description>.</span>
a. We add descriptive column/row names.
b. Units should be specified in the column/row not in values, and put in parentheses, for example,
WER (%)
, orExecution time (s)
. -
If we are comparing the results of two experiments, put the results of the old experiment in the current one, don't reference it.
-
Links to the associated notebooks should added to the experiment report.
-
If we are quoting, word to word, from a paper, we use the quote syntax in markdown:
> This is a quote
-
We add references as the final section of the report.
-
No Experiment should go un-reviewed, open a PR, and someone else should review, sign-off, and approve your experiment.
-
No direct pushes to main.
-
Even if the experiment fails, it should be reported.
-
We Keep our writings short and persuasive, and don’t use extra words → Hemingway app can help with that.
-
Magician’s Mark: Whenever we feel like writing “due to the fact that,” “as a result,” “therefore,” “for that reason,” “accordingly,” or even “because,” we use a colon instead to make the reader ask “why”:
-
Seize the chance to pepper our sentences with such verbs. Try swapping the underlined language for some visual and evocative action words.
-
That Reminds Me: we use analogies.
-
Starting Gate: we start sentences fast.
-
Parallels:
Parallel structure means using the same pattern of words to show that two or more ideas have the same level of importance. This can happen at the word, phrase, or clause level. The usual way to join parallel structures is with the use of coordinating conjunctions such as "and" or "or."
-
Titles are 80% of the work, but you write it as the very last thing. It has to be a compelling opinion or important learning
-
Check out the resources in the dev-writing repo.
-
Check out How to Write a Machine Learning Paper for (not so) Dummies
-
We use the right scientific tense, check out How to Use Tenses within Scientific Writing.
- Points 2, 3, 4, 5, and 6 are from Five Ways to Write Like The Economist.