initial commit of alliteration filter #251

ahoimarie · 2021-08-31T16:53:15Z

A simple alliteration filter that checks if a sentence is an alliteration or not.

filters/alliteration/README.md

ahoimarie · 2021-09-04T18:36:01Z

Sorry, I did not mean to close the pull request. I wanted to force-reload from the upstream branch to undo my erroneous changes to the wrong folder, which had caused the checks to fail.

james-simon · 2021-09-16T23:52:33Z

Simply superb submission! Alliteration's always an amusing attribute to amplify. I'm a reviewer, and I just have a few minor comments.

In your README, you don't state what your exact criterion for alliterativity is. It's clear from your code, but it's worth mentioning towards the top of the README.
That criterion's pretty strong - few people would say that "Peter Piper picked a peck of pickled peppers" isn't alliteration! I think it'd make sense to weaken it (e.g. allow one off-word for every $k$ alliterative words) or allow the user to set the filter strength (e.g. take $k$ as an argument).
For multi-sentence input, what's your rationale for ignoring all but the first sentence? I'd intuitively expect that the filter would either check whether the entire passage was alliterative on the same letter or whether each sentence is independently alliterative.
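
To make the trade-off in these comments concrete, here is a minimal Python sketch contrasting a strict rule with a relaxed one. It is not the filter's actual code: it assumes the strict rule requires every word to share an initial sound, it uses the first letter as a crude stand-in for the first phoneme, and the names `initials`, `strict_alliteration`, and `relaxed_alliteration` are hypothetical.

```python
import re


def initials(sentence):
    """Lower-cased first letter of each word.

    Assumption: the first letter stands in for the first phoneme,
    which the real filter may compute differently.
    """
    return [w[0].lower() for w in re.findall(r"[A-Za-z][A-Za-z']*", sentence)]


def strict_alliteration(sentence):
    """True only if every word starts with the same letter (assumed strict rule)."""
    letters = initials(sentence)
    return len(letters) > 1 and len(set(letters)) == 1


def relaxed_alliteration(sentence, k=3):
    """Allow one off-word for every k words that match the first word's letter."""
    letters = initials(sentence)
    if len(letters) < 2:
        return False
    hits = letters.count(letters[0])
    off_words = len(letters) - hits
    return off_words <= hits // k


twister = "Peter Piper picked a peck of pickled peppers"
print(strict_alliteration(twister), relaxed_alliteration(twister, k=3))
```

Under these assumptions the tongue twister fails the strict check but passes the relaxed one with k = 3, since it has six p-words and two off-words.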

Solid work! Let me know if you have any questions.

rteehas · 2021-09-18T13:36:12Z

I enjoyed this submission as well. I agree with @james-simon, but on the last point I think that it would make the most sense to treat each sentence independently, since that's probably more likely than an entire passage being alliterative.

mille-s · 2021-09-21T11:48:09Z

Thanks for the submission! I think it would be good to add a few words on why sentences with alliterations are a challenge, and for which tasks; right now this is not clear to me (even if I like the idea). As mentioned previously, please also add the method you apply to select the sentences. By the way, this is tagged as "Transformation" but should probably be "Filter".

ahoimarie · 2021-09-22T14:50:34Z

Hi @james-simon, hello @rteehas,
gratitude and greetings! Alliterations always attract audiences.

I added a few more lines in the README to explain my criterion for alliterativity. Fixed in 9940840
I have added a new variable min_alliteration_length to relax the criterion. Now, a sentence is deemed an alliteration if it contains at least min_alliteration_length words starting with the (first) phoneme found. Would it make sense to use not the first phoneme but the one that occurs most often in each sentence?
I wanted to avoid checking whether a whole paragraph was alliterative because I thought that would practically never be the case. But I agree that ignoring the remaining sentences is not optimal either. I have now changed it to go through all the input sentences and to return true if any of the sentences is alliterative. I think this is what @rteehas suggested too? Alternatively, it would be possible to check each sentence individually, but I think having separate outcomes for each input sentence is not the aim of this filter?
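
The rule described in these points can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in 9940840: the first letter stands in for the first phoneme, sentence splitting is a naive regex, and `split_sentences`, `sentence_is_alliterative`, `any_sentence_alliterative`, and the `use_most_common` switch are hypothetical names. The switch shows the alternative raised above, counting the most frequent initial instead of the first one.

```python
import re
from collections import Counter


def initials(sentence):
    """Lower-cased first letter of each word (assumed stand-in for the phoneme)."""
    return [w[0].lower() for w in re.findall(r"[A-Za-z][A-Za-z']*", sentence)]


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def sentence_is_alliterative(sentence, min_alliteration_length=3,
                             use_most_common=False):
    """Count words sharing the first word's initial, or the most common initial."""
    letters = initials(sentence)
    if not letters:
        return False
    if use_most_common:
        count = Counter(letters).most_common(1)[0][1]
    else:
        count = letters.count(letters[0])
    return count >= min_alliteration_length


def any_sentence_alliterative(text, **kwargs):
    """Return True if at least one sentence of the input passes the check."""
    return any(sentence_is_alliterative(s, **kwargs) for s in split_sentences(text))


print(any_sentence_alliterative("It rained. Peter Piper picked peppers."))
```

With `use_most_common=True`, a sentence such as "A big bad bear" would pass even though its first word does not take part in the alliteration.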

Does this make sense?
Thanks again!

ahoimarie · 2021-09-22T14:53:26Z

Hello @mille-s,
thanks for your feedback! I have added a few more lines in the README to discuss alliterations and their use. Please see 9940840. Would you like me to expand on it?
Thanks again!

james-simon · 2021-09-22T16:20:58Z

Ahoy! Although arguably ameliorating the aforementioned ails, your changes created certain contrasting concerns:

If the criterion for alliterativity is that any k words start with the first phoneme, won't a lot of long sentences satisfy it by accident? For example, if a sentence starts with "The" and includes two other "the"s later, wouldn't that count? That doesn't really check for alliteration; I think the alliterative words have to be close together. If you agree, you should rethink the rule. As an example, a criterion I might choose would take two arguments, min_alliteration_length and allowable_offwords, and check whether a sentence contains any run (not necessarily starting with the first word) of min_alliteration_length alliterative words, allowing up to allowable_offwords off-words to break up the run. For example, the sentence "Look, guys: Peter Piper picked a peck of pickled peppers" would satisfy min_alliteration_length=6 with allowable_offwords=2 (i.e. "a" and "of"). This criterion seems much closer to what the typical reader would actually call an alliterative sentence. Does that make sense?
the header "Why is it a challenge?" doesn't seem right; I'd say something like "Why is this filter important?" instead
typo in the README: "will they not be removed"
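
One possible reading of the run-based criterion proposed in the first point, sketched in Python. The parameter names min_alliteration_length and allowable_offwords come from the comment; everything else is an assumption for illustration, including the helper `initials`, the function name `has_alliterative_run`, and the use of first letters in place of phonemes.

```python
import re


def initials(sentence):
    """Lower-cased first letter of each word (assumed stand-in for the phoneme)."""
    return [w[0].lower() for w in re.findall(r"[A-Za-z][A-Za-z']*", sentence)]


def has_alliterative_run(sentence, min_alliteration_length=3, allowable_offwords=2):
    """Look for a run of matching initials that tolerates a few off-words.

    Every word is tried as the start of a run. The scan from that word
    stops as soon as more than `allowable_offwords` non-matching words
    have been seen.
    """
    letters = initials(sentence)
    for start, target in enumerate(letters):
        hits = 0
        off_words = 0
        for letter in letters[start:]:
            if letter == target:
                hits += 1
                if hits >= min_alliteration_length:
                    return True
            else:
                off_words += 1
                if off_words > allowable_offwords:
                    break
    return False


example = "Look, guys: Peter Piper picked a peck of pickled peppers"
print(has_alliterative_run(example, min_alliteration_length=6, allowable_offwords=2))
```

In this sketch a sentence with three scattered "the"s, such as "The cat sat on the mat by the door", does not pass with the default settings, because more than two off-words separate the matching words.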

I think checking for a single alliterative sentence makes sense.

mille-s · 2021-09-23T14:47:46Z

@ahoimarie thanks for the details. I agree it could be used in a generation setting to see whether a model manages to create alliterations, although I suspect that the subpopulations with alliterations in the datasets we are using will be extremely small. With other (more literary) datasets, though, that could be nice.

ahoimarie · 2021-09-23T19:44:30Z

Hello! Hopefully having handled your handy help here, I imagine it indeed improved its impact.

Instead of checking the first phoneme of the sentence, I now use a rolling window of size min_alliteration_length+allowable_offwords to check if it contains one phoneme of at least the frequency min_alliteration_length in it. If the sentence is shorter than min_alliteration_length+allowable_offwords, the window reduces to min_alliteration_length.
Agreed, and changed.
Yes, thanks!
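
The rolling-window check described in the first point might look roughly like this. It is a sketch of the description only, not the code in 7172a66: first letters again stand in for phonemes, and `initials` and `rolling_window_alliteration` are hypothetical names.

```python
import re
from collections import Counter


def initials(sentence):
    """Lower-cased first letter of each word (assumed stand-in for the phoneme)."""
    return [w[0].lower() for w in re.findall(r"[A-Za-z][A-Za-z']*", sentence)]


def rolling_window_alliteration(sentence, min_alliteration_length=3,
                                allowable_offwords=2):
    """Slide a window over the word initials and count the most common one.

    The window spans min_alliteration_length + allowable_offwords words.
    If the sentence is shorter than that, the window shrinks to
    min_alliteration_length, as described in the comment above.
    """
    letters = initials(sentence)
    window = min_alliteration_length + allowable_offwords
    if len(letters) < window:
        window = min_alliteration_length
    # For sentences shorter than the window this range is empty.
    for start in range(len(letters) - window + 1):
        counts = Counter(letters[start:start + window])
        if counts.most_common(1)[0][1] >= min_alliteration_length:
            return True
    return False


example = "Look, guys: Peter Piper picked a peck of pickled peppers"
print(rolling_window_alliteration(example, min_alliteration_length=6,
                                  allowable_offwords=2))
```

Compared with scanning for a run from each word, the window form only needs a frequency count per window, at the cost of also accepting windows in which the off-words sit at the edges rather than inside the run.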

Thanks! Thorough thoughtful things!
The thing is, the theme of this thrilling thread thrives without thumbing through a thick thesaurus.

james-simon · 2021-09-23T21:33:11Z

Amazing! Apt augmentations, all. Two tiny things:

I think "If the windowlen is smaller than the length of the data" in your docstring is meaning-reversed
Is allowable_offwords a parameter the user can specify, or is it always 2? It'd be better if it could be chosen

After the aforementioned alterations are achieved, I'll animatedly accept.

ahoimarie · 2021-09-24T13:40:01Z

Awesome as always. Appreciated!

Impressive inspection! I immediately edited it.
allowed_offwords is indeed a parameter the user can specify. For example print(Alliteration(stopwords = False, min_alliteration_length=5, allowed_offwords=3).filter("Illuminating illustration of this inane but interesting instruction.")) will return True but calling it with allowed_offwords=2 won't.

Any additional amendments?

james-simon · 2021-09-24T17:42:21Z

Ja, I'm happy! Great filter; every doubt's crushed. Bravo! Accept.

Makefile

kaustubhdhole · 2021-10-01T13:38:16Z

All looks good. Please fix the makefile and I am happy to merge :)

kaustubhdhole · 2021-10-08T04:18:26Z

The makefile still shows in the tracking as deleted.

ahoimarie · 2021-10-30T09:33:51Z

Hi @kaustubhdhole, sorry for the delay, I have been on sick leave because of an injury.
I've added the Makefile again -- does it work now or shall I create a new PR?

kaustubhdhole · 2021-11-15T00:53:13Z

@ahoimarie I missed this message somehow. Please create a separate PR and link this PR in the comment.

kaustubhdhole · 2021-11-15T00:53:52Z

Ensure that when you click "Files Changed", the makefile remains untouched.

ahoimarie · 2021-11-16T12:41:01Z

@kaustubhdhole I removed the Makefile from the pull request with a little trick mentioned on Stack Overflow, as creating a separate PR kept failing. This should solve it too, shouldn't it?

kaustubhdhole reviewed Sep 2, 2021

filters/alliteration/README.md (outdated)

kaustubhdhole added the transformation label Sep 2, 2021

ahoimarie closed this Sep 4, 2021

ahoimarie force-pushed the alliteration branch from 70513dc to 57fd7da Compare September 4, 2021 18:19

re-added submission, after having cleaned up the tree

3c12348

ahoimarie reopened this Sep 4, 2021

ahoimarie added 3 commits September 7, 2021 21:38

added keyword

e09da52

added evaluation results and tweaked code to make evaluations work.

527ffb0

added Data statement

64e174d

kaustubhdhole requested a review from mille-s September 20, 2021 17:15

AbinayaM02 added filter and removed transformation labels Sep 21, 2021

added minimum_alliteration_length, included all input text sentences,…

9940840

… and amended README

ahoimarie added 2 commits September 23, 2021 20:53

updated README and robustness scores

dbcd9fb

changed criterion to check for alliterations

7172a66

corrected docstring for rolling_window

1ac30cf

mille-s approved these changes Sep 27, 2021

kaustubhdhole reviewed Sep 27, 2021

Makefile (outdated)

Update Makefile

866a310

remove Makefile from tracking

58e744c

added Makefile

5abb695

Removed a modified makefile from pull request

95e2d67

ahoimarie mentioned this pull request Nov 16, 2021

Alliteration filter #369

Open
