Support for parsing custom corpus #47

oveddan · 2019-10-04T20:11:40Z

It would be great if you supported parsing a custom corpus that a user can provide, that is already formatted in an argument that is recognizable, and can feed into createDataset.py
For example, if you could pass a text file that looks like:

message: How's it going today?
response: It's going alright
message: What's for dinner tonight?
response: Chicken baked with cheese

The text was updated successfully, but these errors were encountered:

adeshpande3 · 2019-10-05T16:25:21Z

Yeah totally agree with that. Things kinda busy for me on the life end, but PRs are welcome 🙏

caraneel · 2019-10-05T16:53:27Z

On this note-- is there any way to see what the format is of the document that is being fed to createDataset.py? I realize the actual files contain personal info, I just want to know the structure of it so I can recreate it. The fbchat-archive-parser doesn't work for my message data, so I want to re-create the file that would have resulted from running fbcap ./messages.htm > fbMessages.txt

adeshpande3 · 2019-10-05T18:17:05Z

Actually now that I think back to this project (it's been a while for me), I think the fbMessages.txt file is actually pretty similar to the format @oveddan was talking about (correct me if I'm wrong though). #28 Basically just username: message on each line and then you should enter your username here (https://github.com/adeshpande3/Facebook-Messenger-Bot/blob/master/createDataset.py#L7)

oveddan · 2019-10-05T18:27:26Z

ok so what you're saying is, it expects a file in the format of:

somePerson: hi how's it going
someOtherPerson: ok
somePerson: what's for dinner tonight?
someOtherPerson: chicken on rice

Would it work if given a file like this? It would be great if there a sample file format for fb messages somewhere in this repo, considering the fb message gathering repo is obsolete.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for parsing custom corpus #47

Support for parsing custom corpus #47

oveddan commented Oct 4, 2019

adeshpande3 commented Oct 5, 2019

caraneel commented Oct 5, 2019

adeshpande3 commented Oct 5, 2019

oveddan commented Oct 5, 2019

Support for parsing custom corpus #47

Support for parsing custom corpus #47

Comments

oveddan commented Oct 4, 2019

adeshpande3 commented Oct 5, 2019

caraneel commented Oct 5, 2019

adeshpande3 commented Oct 5, 2019

oveddan commented Oct 5, 2019