Create a list of keywords for STATUS messages to better organize well data #3
Comments
Based on Jimmy's work with NLTK on the status messages, we have a list of keywords that correspond to different failure modes: https://www.lucidchart.com/documents/edit/26a13991-a3a9-4fb2-8572-16b497b7e191?shared=true&
Environmental drivers:
Some words used to tag mechanical failures (e.g. 'construction') are also applied to wells that are in fact working (e.g. 'STATUS' = 'Functional ( in use)|New Under construction'). Should we consider using bigrams, or remove 'FUNC' = 'Yes' entries from consideration for mechanical failures?
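To make the bigram idea concrete, here is a minimal sketch using only the standard library (a stand-in for NLTK's bigram helper). The `FAILURE_BIGRAMS` set and the second example message are made up for illustration and are not taken from the project's keyword list.

```python
import re

# Hypothetical bigrams that would signal a mechanical failure (illustration only).
FAILURE_BIGRAMS = {("pump", "broken"), ("poor", "construction")}

def tokenize(status):
    """Lowercase a STATUS message and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", status.lower())

def bigrams(tokens):
    """Return adjacent token pairs as a list."""
    return list(zip(tokens, tokens[1:]))

def has_failure_bigram(status):
    """True if any adjacent word pair in the message is a known failure bigram."""
    return any(pair in FAILURE_BIGRAMS for pair in bigrams(tokenize(status)))

working = "Functional ( in use)|New Under construction"
failing = "Not functional, poor construction"  # made-up message
```

With pairs instead of single words, 'under construction' no longer matches just because it contains 'construction', while 'poor construction' still does.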
I think it is fine to process them for now (if we have a `MECH_FAIL` column, have entries even if `FUNC` is yes). We can exclude the `FUNC = Yes` entries when we do failure analysis, then maybe next year's group can worry about cleaning up our data a little bit :)
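A rough sketch of that approach, assuming rows are plain dictionaries: every row gets a `MECH_FAIL` flag, and the `FUNC = Yes` rows are dropped only at analysis time. The sample rows and the `MECH_KEYWORDS` tuple are hypothetical; only the column names come from this thread.

```python
# Hypothetical rows; only the column names STATUS, FUNC and MECH_FAIL come from the thread.
wells = [
    {"STATUS": "Functional ( in use)|New Under construction", "FUNC": "Yes"},
    {"STATUS": "Pump broken", "FUNC": "No"},
    {"STATUS": "Dry season", "FUNC": "No"},
]

# Assumed keyword list, for illustration only.
MECH_KEYWORDS = ("construction", "broken")

def flag_mech_fail(rows):
    """Add a MECH_FAIL flag to every row, regardless of FUNC."""
    for row in rows:
        status = row["STATUS"].lower()
        row["MECH_FAIL"] = any(word in status for word in MECH_KEYWORDS)
    return rows

def failures_for_analysis(rows):
    """Keep only flagged rows whose FUNC is not 'Yes'."""
    return [r for r in rows if r["MECH_FAIL"] and r["FUNC"] != "Yes"]

flagged = flag_mech_fail(wells)
to_analyze = failures_for_analysis(flagged)
```

The working well still carries `MECH_FAIL = True` in the data, so nothing is lost, but it does not show up in the failure analysis.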
In addition to the well functional binary (YES/NO), we also have status messages, e.g.,
So we need to figure out some of these keywords in order to make better categories of well failure conditions.
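As a starting point, keyword-based categorization could look something like the sketch below. The category names echo the mechanical and environmental groupings discussed in this issue, but the keywords themselves and the example messages are placeholders, not the actual list.

```python
# Placeholder categories and keywords; the real list would come from the keyword chart.
CATEGORY_KEYWORDS = {
    "mechanical": ["broken", "pump", "construction"],
    "environmental": ["dry", "salty"],
}

def categorize(status):
    """Return the sorted categories whose keywords appear in a STATUS message."""
    text = status.lower()
    return sorted(
        category
        for category, words in CATEGORY_KEYWORDS.items()
        if any(word in text for word in words)
    )
```

This uses plain substring matching, so a keyword like 'dry' would also match inside longer words; tokenizing first would be stricter.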