Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updated code for extraction #8

Merged
merged 79 commits into from
May 1, 2024
Merged

Conversation

venvis
Copy link
Collaborator

@venvis venvis commented Feb 27, 2024

The three python files has the code for extraction/scraping of operative part from the documents of the respected Celex Id's and accordingly writes them into csv,json and txt files . For now the txt,json method has been commented under the call function of the class Writing() in the class ### output.py.

Copy link
Member

@shashankmc shashankmc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

https://code.visualstudio.com/docs/python/linting -- If you areusing visual studio code, please check linting, add a linter and/or a formatter - https://code.visualstudio.com/docs/python/formatting

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are the class methods for operative part actually being called for testing? For the previous tests, methods are called under main ; Not sure what's happening here?




# one=self.html_page_structure_one()
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is unused code still part of the PR?

three.append(y.text)
return three


Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You need to clean up such that this sticks to PEP8 standard. Unnecessary new lines and indentation in code. Please lint your code.

@venvis venvis requested a review from shashankmc March 21, 2024 08:52
@shashankmc shashankmc changed the base branch from main to cellar March 28, 2024 07:53
@shashankmc shashankmc merged commit 1807938 into cellar May 1, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants