General Performance/Efficiency Improvements #14

Closed
radix0000 opened this issue Jun 21, 2023 · 2 comments

Comments

@radix0000
Collaborator

There are a number of performance/efficiency improvements to the current pipeline that we may want to consider. Faster pipeline execution would help across the board, in production as well as in future development. It would be especially beneficial to either of the two main options (2 or 3) for improving handling of updates to input data (see #9), since both of these options on their own would likely result in significantly longer processing times. Improvements to consider:

  1. Improve XML parser performance
  2. Optimise Elasticsearch usage
  3. Decouple various sub-stages of pipeline (with concurrency or separate processes)
  4. Possibly improve Kinesis usage (though that has seen some work already)
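
To illustrate item 3, here is a minimal sketch of decoupling sub-stages with threads and bounded queues. It is not taken from this project's code: `run_stage`, `run_pipeline`, and the `maxsize` default are hypothetical names and values, and the stage functions stand in for whatever the real parsing and indexing steps are. Each stage runs in its own thread, so a slow stage no longer blocks the ones before it until the queue between them fills up.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the stream


def run_stage(func, inbox, outbox):
    """Apply func to every item from inbox and put the result on outbox."""
    while True:
        item = inbox.get()
        if item is _DONE:
            outbox.put(_DONE)  # propagate shutdown to the next stage
            return
        outbox.put(func(item))


def run_pipeline(items, stages, maxsize=100):
    """Run each stage in its own thread, connected by bounded FIFO queues.

    Bounded queues give backpressure: a fast stage blocks instead of
    filling memory while it waits for a slower downstream stage.
    """
    queues = [queue.Queue(maxsize=maxsize) for _ in range(len(stages) + 1)]
    workers = [
        threading.Thread(target=run_stage, args=(func, queues[i], queues[i + 1]))
        for i, func in enumerate(stages)
    ]

    def feed():
        # Feed input from a separate thread so the caller can drain output.
        for item in items:
            queues[0].put(item)
        queues[0].put(_DONE)

    feeder = threading.Thread(target=feed)
    for thread in workers + [feeder]:
        thread.start()

    results = []
    while True:
        out = queues[-1].get()
        if out is _DONE:
            break
        results.append(out)

    for thread in workers + [feeder]:
        thread.join()
    return results


if __name__ == "__main__":
    # Hypothetical stand-ins for real pipeline stages.
    print(run_pipeline(range(5), [lambda x: x + 1, lambda x: x * 2]))
```

With one thread per stage, output order matches input order. The sketch has no error handling: an exception inside a stage function would end that thread and stall the pipeline, so a real version would need to catch and forward failures. Threads mainly help I/O-bound stages (network calls to Elasticsearch or Kinesis); CPU-bound parsing in Python would more likely need separate processes.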

Depending on exactly where the bottlenecks are, there are likely to be significant performance gains achievable with a small amount of effort, which would provide a good foundation to move forward from.
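
One rough way to find where the bottlenecks are is to accumulate wall-clock time per stage before optimising anything. The sketch below assumes nothing about the actual pipeline: `StageTimer` and the stage names are hypothetical, and only the standard library is used.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class StageTimer:
    """Accumulate wall-clock time and call counts per named stage."""

    def __init__(self):
        self.totals = defaultdict(float)
        self.calls = defaultdict(int)

    @contextmanager
    def measure(self, stage):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Record the time even if the stage raised an exception.
            self.totals[stage] += time.perf_counter() - start
            self.calls[stage] += 1

    def slowest_first(self):
        """Return (stage, total_seconds) pairs, slowest stage first."""
        return sorted(self.totals.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    timer = StageTimer()
    with timer.measure("parse"):  # hypothetical stage name
        time.sleep(0.02)          # stand-in for real work
    with timer.measure("index"):  # hypothetical stage name
        pass
    for stage, seconds in timer.slowest_first():
        print(f"{stage}: {seconds:.3f}s over {timer.calls[stage]} call(s)")
```

Wrapping each sub-stage in `measure` for a representative run gives a ranked list of where the time goes, which helps decide which of the four items above to tackle first.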

@kathryn-ods

@radix0000 has this been done? I know you made a lot of performance improvements recently

@radix0000
Collaborator Author

Closing as done for now, but I have created a new issue to put a pin in a potential future memory issue: #19
