My thesis work tackled the challenge of making sense of extremely noisy ♦︎, high-dimensional data from multidimensional photoemission spectroscopy (MPES), covering everything from the statistical and physical understanding of the measurement process to the development of a state-of-the-art denoising scheme.
- Wrangling Raw Data: Started from raw single-event MPES datasets and binned them into multidimensional images at different noise levels (a minimal binning sketch follows this list).
- Statistical Analysis: Characterized the electron counting statistics and corrected for double (and multi-hit) counting from the detector using a kNN-based scheme (a proximity-filter sketch is given below).
- Exploring Classical Methods: Tested BM3D with variance stabilization (via the Anscombe transform) on moderately noisy datasets. It works great, but only at lower noise levels (the transform-denoise-invert recipe is sketched below).
- Dataset Design: Designed the training and validation sets from a single, comparatively low-noise dataset (one possible splitting strategy is sketched below).
- Deep Learning: Trained a 3D U-Net with the Noise2Noise approach, successfully denoising images in the ultra-low-count regime where traditional methods give up (a minimal training step is sketched below).
- Real-World Validation: Put everything to the test by running inference on new data, proving that the model could reveal features in 10 minutes that would take hours of conventional acquisition to capture.
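
For the binning step, the general idea is a multidimensional histogram over the event coordinates. The sketch below is a minimal illustration with NumPy; the coordinate axes, bin counts, and ranges are placeholders rather than the actual thesis settings.

```python
import numpy as np

# Hypothetical single-event table: one row per detected electron with
# (kx, ky, energy, delay) coordinates -- column order is illustrative only.
rng = np.random.default_rng(0)
events = rng.normal(size=(100_000, 4))  # stand-in for real MPES events

# Bin the events into a 4D count hypercube; each voxel holds an electron count.
bins = (64, 64, 128, 32)                       # (kx, ky, E, delay) resolution
ranges = [(-3, 3), (-3, 3), (-3, 3), (-3, 3)]  # axis limits in detector units
counts, edges = np.histogramdd(events, bins=bins, range=ranges)

print(counts.shape, counts.sum())  # (64, 64, 128, 32), total binned electrons
```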
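The multi-hit correction in the thesis is kNN-based; the sketch below only illustrates the general flavor of proximity-based duplicate removal with a k-d tree, using a made-up distance threshold and stand-in event coordinates.

```python
import numpy as np
from scipy.spatial import cKDTree

def deduplicate_events(events, radius=0.5):
    """Drop events that fall within `radius` of another event, a crude proxy
    for multi-hit artefacts on the delay-line detector (threshold is made up)."""
    tree = cKDTree(events)
    pairs = tree.query_pairs(r=radius)          # all (i, j) pairs closer than radius
    duplicates = {max(i, j) for i, j in pairs}  # keep the first event of each pair
    keep = np.setdiff1d(np.arange(len(events)), list(duplicates))
    return events[keep]

rng = np.random.default_rng(1)
events = rng.uniform(0, 100, size=(10_000, 3))  # (x, y, time) stand-in coordinates
print(len(deduplicate_events(events)))
```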
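For the classical baseline, the usual recipe is: apply the Anscombe transform so the Poisson noise becomes approximately Gaussian with unit variance, run BM3D, then invert the transform. The sketch below shows this on a single 2D slice; the `bm3d` PyPI package and its `sigma_psd` argument are assumptions on my part, and an exact unbiased inverse transform would be preferable at very low counts.

```python
import numpy as np
import bm3d  # pip install bm3d -- API assumed, check the package docs

def anscombe(x):
    """Variance-stabilizing transform for Poisson counts."""
    return 2.0 * np.sqrt(x + 3.0 / 8.0)

def inverse_anscombe(y):
    """Simple algebraic inverse (an unbiased inverse works better at low counts)."""
    return (y / 2.0) ** 2 - 3.0 / 8.0

def denoise_slice(counts_2d):
    stabilized = anscombe(counts_2d)
    # After stabilization the noise is approximately Gaussian with sigma ~ 1.
    denoised = bm3d.bm3d(stabilized, sigma_psd=1.0)
    return inverse_anscombe(denoised)

rng = np.random.default_rng(2)
noisy = rng.poisson(5.0, size=(128, 128)).astype(float)
clean_estimate = denoise_slice(noisy)
```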
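One way to build Noise2Noise pairs from a single acquisition is binomial thinning: randomly assigning each detected count to one of two halves, which keeps both halves Poisson-distributed and statistically independent. Whether this is exactly the split used in the thesis is not spelled out here, so treat the sketch as illustrative.

```python
import numpy as np

def split_counts(counts, p=0.5, seed=0):
    """Split a Poisson-count volume into two independent noisy realizations.

    Binomial thinning of Poisson counts yields two volumes with means
    p*lambda and (1-p)*lambda, usable as Noise2Noise input/target pairs.
    """
    rng = np.random.default_rng(seed)
    half_a = rng.binomial(counts.astype(np.int64), p)
    half_b = counts - half_a
    return half_a, half_b

rng = np.random.default_rng(3)
volume = rng.poisson(2.0, size=(64, 64, 64))
input_vol, target_vol = split_counts(volume)
```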
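Finally, a minimal Noise2Noise training step, assuming a PyTorch setup. The tiny 3D convolutional stack stands in for the actual 3D U-Net, and the batch shapes, learning rate, and L2 loss are placeholder choices, not the thesis configuration.

```python
import torch
import torch.nn as nn

# Tiny 3D conv network as a stand-in for the full 3D U-Net.
model = nn.Sequential(
    nn.Conv3d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv3d(16, 16, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv3d(16, 1, kernel_size=3, padding=1),
)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

# Noise2Noise: input and target are two noisy realizations of the same signal.
noisy_a = torch.rand(4, 1, 32, 32, 32)  # placeholder batch (B, C, D, H, W)
noisy_b = torch.rand(4, 1, 32, 32, 32)

for step in range(10):
    optimizer.zero_grad()
    prediction = model(noisy_a)
    loss = loss_fn(prediction, noisy_b)  # regress one noisy copy onto the other
    loss.backward()
    optimizer.step()
```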
Simplified visualization of electron events on the 3D delay-line detector as a function of time. These events are binned to form images. In this setup, thousands of electrons might be detected every second; in laser-based setups, this can reach millions.
This work solves a real problem: saving time and resources at some of the world’s most advanced research facilities, such as the free-electron laser FLASH. By optimizing data acquisition, researchers can focus on exploring new physics instead of waiting around for their data to converge.
♦︎ Some physicists might object to the term 'noise', since the measurement process is inherently probabilistic; nevertheless, these inherent fluctuations are what 'noise' refers to within this work.