
Replace integrate.quad with native spline integration method #299

Open
tomaskontrimas wants to merge 5 commits into master from update_integrals

Conversation

@tomaskontrimas (Collaborator)

The d6a927a change speeds up the create_analysis call from 16.5 s to 12.6 s.

@tomaskontrimas tomaskontrimas self-assigned this Apr 13, 2026
@tomaskontrimas tomaskontrimas added the performance Performance could be improved label Apr 13, 2026
@tomaskontrimas (Collaborator, Author)

Omitting this check:

    # Check integrity.
    integral = (
        integrate.quad(
            self.f_e_spl.evaluate,
            self.log10_reco_e_min,
            self.log10_reco_e_max,
            limit=200,
            full_output=1,
        )[0]
        / self.f_e_spl.norm
    )
    if not np.isclose(integral, 1):
        raise ValueError(
            f'The integral over log10_reco_e of the energy term must be '
            f'unity! But it is {integral}!')

would speed up create_analysis further, to 9.16 s.

It is kind of nice to have, but I'm not sure if people will try to create their own PDSignalEnergyPDF; otherwise it should always pass by construction from PDSignalEnergyPDFSet, I think. Maybe we could cover it with create_analysis tests for both the 10yr and 14yr datasets?
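For context, a minimal sketch (toy data and toy values, not skyllh's actual FctSpline1D or PDSignalEnergyPDF classes) of why the spline-native integral can replace the quad-based check: both compute the integral of the same piecewise polynomial, but the native method uses the spline coefficients directly instead of many Python-level function evaluations.

```python
# Toy sketch (assumed setup, not skyllh's actual classes): the quad-based
# normalization integral and the spline's native integral agree, but the
# native call avoids repeated evaluations of the spline under quad.
import numpy as np
from scipy import integrate
from scipy.interpolate import PchipInterpolator

log10_e = np.linspace(2.0, 8.0, 50)              # hypothetical bin centers
pdf_vals = np.exp(-0.5 * (log10_e - 5.0) ** 2)   # toy energy-PDF values

spl = PchipInterpolator(log10_e, pdf_vals)

# quad repeatedly evaluates the spline; the native method integrates the
# piecewise-polynomial representation in one pass.
norm_quad = integrate.quad(
    spl, log10_e[0], log10_e[-1], limit=200, full_output=1)[0]
norm_native = float(spl.integrate(log10_e[0], log10_e[-1]))

assert np.isclose(norm_quad, norm_native)
```

The two results agree to quadrature precision, which is why dropping the quad-based integrity check (or moving it into a test) does not change the analysis.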

@chiarabellenghi (Member)

Yes, the correct normalization of the signal PDF should be enforced somehow, at least if people want to contribute to the development. So a unittest that ensures a normalized PDF sounds like a good alternative to me.

I remember that I (or Martin) put it there when I was developing the signal energy PDF for the public data, as it is an annoying sum of splines and the normalization needs to be carefully checked. But that works now, and I think it should always work when you build an energy PDF from the smearing matrix...

    if norm:
        self.norm = integrate.quad(
            self.__call__, self.x_min, self.x_max,
            limit=200, full_output=1)[0]
        # The spline is defined only in the `x` (bin centers) interval by
        # construction. We chose to not extrapolate out-of-range (oor) values.
Member

Why use bin centers instead of bin edges here? Is it because of PchipInterpolator?

Collaborator Author

Yes, the __call__(self, x, oor_value=0) method sets values outside of the bin-center range to 0, but calling the native self.spl_f.integrate method (which provides this speedup) fails outside of the bounds. I think the self.norm result is the same by definition, though.

We could use the extrapolate option:
https://docs.scipy.org/doc/scipy/reference/generated/scipy.interpolate.PchipInterpolator.integrate.html#scipy.interpolate.PchipInterpolator.integrate
but then it wouldn't match the PDF evaluation setup, so both would need to be updated.
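A toy illustration of the bounds behavior discussed here (hypothetical data, not the skyllh code): with extrapolate=False, PchipInterpolator.integrate yields NaN as soon as a limit lies outside the data interval, whereas the default (extrapolate=True) would silently extrapolate the end intervals.

```python
# Toy sketch of the out-of-bounds behavior (hypothetical data).
import numpy as np
from scipy.interpolate import PchipInterpolator

x = np.linspace(0.0, 1.0, 11)   # stand-in for the bin-center grid
y = 1.0 + x ** 2

spl = PchipInterpolator(x, y, extrapolate=False)

inside = float(spl.integrate(x[0], x[-1]))    # well-defined
outside = float(spl.integrate(-0.5, x[-1]))   # NaN: no extrapolation

assert np.isfinite(inside)
assert np.isnan(outside)
```

Turning extrapolation on would make the out-of-bounds integral finite, but then the integral and the zero-padded __call__ evaluation would disagree, which is the mismatch described above.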

Member

So, in other words, the current method of integration extends to the bin edges, but it's just adding zeros, whereas the new one simply integrates up to where values might not be zero?

Member

The fit of NGC 1068 could reveal any difference, if there is one, because there are many signal events. So if there is some discrepancy due to the normalization, it will add up there.

Collaborator Author

> So, in other words, the current method of integration extends to the bin
> edges, but it's just adding zeros, whereas the new one simply integrates up
> to where values might not be zero?

yes

@tomaskontrimas (Collaborator, Author)

Blocked by #302

@tomaskontrimas tomaskontrimas marked this pull request as ready for review April 16, 2026 19:04
Copilot AI review requested due to automatic review settings April 16, 2026 19:05
Copilot AI (Contributor) left a comment

Pull request overview

This PR speeds up the public-data point source analysis initialization (create_analysis) by removing expensive scipy.integrate.quad calls and replacing them with spline-native integration routines.

Changes:

  • Switch FctSpline1D normalization from integrate.quad to PchipInterpolator.integrate.
  • Switch effective-area integration in PDAeff.get_detection_prob_for_decnu from quad(splev) to interpolate.splint.
  • Remove a quad-based normalization integrity check in PDSignalEnergyPDF (and related SciPy imports).

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File                                           Description
skyllh/analyses/i3/publicdata_ps/utils.py      Uses PchipInterpolator.integrate for faster spline normalization.
skyllh/analyses/i3/publicdata_ps/signalpdf.py  Drops the quad import and removes the quad-based normalization integrity check.
skyllh/analyses/i3/publicdata_ps/aeff.py       Replaces quad-based spline integration with interpolate.splint for detection probabilities.


Comment on lines 324 to +333

        spl = interpolate.splrep(x, y, k=1, s=0)

        def _eval_spl_func(x):
            return interpolate.splev(x, spl, der=0, ext=1)

    -   norm = integrate.quad(
    -       _eval_spl_func, enu_range_min, enu_range_max,
    -       limit=200, full_output=1)[0]
    +   norm = interpolate.splint(enu_range_min, enu_range_max, spl)

        enu_min = np.atleast_1d(enu_min)
        enu_max = np.atleast_1d(enu_max)

        det_prob = np.empty((len(enu_min),), dtype=np.double)
        for i in range(len(enu_min)):
    -       integral = integrate.quad(
    -           _eval_spl_func, enu_min[i], enu_max[i],
    -           limit=200, full_output=1)[0]
    -       det_prob[i] = integral / norm
    +       det_prob[i] = interpolate.splint(enu_min[i], enu_max[i], spl) / norm
@tomaskontrimas (Collaborator, Author), Apr 16, 2026

From the docs: "splint silently assumes that the spline function is zero outside the data interval (a, b)."

which I think is better than the suggested implementation, and it still passes the existing analysis tests.
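A toy check of that splint behavior (illustrative data only, not the skyllh effective-area tables): widening the integration limits beyond the knot interval adds nothing, and the result matches quad over splev with ext=1 (which returns zero out of bounds), so the replacement is equivalent to the old quad-based code.

```python
# Toy check: splint treats the spline as zero outside its knot interval,
# matching quad over splev(..., ext=1) on a wider range.
import numpy as np
from scipy import integrate, interpolate

x = np.linspace(1.0, 10.0, 20)
y = np.sin(np.pi * (x - 1.0) / 9.0) ** 2   # toy curve, zero at both ends

spl = interpolate.splrep(x, y, k=1, s=0)

full = interpolate.splint(x[0], x[-1], spl)
wider = interpolate.splint(x[0] - 5.0, x[-1] + 5.0, spl)
assert np.isclose(full, wider)             # out-of-range part contributes zero

# The old quad-based approach, with ext=1 forcing zeros outside the knots:
quad_val = integrate.quad(
    lambda t: interpolate.splev(t, spl, der=0, ext=1),
    x[0] - 5.0, x[-1] + 5.0, limit=200, full_output=1)[0]
assert np.isclose(full, quad_val, rtol=1e-4)
```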

Comment on lines 82 to 84

    # Add the PDF axes.
    self.add_axis(PDFAxis(
        name='log_energy',
        vmin=self.log10_reco_e_min,
        vmax=self.log10_reco_e_max))
Collaborator Author

Covered it in the integration test.

Comment thread skyllh/analyses/i3/publicdata_ps/utils.py Outdated
Co-authored-by: Copilot Autofix powered by AI <[email protected]>

Labels

performance Performance could be improved

3 participants