Skip to content

Conversation

yoyolicoris
Copy link
Member

@yoyolicoris yoyolicoris commented Jul 23, 2025

[------------------------------------- TorchLPC -------------------------------------]
                        |  v07, complex    |  thrust, complex |     v07    |  thrust
4 threads: ---------------------------------------------------------------------------
      [8, 16384, 1]     |        256.6     |       274.6      |     252.5  |    273.6 
      [8, 65536, 1]     |        290.1     |       290.7      |     271.7  |    275.1 
      [8, 262144, 1]    |        483.7     |       495.2      |     416.0  |    308.5 
      [32, 16384, 1]    |        282.9     |       287.9      |     263.0  |    274.5 
      [32, 65536, 1]    |        558.2     |       498.2      |     385.8  |    305.6 
      [32, 262144, 1]   |       1909.9     |      2007.3      |    1380.9  |   1018.4 
      [128, 16384, 1]   |        473.9     |       501.4      |     322.2  |    307.3 
      [128, 65536, 1]   |       1654.8     |      2004.9      |     852.7  |   1018.1 
      [128, 262144, 1]  |       7467.1     |      7928.6      |    4992.5  |   3941.5 

Times are in microseconds (us).
  • thrust performs slightly slower than v0.7 with complex numbers.
  • thrust can be slightly faster on long sequences and large batch sizes with real numbers.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant