Skip to content

Conversation

@fangchenli
Copy link
Member

No description provided.

fangchenli and others added 3 commits December 18, 2025 19:53
Avoid conversion to TimedeltaArray by using PyArrow compute directly.
Cast duration to int64, then to float64, and multiply by unit factor.

~3.7x speedup (3.53ms -> 0.96ms for 1M rows).

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
PyArrow automatically promotes int64 to double when multiplying with float.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@fangchenli fangchenli added Performance Memory or execution speed performance Arrow pyarrow functionality labels Dec 19, 2025
@fangchenli fangchenli marked this pull request as ready for review December 19, 2025 05:21
@fangchenli fangchenli added the Datetime Datetime data dtype label Dec 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Arrow pyarrow functionality Datetime Datetime data dtype Performance Memory or execution speed performance

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants