feat: Upgrade Hive to 4.0.1 and align timestamp behaviour with SQL standard#24571
feat: Upgrade Hive to 4.0.1 and align timestamp behaviour with SQL standard#24571imjalpreet wants to merge 6 commits into
Conversation
7ec86c0 to
954a77b
Compare
7e6d265 to
0448c62
Compare
21c80f6 to
282e2c4
Compare
|
Looking closely at the reason why tests are failing, beginning with presto-orc module.
from pyarrow import orc
table2 = orc.read_table('/tmp/3420396529049254202/data.orc')
print(table2)Whereas, record reader for ORC in presto reads it differently
ORC files generated by the version of hive in the PR and master:
This is giving a clue as to something has changed between the versions. This indicates that files written by older version of hive + ORC will give incorrect output? |
|
Another interesting find: There is a difference between row indices and stripe information. |
13d9947 to
5c04f64
Compare
|
@ethanyzhang imported this issue as lakehouse/presto #24571 |
|
The test failures such as: Is in the way presto reads the data and writes the data. Somehow even before the data is interpreted as a timestamp type i.e it is still a long type, it has a timestamp adjusted to system timezone. Why it happens is not yet clear to me, when the data written by presto is read via an external ORC reader it has a 6h adjustment applied to it. A similar thing happens when presto reads the data written by hive. There are no issues while reading other datatypes, e.g. Long/Ints etc... The problem seems to be specific to timestamp only. @imjalpreet agrees with this. |
5b896d6 to
6db405e
Compare
894b0aa to
5a2117a
Compare
71ed5ae to
c2e4072
Compare
c2e4072 to
602229d
Compare
761be28 to
435c545
Compare
435c545 to
0cb3e95
Compare
bff77b0 to
4944e29
Compare
|
Dependent PR still waiting for reviews: prestodb/presto-hive-apache#65 |
|
@imjalpreet |
a6d7f0b to
794d4f7
Compare
|
The following dependent PRs under review: Tempto: prestodb/tempto#299 |
794d4f7 to
3d4524c
Compare
|
Cassandra Product Tests failing due to prestodb/tempto#298 |
e740ce3 to
4b508fd
Compare
…tiveExpressionOptimizer.testCurrentTimestamp
|
Need to debug why a couple of recently added native sidecar plugin unit tests are failing after the latest rebase |


Description
Upgrade to Hive 4.0.1
Depends on prestodb/presto-hive-apache#65, prestodb/tempto#299, prestodb/orc-protobuf#25, prestodb/presto-hive-dwrf#13, prestodb/presto-hive-dwrf#14 and prestodb/docker-images#58
Motivation and Context
#24435 and #27028
Impact
Test Plan
Contributor checklist
Release Notes
Please follow release notes guidelines and fill in the release notes below.