Skip to content

[refactor](table) Refactor table and file reader#63893

Draft
Gabriel39 wants to merge 57 commits into
masterfrom
refact_reader_branch
Draft

[refactor](table) Refactor table and file reader#63893
Gabriel39 wants to merge 57 commits into
masterfrom
refact_reader_branch

Conversation

@Gabriel39

Copy link
Copy Markdown
Contributor

What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

Release note

None

Check List (For Author)

  • Test

    • Regression test
    • Unit Test
    • Manual test (add detailed scripts or steps below)
    • No need to test or manual test. Explain why:
      • This is a refactor/code format and no logic has been changed.
      • Previous test can cover this change.
      • No code files have been changed.
      • Other reason
  • Behavior changed:

    • No.
    • Yes.
  • Does this need documentation?

    • No.
    • Yes.

Check List (For Reviewer who merge this PR)

  • Confirm the release note
  • Confirm test cases
  • Confirm document
  • Add branch pick label

@hello-stephen

Copy link
Copy Markdown
Contributor

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

  1. What problem was fixed (it's best to include specific error reporting information). How it was fixed.
  2. Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
  3. What features were added. Why was this function added?
  4. Which code was refactored and why was this part of the code refactored?
  5. Which functions were optimized and what is the difference before and after the optimization?

@Gabriel39 Gabriel39 marked this pull request as draft May 29, 2026 06:39
Gabriel39 added a commit to Gabriel39/incubator-doris that referenced this pull request May 29, 2026
### What problem does this PR solve?

Issue Number: close #xxx

Related PR: apache#63893

Problem Summary: Add focused BE unit coverage for new table reader and new parquet reader edge cases, including aggregate pushdown over split ranges, Iceberg equality/position deletes, row lineage after delete filtering, Parquet dictionary/statistics pruning, and IOContext release. Also clean up temporary delete predicate expression columns in the new Parquet reader so equality delete predicates with cast children do not alter the returned file block schema.

### Release note

None

### Check List (For Author)

- Test: Unit Test
    - Added BE UT cases in table_reader_test and parquet_reader_test.
    - Ran git diff --check.
    - Tried ./run-be-ut.sh with focused filters, but local JAVA_HOME points to JDK 11 and JDK_17 is not set; the runner requires JDK 17.
- Behavior changed: No
- Does this need documentation: No
Gabriel39 added a commit that referenced this pull request May 29, 2026
### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #63893

Problem Summary: Add focused BE unit coverage for new table reader and
new parquet reader edge cases, including aggregate pushdown over split
ranges, Iceberg equality/position deletes, row lineage after delete
filtering, Parquet dictionary/statistics pruning, and IOContext release.
Also clean up temporary delete predicate expression columns in the new
Parquet reader so equality delete predicates with cast children do not
alter the returned file block schema.

### Release note

None

### Check List (For Author)

- Test: Unit Test
    - Added BE UT cases in table_reader_test and parquet_reader_test.
    - Ran git diff --check.
- Tried ./run-be-ut.sh with focused filters, but local JAVA_HOME points
to JDK 11 and JDK_17 is not set; the runner requires JDK 17.
- Behavior changed: No
- Does this need documentation: No

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
@Gabriel39 Gabriel39 force-pushed the refact_reader_branch branch 4 times, most recently from 837cc56 to 475e48a Compare June 3, 2026 05:14
@Gabriel39

Copy link
Copy Markdown
Contributor Author

run buildall

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-H: Total hot run time: 29107 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit c7e07bf0f4367f7634e524741d26894f84d16410, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17461	4051	4019	4019
q2	q3	10688	1355	807	807
q4	4687	472	342	342
q5	7582	883	589	589
q6	187	173	139	139
q7	796	840	650	650
q8	9383	1609	1560	1560
q9	5881	4483	4507	4483
q10	6786	1831	1557	1557
q11	421	269	252	252
q12	636	422	288	288
q13	18186	3500	2734	2734
q14	264	264	253	253
q15	q16	812	779	704	704
q17	1018	1005	897	897
q18	6979	5935	5593	5593
q19	2044	1276	992	992
q20	505	386	268	268
q21	6381	2852	2672	2672
q22	475	385	308	308
Total cold run time: 101172 ms
Total hot run time: 29107 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	5074	4871	4694	4694
q2	q3	4929	5301	4650	4650
q4	2133	2173	1402	1402
q5	4796	4986	4655	4655
q6	228	181	128	128
q7	1915	1752	1598	1598
q8	2429	2131	2084	2084
q9	7840	7666	7441	7441
q10	4695	4665	4246	4246
q11	530	379	352	352
q12	730	734	521	521
q13	3029	3327	2799	2799
q14	287	281	254	254
q15	q16	687	692	603	603
q17	1280	1252	1252	1252
q18	7398	6975	6771	6771
q19	1109	1094	1118	1094
q20	2214	2220	1955	1955
q21	5243	4552	4491	4491
q22	513	471	436	436
Total cold run time: 57059 ms
Total hot run time: 51426 ms

@hello-stephen

Copy link
Copy Markdown
Contributor

FE Regression Coverage Report

Increment line coverage 66.67% (2/3) 🎉
Increment coverage report
Complete coverage report

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-DS: Total hot run time: 168765 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit c7e07bf0f4367f7634e524741d26894f84d16410, data reload: false

query5	4336	620	484	484
query6	459	210	175	175
query7	4842	554	313	313
query8	375	211	208	208
query9	8801	4019	4022	4019
query10	434	321	274	274
query11	5941	2346	2210	2210
query12	166	104	101	101
query13	1287	678	408	408
query14	6372	5363	5054	5054
query14_1	4390	4379	4354	4354
query15	203	195	175	175
query16	1032	453	433	433
query17	1120	709	583	583
query18	2464	465	344	344
query19	202	180	136	136
query20	109	109	109	109
query21	211	135	116	116
query22	13661	13491	13333	13333
query23	17282	16449	16129	16129
query23_1	16201	16314	16343	16314
query24	7482	1728	1346	1346
query24_1	1335	1319	1306	1306
query25	589	467	411	411
query26	1359	317	178	178
query27	2601	555	319	319
query28	4489	2082	2027	2027
query29	1112	628	491	491
query30	312	235	197	197
query31	1113	1107	963	963
query32	106	62	62	62
query33	535	339	275	275
query34	1180	1194	651	651
query35	756	803	714	714
query36	1416	1408	1236	1236
query37	160	114	127	114
query38	3209	3114	3049	3049
query39	916	908	898	898
query39_1	884	884	870	870
query40	224	125	101	101
query41	64	62	61	61
query42	94	97	93	93
query43	315	321	280	280
query44	
query45	201	189	180	180
query46	1108	1190	719	719
query47	2370	2376	2229	2229
query48	401	421	303	303
query49	622	469	354	354
query50	982	356	259	259
query51	4282	4268	4232	4232
query52	87	87	81	81
query53	246	269	191	191
query54	270	227	204	204
query55	78	75	70	70
query56	242	224	215	215
query57	1459	1415	1332	1332
query58	252	216	199	199
query59	1591	1676	1425	1425
query60	285	246	234	234
query61	165	167	160	160
query62	692	642	576	576
query63	232	178	183	178
query64	2521	772	614	614
query65	
query66	1769	474	339	339
query67	29729	29711	28959	28959
query68	
query69	422	301	263	263
query70	944	988	988	988
query71	300	212	214	212
query72	2912	2878	2593	2593
query73	837	775	461	461
query74	5130	4933	4801	4801
query75	2695	2550	2225	2225
query76	2283	1160	784	784
query77	353	385	292	292
query78	12324	12452	11858	11858
query79	1456	996	795	795
query80	586	480	396	396
query81	461	287	244	244
query82	577	158	128	128
query83	353	270	261	261
query84	257	140	108	108
query85	868	537	442	442
query86	361	308	292	292
query87	3346	3309	3141	3141
query88	3658	2762	2763	2762
query89	419	374	323	323
query90	1969	182	185	182
query91	178	164	134	134
query92	65	64	60	60
query93	1555	1506	856	856
query94	567	370	309	309
query95	674	475	351	351
query96	1038	751	340	340
query97	2698	2679	2598	2598
query98	229	205	208	205
query99	1171	1180	1023	1023
Total cold run time: 250908 ms
Total hot run time: 168765 ms

@Gabriel39

Copy link
Copy Markdown
Contributor Author

run buildall

@hello-stephen

Copy link
Copy Markdown
Contributor

FE UT Coverage Report

Increment line coverage 100.00% (2/2) 🎉
Increment coverage report
Complete coverage report

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-H: Total hot run time: 29068 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit e9a9b14d5d681b1e807b8d06a2a8f3c4a1c6feef, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17764	4108	4077	4077
q2	q3	10894	1483	807	807
q4	4807	487	352	352
q5	8583	909	600	600
q6	362	174	141	141
q7	945	845	658	658
q8	10940	1596	1641	1596
q9	7162	4542	4565	4542
q10	6787	1952	1532	1532
q11	441	273	252	252
q12	646	428	294	294
q13	18143	3451	2794	2794
q14	267	258	247	247
q15	q16	821	788	711	711
q17	1011	860	935	860
q18	7059	5840	5519	5519
q19	1175	1231	1001	1001
q20	520	413	262	262
q21	5787	2711	2519	2519
q22	445	366	304	304
Total cold run time: 104559 ms
Total hot run time: 29068 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	4380	4309	4346	4309
q2	q3	4562	4996	4393	4393
q4	2104	2217	1390	1390
q5	4466	4347	4679	4347
q6	264	212	149	149
q7	2027	1875	1665	1665
q8	2529	2181	2177	2177
q9	7982	7985	8407	7985
q10	4908	4987	4297	4297
q11	587	415	385	385
q12	773	898	558	558
q13	3321	3710	2976	2976
q14	299	313	270	270
q15	q16	722	763	650	650
q17	1407	1353	1330	1330
q18	7859	7354	6768	6768
q19	1138	1099	1113	1099
q20	2225	2231	1945	1945
q21	5268	4576	4450	4450
q22	542	462	402	402
Total cold run time: 57363 ms
Total hot run time: 51545 ms

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-DS: Total hot run time: 171113 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit e9a9b14d5d681b1e807b8d06a2a8f3c4a1c6feef, data reload: false

query5	4346	638	492	492
query6	452	204	183	183
query7	4833	536	307	307
query8	382	235	220	220
query9	8807	4169	4108	4108
query10	461	323	274	274
query11	5972	2371	2257	2257
query12	163	114	101	101
query13	1272	629	440	440
query14	6405	5472	5096	5096
query14_1	4447	4433	4418	4418
query15	210	205	179	179
query16	1071	463	458	458
query17	1156	740	600	600
query18	2725	511	366	366
query19	210	194	153	153
query20	111	110	108	108
query21	226	150	121	121
query22	13616	13704	13406	13406
query23	17317	16568	16148	16148
query23_1	16390	16374	16396	16374
query24	7475	1826	1318	1318
query24_1	1314	1316	1306	1306
query25	593	481	421	421
query26	1302	324	173	173
query27	2613	569	333	333
query28	4417	2007	2025	2007
query29	1119	639	512	512
query30	313	240	215	215
query31	1124	1087	964	964
query32	113	64	64	64
query33	560	340	268	268
query34	1201	1140	669	669
query35	761	826	713	713
query36	1376	1410	1224	1224
query37	156	108	90	90
query38	3198	3187	3058	3058
query39	940	930	909	909
query39_1	899	872	883	872
query40	225	125	109	109
query41	67	64	61	61
query42	96	96	96	96
query43	326	321	275	275
query44	
query45	195	186	184	184
query46	1075	1210	770	770
query47	2366	2397	2272	2272
query48	384	418	297	297
query49	644	474	374	374
query50	1038	364	250	250
query51	4369	4366	4289	4289
query52	90	89	79	79
query53	252	273	198	198
query54	265	220	214	214
query55	81	77	75	75
query56	247	231	221	221
query57	1425	1404	1317	1317
query58	251	229	221	221
query59	1606	1629	1418	1418
query60	288	245	244	244
query61	161	164	163	163
query62	698	675	592	592
query63	230	182	188	182
query64	2551	818	638	638
query65	
query66	1773	462	344	344
query67	29838	29808	29748	29748
query68	
query69	432	300	271	271
query70	966	942	968	942
query71	302	225	214	214
query72	3018	2757	2384	2384
query73	826	748	446	446
query74	5123	4992	4767	4767
query75	2704	2605	2241	2241
query76	2313	1165	790	790
query77	364	380	298	298
query78	12545	12447	11919	11919
query79	1419	1053	787	787
query80	1320	487	395	395
query81	530	284	241	241
query82	603	156	124	124
query83	337	286	271	271
query84	275	139	117	117
query85	946	545	448	448
query86	442	322	285	285
query87	3397	3309	3221	3221
query88	3677	2754	2721	2721
query89	452	384	333	333
query90	1936	189	188	188
query91	182	171	140	140
query92	64	67	62	62
query93	1563	1498	874	874
query94	732	348	310	310
query95	693	381	345	345
query96	1088	777	320	320
query97	2719	2718	2566	2566
query98	219	211	213	211
query99	1155	1186	1043	1043
Total cold run time: 253223 ms
Total hot run time: 171113 ms

@hello-stephen

Copy link
Copy Markdown
Contributor

FE Regression Coverage Report

Increment line coverage 100.00% (2/2) 🎉
Increment coverage report
Complete coverage report

@Gabriel39

Copy link
Copy Markdown
Contributor Author

run buildall

@hello-stephen

Copy link
Copy Markdown
Contributor

FE UT Coverage Report

Increment line coverage 100.00% (2/2) 🎉
Increment coverage report
Complete coverage report

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-H: Total hot run time: 29390 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 5decd77b25dfb7993971c59414b306ae17d37096, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17882	4102	4130	4102
q2	q3	10907	1439	837	837
q4	4763	489	350	350
q5	8430	888	590	590
q6	319	171	136	136
q7	895	868	635	635
q8	10915	1700	1634	1634
q9	7272	4595	4500	4500
q10	6775	1856	1542	1542
q11	439	272	249	249
q12	652	425	289	289
q13	18226	3410	2820	2820
q14	275	260	243	243
q15	q16	818	777	714	714
q17	1062	875	973	875
q18	6842	5790	5477	5477
q19	1393	1253	1214	1214
q20	559	430	277	277
q21	6086	2731	2596	2596
q22	479	369	310	310
Total cold run time: 104989 ms
Total hot run time: 29390 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	4857	4697	4970	4697
q2	q3	4878	5285	4704	4704
q4	2161	2206	1411	1411
q5	4992	4771	4719	4719
q6	246	189	127	127
q7	1849	1712	1565	1565
q8	2301	1964	1951	1951
q9	7391	7434	7450	7434
q10	4728	4687	4233	4233
q11	544	384	352	352
q12	726	748	527	527
q13	3023	3432	2783	2783
q14	267	278	265	265
q15	q16	676	704	598	598
q17	1271	1258	1254	1254
q18	7357	6776	6861	6776
q19	1092	1079	1114	1079
q20	2226	2233	1936	1936
q21	5290	4562	4473	4473
q22	534	466	408	408
Total cold run time: 56409 ms
Total hot run time: 51292 ms

@hello-stephen

Copy link
Copy Markdown
Contributor

FE Regression Coverage Report

Increment line coverage 100.00% (2/2) 🎉
Increment coverage report
Complete coverage report

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-DS: Total hot run time: 168294 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 5decd77b25dfb7993971c59414b306ae17d37096, data reload: false

query5	4326	641	483	483
query6	454	202	179	179
query7	4880	580	281	281
query8	368	230	209	209
query9	8785	3994	3990	3990
query10	453	314	271	271
query11	5732	2353	2175	2175
query12	157	102	97	97
query13	1259	562	452	452
query14	6567	5393	5066	5066
query14_1	4397	4400	4372	4372
query15	209	200	178	178
query16	1028	490	434	434
query17	1131	716	596	596
query18	2724	490	385	385
query19	204	180	140	140
query20	118	109	111	109
query21	213	138	116	116
query22	13611	13597	13356	13356
query23	17395	16545	16200	16200
query23_1	16260	16236	16287	16236
query24	7500	1683	1307	1307
query24_1	1319	1288	1327	1288
query25	552	450	378	378
query26	1295	326	171	171
query27	2652	557	343	343
query28	4418	2004	2008	2004
query29	1065	628	478	478
query30	315	239	200	200
query31	1130	1071	961	961
query32	120	61	59	59
query33	524	315	246	246
query34	1189	1117	647	647
query35	744	780	680	680
query36	1390	1387	1257	1257
query37	155	103	91	91
query38	3215	3136	3068	3068
query39	944	915	897	897
query39_1	882	886	859	859
query40	218	121	102	102
query41	64	65	61	61
query42	94	93	93	93
query43	318	319	275	275
query44	
query45	199	188	180	180
query46	1123	1250	750	750
query47	2341	2377	2306	2306
query48	398	423	280	280
query49	635	466	360	360
query50	1040	343	261	261
query51	4416	4269	4246	4246
query52	86	87	76	76
query53	252	255	190	190
query54	264	217	200	200
query55	76	75	69	69
query56	243	220	220	220
query57	1430	1387	1354	1354
query58	241	214	210	210
query59	1583	1612	1423	1423
query60	278	245	228	228
query61	149	150	156	150
query62	716	650	586	586
query63	234	185	183	183
query64	2511	770	615	615
query65	
query66	1768	468	338	338
query67	29734	28977	28885	28885
query68	
query69	420	298	265	265
query70	999	954	927	927
query71	298	226	209	209
query72	2952	2663	2407	2407
query73	834	784	426	426
query74	5161	4946	4803	4803
query75	2660	2553	2249	2249
query76	2314	1184	801	801
query77	360	366	289	289
query78	12505	12347	11807	11807
query79	1266	1070	792	792
query80	525	461	392	392
query81	446	283	244	244
query82	248	163	129	129
query83	269	279	251	251
query84	267	147	114	114
query85	900	532	449	449
query86	329	300	291	291
query87	3344	3302	3195	3195
query88	3618	2724	2704	2704
query89	416	380	324	324
query90	2176	177	179	177
query91	177	168	133	133
query92	63	63	57	57
query93	1412	1411	927	927
query94	542	344	299	299
query95	687	457	355	355
query96	1090	840	388	388
query97	2703	2699	2586	2586
query98	211	202	210	202
query99	1138	1176	1028	1028
Total cold run time: 250720 ms
Total hot run time: 168294 ms

@Gabriel39

Copy link
Copy Markdown
Contributor Author

run buildall

1 similar comment
@Gabriel39

Copy link
Copy Markdown
Contributor Author

run buildall

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-H: Total hot run time: 29426 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 1be063b5d93f73378ae33a40d6692c8f75681079, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17658	4085	4062	4062
q2	q3	10917	1418	827	827
q4	4746	508	365	365
q5	8253	892	595	595
q6	332	179	136	136
q7	923	856	634	634
q8	10856	1689	1575	1575
q9	7228	4584	4539	4539
q10	6810	1834	1536	1536
q11	437	285	257	257
q12	648	436	296	296
q13	18089	3978	2770	2770
q14	271	261	235	235
q15	q16	826	776	713	713
q17	946	928	1034	928
q18	7058	5761	5477	5477
q19	1186	1237	1200	1200
q20	577	455	292	292
q21	5834	2815	2654	2654
q22	458	460	335	335
Total cold run time: 104053 ms
Total hot run time: 29426 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	4923	4842	4754	4754
q2	q3	4971	5267	4645	4645
q4	2113	2184	1390	1390
q5	4980	4730	4669	4669
q6	236	200	147	147
q7	1921	1728	1526	1526
q8	2422	2124	2128	2124
q9	7990	7416	7525	7416
q10	4754	4677	4222	4222
q11	528	384	357	357
q12	737	737	523	523
q13	3057	3417	2819	2819
q14	277	287	258	258
q15	q16	685	709	605	605
q17	1288	1273	1263	1263
q18	7351	6967	6754	6754
q19	1101	1108	1135	1108
q20	2231	2215	1953	1953
q21	5287	4595	4472	4472
q22	528	449	402	402
Total cold run time: 57380 ms
Total hot run time: 51407 ms

@hello-stephen

Copy link
Copy Markdown
Contributor

Cloud UT Coverage Report

Increment line coverage 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 78.34% (1906/2433)
Line Coverage 64.79% (33994/52468)
Region Coverage 65.32% (17527/26833)
Branch Coverage 53.99% (9303/17230)

@Gabriel39 Gabriel39 force-pushed the refact_reader_branch branch from 267891d to 64d2dda Compare June 26, 2026 01:50
### What problem does this PR solve?

Issue Number: None

Related PR: #63781, #64671

Problem Summary: File scanner v2 did not carry the same fixes as the
existing file scanner path. Predicate rows filtered inside v2 file
readers were still reported through scanner load counters unless the
scanner was a real load source, and Hive TEXTFILE empty physical lines
were still skipped unless read_csv_empty_line_as_null was enabled. This
change gates v2 load counter reporting with the same FILE_STREAM
exception used by FileScanner and adds a delimited text hook so Hive
Text v2 treats empty physical lines as records while CSV keeps the old
default behavior.

### Release note

Fix file scanner v2 load counter reporting and Hive TEXTFILE empty-line
handling.

### Check List (For Author)

- Test: Unit Test / Manual test
- Added TextV2ReaderTest coverage for Hive TEXTFILE empty line records,
single-column empty string fields, and COUNT pushdown.
    - Ran git diff --check.
- Ran clang-format v16 through build-support/run_clang_format.py for
changed files.
- Attempted ./run-be-ut.sh --run
--filter='TextV2ReaderTest.*:FileScannerV2Test.*', but the local run was
blocked because the script needed to update/download datasketches-cpp
and network access was unavailable; no BE UT binary was already built.
- Attempted clang-tidy with the available compile_commands.json, but it
pointed at a stale /mnt/disk3/gabriel path; the project clang-tidy
wrapper also requires bash 4+ while only system bash is available.
- Behavior changed: Yes. File scanner v2 now matches v1 load counter
gating and Hive TEXTFILE empty-line semantics.
- Does this need documentation: No

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
@Gabriel39

Copy link
Copy Markdown
Contributor Author

run buildall

@hello-stephen

Copy link
Copy Markdown
Contributor

FE UT Coverage Report

Increment line coverage 66.67% (4/6) 🎉
Increment coverage report
Complete coverage report

@suxiaogang223

Copy link
Copy Markdown
Member

run buildall

@hello-stephen

Copy link
Copy Markdown
Contributor

FE Regression Coverage Report

Increment line coverage 27.27% (3/11) 🎉
Increment coverage report
Complete coverage report

@hello-stephen

Copy link
Copy Markdown
Contributor

FE Regression Coverage Report

Increment line coverage 1.97% (3/152) 🎉
Increment coverage report
Complete coverage report

1 similar comment
@hello-stephen

Copy link
Copy Markdown
Contributor

FE Regression Coverage Report

Increment line coverage 1.97% (3/152) 🎉
Increment coverage report
Complete coverage report

suxiaogang223 added a commit to suxiaogang223/doris that referenced this pull request Jun 26, 2026
### What problem does this PR solve?

Issue Number: close #xxx

Related PR: apache#63893

Problem Summary: Restore regression expected outputs that only changed because INT96 timestamp reads moved away from the old parquet reader timezone behavior. The code now restores old INT96 compatibility, so these expected result rows should also return to the old timezone-adjusted values while leaving unrelated output differences untouched.

### Release note

None

### Check List (For Author)

- Test: Not run (expected result adjustment only)
- Behavior changed: No
- Does this need documentation: No
suxiaogang223 added a commit to suxiaogang223/doris that referenced this pull request Jun 26, 2026
### What problem does this PR solve?

Issue Number: close #xxx

Related PR: apache#63893

Problem Summary: The previous regression expected-result update restored timestamp output too broadly. Only the lines produced by INT96 parquet inputs should be restored to match the old parquet reader compatibility behavior. This change keeps the expected results limited to the affected INT96 tvf cases and avoids unrelated timestamp output changes.

### Release note

None

### Check List (For Author)

- Test: No need to test (expected-result scope correction only)
- Behavior changed: No
- Does this need documentation: No
suxiaogang223 added a commit to suxiaogang223/doris that referenced this pull request Jun 26, 2026
### What problem does this PR solve?

Issue Number: close #xxx

Related PR: apache#63893

Problem Summary: Restore regression expected outputs that only changed because INT96 timestamp reads moved away from the old parquet reader timezone behavior. The code now restores old INT96 compatibility, so these expected result rows should also return to the old timezone-adjusted values while leaving unrelated output differences untouched.

### Release note

None

### Check List (For Author)

- Test: Not run (expected result adjustment only)
- Behavior changed: No
- Does this need documentation: No
@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-H: Total hot run time: 29192 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit e0df3cc0f44b1313d2e823d22abbd9c657892be4, data reload: false

------ Round 1 ----------------------------------
============================================
q1	17654	4076	4005	4005
q2	2004	326	188	188
q3	10915	1383	833	833
q4	4741	474	337	337
q5	8360	855	584	584
q6	360	170	139	139
q7	841	846	624	624
q8	10548	1699	1713	1699
q9	5906	4593	4564	4564
q10	6830	1812	1532	1532
q11	439	272	243	243
q12	662	423	290	290
q13	18132	3412	2727	2727
q14	269	263	243	243
q15	q16	781	784	705	705
q17	931	1003	920	920
q18	7062	5920	5579	5579
q19	1173	1291	1100	1100
q20	509	391	268	268
q21	5729	2629	2317	2317
q22	438	359	295	295
Total cold run time: 104284 ms
Total hot run time: 29192 ms

----- Round 2, with runtime_filter_mode=off -----
============================================
q1	4362	4265	4265	4265
q2	304	342	219	219
q3	4595	5037	4336	4336
q4	2072	2146	1392	1392
q5	4433	4327	4338	4327
q6	231	181	123	123
q7	2347	1856	1630	1630
q8	2462	2118	2106	2106
q9	8149	8141	8174	8141
q10	4856	4778	4495	4495
q11	562	408	395	395
q12	759	748	532	532
q13	3302	3678	2947	2947
q14	312	295	261	261
q15	q16	708	741	652	652
q17	1346	1337	1315	1315
q18	7943	7297	6720	6720
q19	1139	1111	1102	1102
q20	2231	2256	1994	1994
q21	5261	4548	4484	4484
q22	508	430	396	396
Total cold run time: 57882 ms
Total hot run time: 51832 ms

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-H: Total hot run time: 29229 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit e0df3cc0f44b1313d2e823d22abbd9c657892be4, data reload: false

------ Round 1 ----------------------------------
============================================
q1	17264	4045	4039	4039
q2	2002	305	194	194
q3	10228	1400	826	826
q4	4681	471	344	344
q5	7549	866	571	571
q6	181	169	135	135
q7	805	856	627	627
q8	9403	1750	1543	1543
q9	5577	4500	4501	4500
q10	6647	1817	1538	1538
q11	428	286	247	247
q12	630	423	295	295
q13	18103	3437	2797	2797
q14	263	265	242	242
q15	q16	794	785	719	719
q17	1051	895	918	895
q18	7131	5791	5724	5724
q19	1240	1364	1054	1054
q20	485	414	271	271
q21	5622	2666	2355	2355
q22	451	350	313	313
Total cold run time: 100535 ms
Total hot run time: 29229 ms

----- Round 2, with runtime_filter_mode=off -----
============================================
q1	4397	4256	4300	4256
q2	305	348	221	221
q3	4666	4990	4430	4430
q4	2092	2181	1359	1359
q5	4491	4351	4330	4330
q6	252	182	130	130
q7	1727	1644	1755	1644
q8	2748	2232	2208	2208
q9	8329	8564	8117	8117
q10	4826	4812	4327	4327
q11	559	419	378	378
q12	771	762	528	528
q13	3309	3654	2881	2881
q14	289	320	263	263
q15	q16	706	747	634	634
q17	1366	1326	1318	1318
q18	8093	7261	7175	7175
q19	1158	1143	1111	1111
q20	2277	2211	1932	1932
q21	5267	4582	4411	4411
q22	521	458	395	395
Total cold run time: 58149 ms
Total hot run time: 52048 ms

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-DS: Total hot run time: 170764 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit e0df3cc0f44b1313d2e823d22abbd9c657892be4, data reload: false

query5	4332	617	483	483
query6	448	198	169	169
query7	4863	556	304	304
query8	342	191	170	170
query9	8766	3992	4016	3992
query10	448	312	286	286
query11	5996	2358	2151	2151
query12	157	102	105	102
query13	1303	592	453	453
query14	6505	5276	4947	4947
query14_1	4269	4292	4298	4292
query15	220	204	185	185
query16	1006	462	445	445
query17	1117	717	599	599
query18	2710	486	346	346
query19	206	185	146	146
query20	114	110	110	110
query21	229	139	115	115
query22	13648	13623	13399	13399
query23	17400	16491	16101	16101
query23_1	16211	16225	16271	16225
query24	7553	1767	1281	1281
query24_1	1327	1324	1322	1322
query25	568	487	363	363
query26	1342	313	172	172
query27	2669	581	338	338
query28	4411	1984	1972	1972
query29	1046	604	460	460
query30	308	232	195	195
query31	1112	1095	957	957
query32	102	62	59	59
query33	512	314	238	238
query34	1164	1133	658	658
query35	761	789	676	676
query36	1386	1400	1222	1222
query37	147	108	94	94
query38	1868	1718	1682	1682
query39	919	914	924	914
query39_1	875	885	875	875
query40	216	117	99	99
query41	65	65	66	65
query42	88	88	88	88
query43	317	317	276	276
query44	1438	793	791	791
query45	199	190	180	180
query46	1037	1207	725	725
query47	2306	2397	2235	2235
query48	400	418	267	267
query49	575	418	332	332
query50	991	341	257	257
query51	4530	4387	4334	4334
query52	83	82	69	69
query53	252	263	191	191
query54	254	211	191	191
query55	72	71	65	65
query56	261	249	203	203
query57	1444	1432	1348	1348
query58	242	208	199	199
query59	1558	1632	1392	1392
query60	276	243	231	231
query61	150	150	148	148
query62	697	651	583	583
query63	232	184	193	184
query64	2471	762	598	598
query65	4822	4786	4787	4786
query66	1728	455	336	336
query67	28871	28908	28607	28607
query68	3101	1658	932	932
query69	390	316	267	267
query70	1058	967	962	962
query71	275	230	214	214
query72	2884	2657	2340	2340
query73	841	783	464	464
query74	5126	4980	4754	4754
query75	2565	2543	2152	2152
query76	2326	1171	801	801
query77	334	373	282	282
query78	12394	12463	11812	11812
query79	1432	1149	801	801
query80	603	455	385	385
query81	449	281	240	240
query82	560	160	121	121
query83	342	275	253	253
query84	259	141	112	112
query85	855	515	417	417
query86	361	288	280	280
query87	1838	1821	1757	1757
query88	3713	2789	2776	2776
query89	437	386	343	343
query90	1889	180	177	177
query91	177	193	128	128
query92	62	59	55	55
query93	1581	1503	936	936
query94	546	340	299	299
query95	670	459	346	346
query96	1017	799	359	359
query97	2718	2716	2564	2564
query98	218	209	201	201
query99	1180	1164	1010	1010
Total cold run time: 256579 ms
Total hot run time: 170764 ms

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary: File scanner v2 reads Parquet through Arrow, so the old
vparquet page cache path is not used. Repeated scans still go through
the Doris file reader for serialized Parquet column chunk data even when
the Parquet page cache option is enabled. This change registers the
selected Parquet column chunk byte ranges after row-group planning and
lets the Arrow RandomAccessFile adapter reuse StoragePageCache for reads
inside those ranges. Footer and metadata reads happen before range
registration and are intentionally excluded.

### Release note

None

### Check List (For Author)

- Test: Manual test
    - Ran git diff --check.
- Ran build-support/run_clang_format.py with clang-format 16 on modified
BE files.
- Could not compile with existing be/cmake-build-debug-dev-perf because
CMakeCache.txt was generated for /mnt/disk3/gabriel/Workspace/dev1/doris
and the configured ninja path is not available in this worktree.
- Behavior changed: No
- Does this need documentation: No

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
suxiaogang223 added a commit to suxiaogang223/doris that referenced this pull request Jun 26, 2026
Issue Number: close #xxx

Related PR: apache#63893

Problem Summary: Restore regression expected outputs that only changed because INT96 timestamp reads moved away from the old parquet reader timezone behavior. The code now restores old INT96 compatibility, so these expected result rows should also return to the old timezone-adjusted values while leaving unrelated output differences untouched.

None

- Test: Not run (expected result adjustment only)
- Behavior changed: No
- Does this need documentation: No
@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-DS: Total hot run time: 171959 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit e0df3cc0f44b1313d2e823d22abbd9c657892be4, data reload: false

query5	4332	621	473	473
query6	435	187	171	171
query7	4890	578	302	302
query8	344	189	164	164
query9	8752	4053	4030	4030
query10	460	320	275	275
query11	5915	2389	2133	2133
query12	156	107	105	105
query13	1261	593	413	413
query14	6275	5330	5060	5060
query14_1	4321	4296	4316	4296
query15	225	203	185	185
query16	1052	479	431	431
query17	1135	755	600	600
query18	2705	474	334	334
query19	204	185	144	144
query20	111	107	106	106
query21	222	138	118	118
query22	13707	13738	13388	13388
query23	17432	16562	16221	16221
query23_1	16367	16312	16365	16312
query24	7576	1772	1313	1313
query24_1	1328	1278	1293	1278
query25	548	449	360	360
query26	1291	318	165	165
query27	2645	593	341	341
query28	4390	2011	1984	1984
query29	1085	637	494	494
query30	307	245	207	207
query31	1114	1092	958	958
query32	106	60	56	56
query33	518	328	247	247
query34	1179	1166	660	660
query35	771	799	689	689
query36	1393	1412	1244	1244
query37	160	100	92	92
query38	1899	1728	1677	1677
query39	928	929	893	893
query39_1	898	880	880	880
query40	220	123	102	102
query41	73	65	63	63
query42	96	90	92	90
query43	351	319	279	279
query44	1415	778	769	769
query45	201	190	177	177
query46	1065	1214	719	719
query47	2397	2416	2250	2250
query48	401	433	289	289
query49	571	420	305	305
query50	974	353	269	269
query51	4433	4372	4341	4341
query52	81	81	68	68
query53	259	269	192	192
query54	260	222	198	198
query55	78	70	66	66
query56	245	219	209	209
query57	1434	1428	1317	1317
query58	235	212	214	212
query59	1581	1664	1436	1436
query60	293	246	229	229
query61	147	149	152	149
query62	721	655	587	587
query63	231	190	182	182
query64	2475	757	584	584
query65	4907	4813	4805	4805
query66	1738	451	335	335
query67	28903	28925	28740	28740
query68	3085	1666	938	938
query69	456	300	262	262
query70	1027	953	973	953
query71	287	227	211	211
query72	2843	2609	2338	2338
query73	877	787	421	421
query74	5135	4969	4774	4774
query75	2585	2566	2174	2174
query76	2335	1193	786	786
query77	358	383	281	281
query78	12585	12419	11889	11889
query79	1219	1123	768	768
query80	505	483	394	394
query81	449	291	239	239
query82	243	163	125	125
query83	285	276	259	259
query84	265	148	124	124
query85	902	616	493	493
query86	347	305	283	283
query87	1890	1832	1777	1777
query88	3692	2804	2773	2773
query89	414	395	332	332
query90	2097	189	172	172
query91	174	155	130	130
query92	65	58	58	58
query93	1560	1454	894	894
query94	517	344	316	316
query95	679	483	343	343
query96	1047	825	352	352
query97	2728	2724	2582	2582
query98	213	204	198	198
query99	1160	1169	1032	1032
Total cold run time: 256623 ms
Total hot run time: 171959 ms

@hello-stephen

Copy link
Copy Markdown
Contributor
ClickBench: Total hot run time: 25.78 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit e0df3cc0f44b1313d2e823d22abbd9c657892be4, data reload: false

query1	0.01	0.00	0.01
query2	0.14	0.08	0.10
query3	0.37	0.24	0.24
query4	1.61	0.24	0.25
query5	0.33	0.32	0.31
query6	1.17	0.67	0.67
query7	0.04	0.01	0.01
query8	0.10	0.07	0.07
query9	0.52	0.39	0.38
query10	0.58	0.57	0.59
query11	0.31	0.18	0.18
query12	0.32	0.19	0.18
query13	0.53	0.53	0.53
query14	0.94	0.92	0.92
query15	0.67	0.59	0.60
query16	0.38	0.39	0.39
query17	0.97	0.97	1.00
query18	0.31	0.29	0.29
query19	1.89	1.81	1.81
query20	0.02	0.02	0.02
query21	15.38	0.37	0.32
query22	4.92	0.15	0.13
query23	15.81	0.50	0.30
query24	2.41	0.59	0.42
query25	0.14	0.10	0.10
query26	0.75	0.27	0.22
query27	0.11	0.09	0.10
query28	3.40	0.92	0.50
query29	12.47	4.44	3.51
query30	0.40	0.24	0.26
query31	2.77	0.64	0.34
query32	3.25	0.61	0.48
query33	2.95	3.10	3.00
query34	15.58	4.07	3.36
query35	3.27	3.26	3.25
query36	0.62	0.52	0.51
query37	0.13	0.09	0.09
query38	0.07	0.08	0.06
query39	0.07	0.06	0.06
query40	0.20	0.18	0.16
query41	0.12	0.07	0.08
query42	0.08	0.05	0.06
query43	0.07	0.07	0.07
Total cold run time: 96.18 s
Total hot run time: 25.78 s

@hello-stephen

Copy link
Copy Markdown
Contributor
ClickBench: Total hot run time: 25.08 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit e0df3cc0f44b1313d2e823d22abbd9c657892be4, data reload: false

query1	0.01	0.01	0.01
query2	0.09	0.05	0.05
query3	0.25	0.13	0.13
query4	1.61	0.13	0.14
query5	0.23	0.22	0.22
query6	1.28	1.07	1.03
query7	0.04	0.01	0.00
query8	0.06	0.04	0.04
query9	0.38	0.31	0.32
query10	0.53	0.54	0.54
query11	0.19	0.14	0.13
query12	0.18	0.15	0.14
query13	0.47	0.47	0.48
query14	1.02	1.03	0.99
query15	0.63	0.60	0.58
query16	0.32	0.32	0.30
query17	1.15	1.14	1.05
query18	0.23	0.21	0.21
query19	2.06	1.99	1.97
query20	0.02	0.01	0.01
query21	15.43	0.18	0.13
query22	5.00	0.06	0.05
query23	16.17	0.30	0.13
query24	2.96	0.45	0.31
query25	0.11	0.05	0.05
query26	0.72	0.22	0.15
query27	0.04	0.04	0.04
query28	3.52	0.95	0.52
query29	12.46	4.30	3.43
query30	0.27	0.17	0.15
query31	2.76	0.59	0.32
query32	3.21	0.61	0.48
query33	3.26	3.27	3.19
query34	15.57	4.18	3.52
query35	3.59	3.51	3.55
query36	0.56	0.42	0.43
query37	0.09	0.06	0.06
query38	0.05	0.04	0.03
query39	0.04	0.03	0.03
query40	0.17	0.16	0.16
query41	0.09	0.03	0.03
query42	0.04	0.03	0.03
query43	0.04	0.04	0.03
Total cold run time: 96.9 s
Total hot run time: 25.08 s

Gabriel39 and others added 2 commits June 26, 2026 19:39
### What problem does this PR solve?

Issue Number: None

Related PR: None

Problem Summary: FileScannerV2 did not support Doris Native files. This
change adds a native v2 FileReader implementation instead of wrapping
the legacy NativeReader. The reader validates the Native header, reads
serialized PBlock payloads, caches and replays the first block for
schema probing, exposes nullable file-local schema, projects requested
columns, and applies file-local filters. Shared materialized-column
filtering is also used by JSON and delimited text readers so predicate
accounting stays consistent. WAL is intentionally not implemented on the
v2 path because current group commit WAL scans are load scans and
FileScanOperator only selects FileScannerV2 when src_tuple_id does not
resolve to an input tuple.

### Release note

None

### Check List (For Author)

- Test:
    - Style check: build-support/check-format.sh
- Unit Test: not run locally because sandbox execution cannot write
.git/modules for submodule setup and cannot download datasketches-cpp;
the attempted run-be-ut command failed before compiling tests.
- Behavior changed: No
- Does this need documentation: No

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
apache/doris-website#1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
@suxiaogang223

Copy link
Copy Markdown
Member

run buildall

@hello-stephen

Copy link
Copy Markdown
Contributor

Cloud UT Coverage Report

Increment line coverage 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 77.38% (1895/2449)
Line Coverage 64.44% (33995/52753)
Region Coverage 64.84% (17490/26973)
Branch Coverage 54.01% (9368/17344)

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-H: Total hot run time: 29032 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 12ffec6d85e2030cdd8b21bedf3a2d72ab544cbc, data reload: false

------ Round 1 ----------------------------------
============================================
q1	17698	3988	4008	3988
q2	2006	306	197	197
q3	10870	1379	837	837
q4	4739	462	339	339
q5	8214	856	570	570
q6	310	172	136	136
q7	838	843	627	627
q8	10641	1596	1595	1595
q9	5671	4585	4469	4469
q10	6827	1794	1500	1500
q11	455	271	240	240
q12	635	426	297	297
q13	18105	3921	2755	2755
q14	269	277	241	241
q15	q16	774	771	701	701
q17	2107	995	1019	995
q18	6742	5771	5529	5529
q19	1635	1280	1046	1046
q20	517	406	266	266
q21	5909	2590	2403	2403
q22	425	353	301	301
Total cold run time: 105387 ms
Total hot run time: 29032 ms

----- Round 2, with runtime_filter_mode=off -----
============================================
q1	4405	4267	4308	4267
q2	317	340	222	222
q3	4504	4980	4406	4406
q4	2049	2150	1335	1335
q5	4456	4318	4280	4280
q6	229	276	227	227
q7	2151	1750	1551	1551
q8	2486	2113	2097	2097
q9	8089	8135	8042	8042
q10	4809	4727	4538	4538
q11	578	432	379	379
q12	791	749	557	557
q13	3149	3613	3007	3007
q14	299	294	277	277
q15	q16	715	765	632	632
q17	1340	1343	1307	1307
q18	7949	7548	6841	6841
q19	1128	1149	1061	1061
q20	2220	2221	1948	1948
q21	5267	4575	4389	4389
q22	510	452	428	428
Total cold run time: 57441 ms
Total hot run time: 51791 ms

@hello-stephen

Copy link
Copy Markdown
Contributor
TPC-DS: Total hot run time: 171569 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 12ffec6d85e2030cdd8b21bedf3a2d72ab544cbc, data reload: false

query5	4326	643	502	502
query6	449	199	177	177
query7	4818	581	304	304
query8	342	189	170	170
query9	8754	4111	4093	4093
query10	435	318	260	260
query11	5851	2368	2153	2153
query12	159	101	100	100
query13	1296	627	440	440
query14	6295	5354	4942	4942
query14_1	4382	4323	4331	4323
query15	222	205	184	184
query16	996	474	456	456
query17	1056	740	606	606
query18	2509	476	357	357
query19	204	191	146	146
query20	115	108	107	107
query21	217	146	127	127
query22	13676	13681	13480	13480
query23	17397	16488	16067	16067
query23_1	16258	16143	16288	16143
query24	7606	1791	1282	1282
query24_1	1360	1331	1327	1327
query25	571	479	394	394
query26	1315	331	173	173
query27	2630	566	359	359
query28	4465	2036	2008	2008
query29	1104	648	494	494
query30	312	225	207	207
query31	1123	1085	962	962
query32	108	64	65	64
query33	561	324	268	268
query34	1177	1132	644	644
query35	775	793	701	701
query36	1388	1391	1204	1204
query37	158	110	97	97
query38	1881	1744	1634	1634
query39	925	898	906	898
query39_1	867	876	910	876
query40	238	157	101	101
query41	64	63	62	62
query42	92	88	87	87
query43	330	329	293	293
query44	1499	801	783	783
query45	201	191	175	175
query46	1054	1267	743	743
query47	2336	2337	2221	2221
query48	403	397	297	297
query49	583	419	334	334
query50	962	358	257	257
query51	4416	4412	4320	4320
query52	81	83	70	70
query53	245	275	188	188
query54	269	234	230	230
query55	74	77	65	65
query56	231	222	214	214
query57	1414	1387	1299	1299
query58	246	216	209	209
query59	1590	1624	1382	1382
query60	288	253	227	227
query61	156	150	157	150
query62	689	650	583	583
query63	225	188	207	188
query64	2532	783	610	610
query65	4772	4787	4781	4781
query66	1785	463	339	339
query67	28746	28776	28600	28600
query68	3008	1498	997	997
query69	411	296	266	266
query70	1061	967	957	957
query71	287	239	223	223
query72	2958	2627	2331	2331
query73	855	750	449	449
query74	5145	5012	4774	4774
query75	2572	2564	2203	2203
query76	2326	1244	795	795
query77	355	385	291	291
query78	12390	12473	11824	11824
query79	1387	1158	804	804
query80	730	483	396	396
query81	466	280	241	241
query82	583	158	127	127
query83	368	276	254	254
query84	294	150	117	117
query85	902	525	415	415
query86	425	304	271	271
query87	1854	1835	1780	1780
query88	3730	2805	2800	2800
query89	433	385	338	338
query90	1883	189	178	178
query91	172	163	132	132
query92	64	59	60	59
query93	1506	1532	887	887
query94	616	366	330	330
query95	691	375	446	375
query96	1054	806	340	340
query97	2666	2720	2558	2558
query98	215	209	208	208
query99	1155	1136	1020	1020
Total cold run time: 256398 ms
Total hot run time: 171569 ms

@hello-stephen

Copy link
Copy Markdown
Contributor
ClickBench: Total hot run time: 25.96 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 12ffec6d85e2030cdd8b21bedf3a2d72ab544cbc, data reload: false

query1	0.01	0.01	0.01
query2	0.14	0.11	0.09
query3	0.37	0.25	0.25
query4	1.61	0.23	0.23
query5	0.33	0.32	0.32
query6	1.17	0.67	0.67
query7	0.04	0.01	0.01
query8	0.09	0.08	0.07
query9	0.53	0.38	0.38
query10	0.59	0.59	0.58
query11	0.33	0.18	0.17
query12	0.31	0.19	0.19
query13	0.53	0.53	0.53
query14	0.94	0.93	0.92
query15	0.68	0.60	0.59
query16	0.38	0.39	0.40
query17	1.00	1.01	1.01
query18	0.31	0.29	0.29
query19	1.91	1.80	1.80
query20	0.02	0.02	0.01
query21	15.40	0.36	0.32
query22	4.84	0.14	0.14
query23	15.85	0.50	0.31
query24	2.40	0.62	0.43
query25	0.16	0.11	0.10
query26	0.74	0.27	0.23
query27	0.10	0.10	0.11
query28	3.45	0.89	0.53
query29	12.47	4.48	3.50
query30	0.37	0.27	0.28
query31	2.77	0.63	0.34
query32	3.25	0.60	0.48
query33	2.97	3.00	3.02
query34	15.67	4.04	3.36
query35	3.31	3.26	3.25
query36	0.63	0.53	0.49
query37	0.12	0.10	0.10
query38	0.09	0.07	0.06
query39	0.08	0.06	0.06
query40	0.20	0.18	0.18
query41	0.12	0.08	0.08
query42	0.09	0.06	0.06
query43	0.07	0.06	0.07
Total cold run time: 96.44 s
Total hot run time: 25.96 s

@hello-stephen

Copy link
Copy Markdown
Contributor

BE Regression && UT Coverage Report

Increment line coverage 75.70% (12097/15981) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 72.69% (28740/39536)
Line Coverage 56.26% (309819/550683)
Region Coverage 52.96% (258648/488418)
Branch Coverage 53.88% (113109/209921)

@hello-stephen

Copy link
Copy Markdown
Contributor

FE Regression Coverage Report

Increment line coverage 40.00% (2/5) 🎉
Increment coverage report
Complete coverage report

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants