@GuoPhilipse

Description of PR

How was this patch tested?

For code changes:

  • Does the title of this PR start with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')?
  • Object storage: have the integration tests been executed and the endpoint declared according to the connector-specific documentation?
  • If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under ASF 2.0?
  • If applicable, have you updated the LICENSE, LICENSE-binary, NOTICE-binary files?

monthonk and others added 30 commits June 8, 2022 19:05
#3877)


Adds a new option fs.s3a.create.storage.class which can
be used to set the storage class for files created in AWS S3.
Consult the documentation for details and instructions on how
to disable the relevant tests when testing against third-party
stores.

Contributed by Monthon Klongklaew
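
As a rough illustration (not taken from the patch itself), the option can be set on the configuration used to create the S3A filesystem; the bucket name, path and storage class value below are assumptions for the example:

```java
import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class StorageClassExample {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    // Storage class applied to objects created through this S3A filesystem instance.
    conf.set("fs.s3a.create.storage.class", "reduced_redundancy");
    try (FileSystem fs = FileSystem.get(new URI("s3a://example-bucket/"), conf)) {
      fs.create(new Path("/data/example.txt")).close();
    }
  }
}
```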
…, updateApplicationTimeouts API's for Federation (#4396)
…-csi (#4417)


This is a follow-up to HADOOP-18275 and its upgrade of os-maven-plugin.version.
When that change is merged in, this MUST follow it.

Contributed by Steve Loughran
Co-authored-by: Ashutosh Gupta <[email protected]>
Reviewed-by: Tao Li <[email protected]>
Signed-off-by: Akira Ajisaka <[email protected]>
* jnihelper.c in HDFS native client uses
  dirent.h. This header file isn't available
  on Windows.
* This PR provides a cross platform
  compatible implementation for dirent
  under the XPlatform library.
* HDFS-16623. Avoid IllegalArgumentException in LifelineSender

Co-authored-by: zengqiang.xu <[email protected]>
…4366). Contributed by ZanderXu.

Reviewed-by: Mingxiang Li <[email protected]>
Signed-off-by: He Xiaoqiao <[email protected]>
* We use the TARGET_FILE CMake
  generator expression to get
  the location of the
  protoc-gen-hrpc CMake target.
* Maven runs the ember build script.
  The environment variable TMPDIR was
  set as per bash syntax.
* This failed on Windows since the
  Windows command prompt doesn't
  support bash syntax.
* We're now detecting the OS and
  setting a Maven property
  "emberBuildScript" in a cross
  platform compatible way.
…ros login user. (#4424). Contributed by Xiping Zhang.
…ist.txt (#4444)


Bump cos_api-bundle to 5.6.69

All copies of httpclient, including shaded ones in libraries used
by the s3a, gs and cos cloud connectors, turn out to load their
TLD list from the same resource mozilla/public-suffix-list.txt 

Updating the hadoop-cos dependency ensures that its version
of public-suffix-list.txt is up to date, and so the s3a connector
is able to talk to S3 resources when the cos-api-bundle JAR is where
the resource is loaded from.

Contributed by André Fonseca
Co-authored-by: slfan1989 <louj1988@@>
…ode. (#4367). Contributed by ZanderXu.

Reviewed-by: Mingxiang Li <[email protected]>
Reviewed-by: Ayush Saxena <[email protected]>
Signed-off-by: He Xiaoqiao <[email protected]>
Speed up the magic committer, with the key changes being:

* Writes under __magic always retain directory markers.

* File creation under __magic skips all overwrite checks,
  including the LIST call intended to stop files being
  created over dirs.
* mkdirs under __magic probes the path for existence
  but does not look any further.

Extra parallelism in task and job commit directory scanning.
Use of createFile and openFile with parameters which allow
HEAD checks to be skipped.

The committer can write the summary _SUCCESS file to the path
`fs.s3a.committer.summary.report.directory`, which can be in a
different file system/bucket if desired, using the job id as
the filename. 

Also: HADOOP-15460. S3A FS to add `fs.s3a.create.performance`

Application code can set the createFile() option
fs.s3a.create.performance to true to disable the same
safety checks when writing under magic directories.
Use with care.

The createFile option prefix `fs.s3a.create.header.`
can be used to add custom headers to S3 objects when
created.


Contributed by Steve Loughran.
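
A hedged sketch of how application code might pass the createFile() options named above through the builder API; the path and the header name used here are invented for illustration only:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class CreateFileOptionsExample {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Path path = new Path("s3a://example-bucket/output/part-00000");
    FileSystem fs = path.getFileSystem(conf);
    try (FSDataOutputStream out = fs.createFile(path)
        // Skip the overwrite/existence safety checks; use with care.
        .opt("fs.s3a.create.performance", true)
        // Custom object header added through the fs.s3a.create.header. prefix;
        // the header name here is only an example.
        .opt("fs.s3a.create.header.content-encoding", "gzip")
        .build()) {
      out.write(new byte[] {1, 2, 3});
    }
  }
}
```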
…DFS-16563. (#4408)


Regression caused by HDFS-16563; the hdfs exception text was changed, but because it was
a YARN test doing the check, Yetus didn't notice.

Contributed by zhengchenyu
…mber of usable replicas (#4410)

Co-authored-by: Kevin Wikant <[email protected]>
Signed-off-by: Akira Ajisaka <[email protected]>
Co-authored-by: Ashutosh Gupta <[email protected]>

Reviewed by Akira Ajisaka.
Samrat002 and others added 29 commits June 22, 2022 10:17
…-16202 (#4472)


Fixing a mockito-based test which broke when HADOOP-16202
changed the methods being invoked.

Contributed by Steve Loughran
…d even if multiple log aggregation file controllers are configured. Contributed by Szilard Nemeth.
Part of HADOOP-18103.
Add support for a multi-range vectored read API in PositionedReadable.
The default implementation iterates through the ranges, reading each synchronously,
but the intent is that FSDataInputStream subclasses can provide more
efficient readers, especially in object store implementations.

Also adds an implementation in S3A where smaller ranges are merged and
sliced byte buffers are returned to the readers. All the merged ranges are
fetched from S3 asynchronously.

Contributed By: Owen O'Malley and Mukund Thakur
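
As a rough usage sketch (the file path, offsets and lengths are illustrative assumptions), the new API can be exercised along these lines:

```java
import java.nio.ByteBuffer;
import java.util.Arrays;
import java.util.List;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileRange;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class VectoredReadExample {
  public static void main(String[] args) throws Exception {
    Path path = new Path("s3a://example-bucket/data/file.parquet");
    FileSystem fs = path.getFileSystem(new Configuration());
    List<FileRange> ranges = Arrays.asList(
        FileRange.createFileRange(0, 1024),                 // first kilobyte
        FileRange.createFileRange(4 * 1024 * 1024, 8192));  // a later block
    try (FSDataInputStream in = fs.open(path)) {
      // Ranges are fetched asynchronously; S3A may merge nearby ranges.
      in.readVectored(ranges, ByteBuffer::allocate);
      for (FileRange range : ranges) {
        ByteBuffer data = range.getData().get();  // block until that range completes
        System.out.println("read " + data.remaining() + " bytes at " + range.getOffset());
      }
    }
  }
}
```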
… maxReadSizeForVectorReads (#3964)

Part of HADOOP-18103.
Introduces fs.s3a.vectored.read.min.seek.size and fs.s3a.vectored.read.max.merged.size
to configure the minimum seek and maximum read size during a vectored IO operation in the S3A connector.
These properties define how the ranges will be merged. To completely
disable merging, set fs.s3a.max.readsize.vectored.read to 0.

Contributed By: Mukund Thakur
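
A hedged example of tuning these properties on the job configuration; the byte values below are arbitrary, not recommended defaults:

```java
import org.apache.hadoop.conf.Configuration;

public class VectoredReadTuning {
  public static void main(String[] args) {
    Configuration conf = new Configuration();
    // Ranges closer together than this may be merged into a single request.
    conf.set("fs.s3a.vectored.read.min.seek.size", "4096");
    // Upper bound on the size of a merged range.
    conf.set("fs.s3a.vectored.read.max.merged.size", "1048576");
  }
}
```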
Part of HADOOP-18103.
Required for the vectored IO feature. None of the current buffer pool
implementations is complete: ElasticByteBufferPool doesn't use
weak references and could lead to memory leaks, while
DirectBufferPool doesn't support caller preferences for direct
or heap buffers and only offers fixed-length buffers.

Contributed By: Mukund Thakur
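
A simplified sketch of the idea, not the class added by the patch: pooled buffers are held via weak references so the garbage collector can reclaim them under memory pressure, and callers can ask for either direct or heap buffers.

```java
import java.lang.ref.WeakReference;
import java.nio.ByteBuffer;
import java.util.Map;
import java.util.TreeMap;

public class WeakBufferPoolSketch {
  // One tree per buffer kind, keyed by capacity, holding weak references.
  // (Only one buffer per capacity is kept, to keep the sketch small.)
  private final TreeMap<Integer, WeakReference<ByteBuffer>> direct = new TreeMap<>();
  private final TreeMap<Integer, WeakReference<ByteBuffer>> heap = new TreeMap<>();

  public synchronized ByteBuffer getBuffer(boolean isDirect, int length) {
    TreeMap<Integer, WeakReference<ByteBuffer>> pool = isDirect ? direct : heap;
    // Find the smallest pooled buffer at least as large as requested.
    Map.Entry<Integer, WeakReference<ByteBuffer>> entry = pool.ceilingEntry(length);
    if (entry != null) {
      pool.remove(entry.getKey());
      ByteBuffer cached = entry.getValue().get();
      if (cached != null) {        // may have been collected already
        cached.clear();
        return cached;
      }
    }
    return isDirect ? ByteBuffer.allocateDirect(length) : ByteBuffer.allocate(length);
  }

  public synchronized void putBuffer(ByteBuffer buffer) {
    TreeMap<Integer, WeakReference<ByteBuffer>> pool = buffer.isDirect() ? direct : heap;
    pool.put(buffer.capacity(), new WeakReference<>(buffer));
  }
}
```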
Part of HADOOP-18103.
Handles memory fragmentation in the S3A vectored IO implementation by
allocating buffers of the smaller, user-requested range sizes, directly
filling them from the remote S3 stream, and skipping undesired
data in between ranges.
This patch also aborts active vectored reads when the stream is
closed or unbuffer() is called.

Contributed By: Mukund Thakur
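
Roughly, for one merged request the reader allocates a buffer per original child range and discards the gap bytes in between. A simplified sketch with invented helper types, not the S3A code itself:

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;
import java.util.List;

public class RangeDrainSketch {
  /** A child range inside a merged request; the offset is absolute. */
  static final class ChildRange {
    final long offset;
    final int length;
    ChildRange(long offset, int length) { this.offset = offset; this.length = length; }
  }

  /**
   * Read the child ranges of one merged request from a stream positioned
   * at mergedStart, allocating only child-sized buffers and skipping the
   * unwanted bytes between ranges.
   */
  static void readChildRanges(InputStream stream, long mergedStart,
                              List<ChildRange> children) throws IOException {
    DataInputStream in = new DataInputStream(stream);
    long position = mergedStart;
    for (ChildRange child : children) {
      long toSkip = child.offset - position;          // gap before this range
      while (toSkip > 0) {
        long skipped = in.skip(toSkip);
        if (skipped <= 0) {
          throw new EOFException("Unable to skip to offset " + child.offset);
        }
        toSkip -= skipped;
      }
      byte[] data = new byte[child.length];           // exactly the requested size
      in.readFully(data);
      position = child.offset + child.length;
      // hand `data` back to the caller's future here (omitted in this sketch)
    }
  }
}
```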
This feature adds methods for ranged vectored read operations
in PositionedReadable.

All streams which implement that interface support the new API.

The default implementation reads each range in the vector
sequentially.

However, specific implementations may provide higher-performance
versions. This is done in two places:

* Local FileSystem/Checksum FileSystem
* The S3A client.

The S3A client first coalesces adjacent and "nearby" ranges
together, then fetches each range in separate HTTP GET requests,
executed in parallel. As such it delivers significant speedups
to applications reading separate blocks of data from the same
file, columnar data format libraries in particular.

This is the merge commit of the feature branch; the work is in

HADOOP-11867. Add a high-performance vectored read API.
HADOOP-18104. S3A: Add configs to configure minSeekForVectorReads and maxReadSizeForVectorReads.
HADOOP-18107. Adding scale test for vectored reads for large file
HADOOP-18105. Implement buffer pooling with weak references.
HADOOP-18106. Handle memory fragmentation in S3A Vectored IO.

Contributed By: Owen O'Malley and Mukund Thakur
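
The coalescing step can be pictured as below; this is a simplified sketch rather than the S3A implementation, and minSeek/maxMerged stand in for the configuration options described earlier:

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import org.apache.hadoop.fs.FileRange;

public class CoalesceSketch {
  /** Merge sorted ranges whose gap is below minSeek, capping the merged size. */
  static List<long[]> coalesce(List<? extends FileRange> ranges,
                               long minSeek, long maxMerged) {
    List<FileRange> sorted = new ArrayList<>(ranges);
    sorted.sort(Comparator.comparingLong(FileRange::getOffset));
    List<long[]> merged = new ArrayList<>();    // each entry: {start, end}
    long start = -1, end = -1;
    for (FileRange r : sorted) {
      long rStart = r.getOffset();
      long rEnd = rStart + r.getLength();
      if (start >= 0 && rStart - end <= minSeek && rEnd - start <= maxMerged) {
        end = Math.max(end, rEnd);              // close enough: extend current group
      } else {
        if (start >= 0) {
          merged.add(new long[] {start, end});  // flush the previous group
        }
        start = rStart;
        end = rEnd;
      }
    }
    if (start >= 0) {
      merged.add(new long[] {start, end});
    }
    return merged;
  }
}
```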
…. Contributed by fanshilun.

Signed-off-by: Ayush Saxena <[email protected]>
…gation (#4486)

* YARN-10320. Replace FSDataInputStream#read with readFully in Log Aggregation

Co-authored-by: Ashutosh Gupta <[email protected]>
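
The distinction motivating the change, in a hedged sketch (the method below is illustrative, not the log aggregation code): a single read() may return fewer bytes than requested, while readFully() keeps reading until the buffer is filled or throws EOFException.

```java
import java.io.IOException;
import org.apache.hadoop.fs.FSDataInputStream;

public class ReadFullyExample {
  static void copyBlock(FSDataInputStream in, long offset, byte[] buffer)
      throws IOException {
    // A plain in.read(buffer) may return fewer bytes than buffer.length,
    // silently truncating the data being copied. readFully() either fills
    // the whole buffer or throws EOFException.
    in.readFully(offset, buffer, 0, buffer.length);
  }
}
```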
…onStore#confirmMutation (#4487)

Co-authored-by: Ashutosh Gupta <[email protected]>
…n some cases (#4452)

* HDFS-16633. Reserved Space For Replicas is not released on some cases

Co-authored-by: Ashutosh Gupta <[email protected]>
Update the dependencies of the LDAP libraries used for testing:

ldap-api.version = 2.0.0
apacheds.version = 2.0.0.AM26

Contributed by Colm O hEigeartaigh.
…omplete state (#4331)


ABFS rename fails intermittently when the Storage-blob tracking
metadata is in an incomplete state. This surfaces as the error code
404 and an error message of "RenameDestinationParentPathNotFound".

To mitigate this issue, when a request fails with this response,
the ABFS client issues a HEAD call on the source file
and then retries the rename operation.

ABFS filesystem statistics track when this occurs with new counters:
  rename_recovery
  metadata_incomplete_rename_failures
  rename_path_attempts

This is a very rare occurrence and appears to be triggered under certain
heavy load conditions, just as with HADOOP-18163.

Contributed by Mehakmeet Singh.
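
A hedged sketch of the recovery flow; the exception type caught and the probe shown are assumptions for illustration, not the ABFS driver's actual API:

```java
import java.io.FileNotFoundException;
import java.io.IOException;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class RenameRecoverySketch {
  /**
   * Rename with one recovery attempt when the store reports a 404 such as
   * RenameDestinationParentPathNotFound while the source still exists.
   */
  static boolean renameWithRecovery(FileSystem fs, Path src, Path dst)
      throws IOException {
    try {
      return fs.rename(src, dst);
    } catch (FileNotFoundException e) {
      // HEAD-equivalent probe of the source: if it is still there, the 404
      // was likely caused by incomplete tracking metadata, so retry once.
      if (fs.exists(src)) {
        return fs.rename(src, dst);
      }
      throw e;
    }
  }
}
```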
…user not present on client (#4474). Contributed by swamirishi.
…etionService.stopRMClient. Contributed by Szilard Nemeth.
…Tamas Domok.

Change-Id: I55ddb46fd0e4cdb644747d6d43083215f10861b5
…emoving queue which is referred in queue mapping (#4515)

* YARN-10287. Update scheduler-conf corrupts the CS configuration when removing queue which is referred in queue mapping

Co-authored-by: Ashutosh Gupta <[email protected]>
…ation table instead of entity table (#4516)

Co-authored-by: Ashutosh Gupta <[email protected]>
GuoPhilipse merged commit b5b8482 into GuoPhilipse:trunk on Jul 3, 2022