-
Notifications
You must be signed in to change notification settings - Fork 25.7k
Setting elastic password from stored hash #77036
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
In package installations, we will be generating the password of the elastic user on installation and we will be storing the hash of it to the elasticsearch.keystore. This change will - attempt to create the security index and elastic user document at the time of the first successful authentication using the the "promised" password - if bootstrap.password setting is detected, won’t attempt to use autoconfiguration.password_hash setting and proceed with the normal authentication path - if elastic user document exists, won't attempt to use autoconfiguration.password_hash setting and proceed with the normal authentication path - if unable to create the Security Index for any reason, returns HTTP_SERVER_ERROR to indicate that the cluster is probably not ready yet Resolves: elastic#75704
|
Pinging @elastic/es-security (Team:Security) |
|
@elasticmachine update branch |
jkakavas
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks Lyudmila, I think this does what we've set out to do in the design doc. I think it's ++ to polish and open as a proper PR when you feel ready for it. We might be able to wire things differently in the ReservedRealm class but I don't have a concrete suggestion now, nor specific concerns around your approach. Let's see what @albertzaharovits thinks also
| /** | ||
| * Asynchronous method to create a security index if necessary and a reserved user if one doesn't exist | ||
| */ | ||
| public void createReservedUserAndGetUserInfo(String username, char[] passwordHash, RefreshPolicy refresh, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we split this in two methods and pass the listener from the first to the second ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why is refresh policy a parameter if we always set it to IMMEDIATE?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This method overall is poorly designed.
There is a very similar private method above. This one here chains that one with another one, for no apparent reason, to me, at least. Afterwards it is going to be used in a very particular circumstance (creating the automatically configured elastic user), but its interface is designed to be used in general, which goes against private qualifier of the method above.
Please follow my suggestion from https://github.com/elastic/elasticsearch/pull/77036/files#r701072274
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the suggestion @albertzaharovits! I tried this approach, I think the main problem with it is that if an elastic user document exists, we will fail instead of proceeding to validate against existing document information.
There are other differences justifying having it as a separate method: we need special exception handling to not return an UNAUTHORIZED on index creation failure, we don't need to clearRealmCache since the document wasn't updated.
I have refactored the code though.
| if ((bootstrapPasswordExists == false && autoconfigHashExists) && username.equals(ElasticUser.NAME) | ||
| && bootstrapUserInfo.verifyPassword(credentials)) { | ||
| nativeUsersStore.createReservedUserAndGetUserInfo(username, bootstrapUserInfo.passwordHash, | ||
| WriteRequest.RefreshPolicy.IMMEDIATE, listener); | ||
| } else { | ||
| listener.onResponse(getDefaultUserInfo(username)); | ||
| } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we can extract this in a method instead of duplicating the code here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
++ But we don't need a new method even, we can move this logic in the caller:
@@ -283,6 +284,11 @@ public class NativeUsersStore {
});
}
+ void storeAutoConfigElasticUser(ActionListener<User> listener) {
+ createReservedUser(ElasticUser.NAME, autoConfigUserInfo.hash, RefreshPolicy.IMMEDIATE,
+ listener.map(aVoid -> new ElasticUser(true)));
+ }
+
/**
* Asynchronous method to put a user. A put user request without a password hash is treated as an update and will fail with a
* {@link ValidationException} if the user does not exist. If a password hash is provided, then we issue a update request with an
diff --git a/x-pack/plugin/security/src/main/java/org/elasticsearch/xpack/security/authc/esnative/ReservedRealm.java b/x-pack/plugin/security/src/main/java/org/elasticsearch/xpack/security/authc/esnative/ReservedRealm.java
index 2dea4b8b1f2..65382de98ca 100644
--- a/x-pack/plugin/security/src/main/java/org/elasticsearch/xpack/security/authc/esnative/ReservedRealm.java
+++ b/x-pack/plugin/security/src/main/java/org/elasticsearch/xpack/security/authc/esnative/ReservedRealm.java
@@ -102,6 +102,10 @@ public class ReservedRealm extends CachingUsernamePasswordRealm {
if (userInfo.hasEmptyPassword()) {
result = AuthenticationResult.terminate("failed to authenticate user [" + token.principal() + "]", null);
} else if (userInfo.verifyPassword(token.credentials())) {
+ if (userInfo == autoconfigUserInfo)) {
+ nativeUsersStore.storeAutoConfigElasticUser(listener.map(user -> AuthenticationResult.success(user)));
+ return;
+ }
final User user = getUser(token.principal(), userInfo);
logDeprecatedUser(user);
result = AuthenticationResult.success(user);
@@ -242,7 +246,12 @@ public class ReservedRealm extends CachingUsernamePasswordRealm {
private ReservedUserInfo getDefaultUserInfo(String username) {
if (ElasticUser.NAME.equals(username)) {
- return bootstrapUserInfo.deepClone();
+ if (autoconfigHashExists && false == bootstrapPasswordExists) {
+ assert autoconfigUserInfo != null;
+ return autoconfigUserInfo;
+ } else {
+ return bootstrapUserInfo.deepClone();
+ }
} else {
return ReservedUserInfo.defaultEnabledUserInfo();
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See ^^
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the main problem with leaving this to the caller of getUserInfo is that it has no way of knowing if the ReservedUserInfo it got is from the keystore or the index (i.e. differentiate between the 1st authentication with that password and subsequent ones )
What if we move the call to just re-read about the need to return a custom exceptionnativeUsersStore.storeAutoConfigElasticUser here ?
| final char[] hash = (bootstrapPasswordExists || autoconfigHashExists == false) ? | ||
| (BOOTSTRAP_ELASTIC_PASSWORD.get(settings).length() == 0 ? | ||
| new char[0] : reservedRealmHasher.hash(BOOTSTRAP_ELASTIC_PASSWORD.get(settings))) : | ||
| AUTOCONFIG_BOOTSTRAP_ELASTIC_PASSWORD_HASH.get(settings).getChars(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This complication here is unnecessary, because when you are going to use the bootstrapInfo, you are still going to duplicate the checks. Better to have two separate variables.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have refactored it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The only way for BOOTSTRAP_ELASTIC_PASSWORD.get(settings).length() == 0 to be true is if both bootstrap.password and keystore.seed are missing, which is not a proper state.
Can't we simplify this to just :
if (BOOTSTRAP_ELASTIC_PASSWORD.exists(settings)) {
hash = reservedRealmHasher.hash(BOOTSTRAP_ELASTIC_PASSWORD.get(settings);
} else {
hash = AUTOCONFIG_BOOTSTRAP_ELASTIC_PASSWORD_HASH.get(settings).getChars();
}?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Still, I think Albert's comment was about the fact that you try to use only bootstrapUserInfo here in both cases but this is unncessary complexity since below when you actually going to check against it, you will run the same checks about bootstrap.password and autoconfiguration.password_hash again.
You could have an autoconfiguredElasticUserInfo ReservedUserInfovariable here that gets populated with AUTOCONFIG_BOOTSTRAP_ELASTIC_PASSWORD_HASH.get(settings).getChars() and below in getUserInfo you can call verifyPassword on that when needed. Or maybe not even that and use a local ReservedUserInfo variable just there if you need it?
| Fields.ENABLED.getPreferredName(), | ||
| true, Fields.TYPE.getPreferredName(), RESERVED_USER_TYPE) | ||
| .setRefreshPolicy(refresh).request(), | ||
| listener.<IndexResponse>delegateFailure((l, indexResponse) -> getReservedUserInfo(username, l)), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The get after a successful create is unnecessary.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See ^^
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Apologies for jumping in but I had a similar comment when doing my review and I can't find where above (^^) this comment was addressed/explained
albertzaharovits
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The logic looks OK, the placement of it definitely needs a more careful consideration (eg we shouldn't do a store in a get).
I've left a rough code suggestion in https://github.com/elastic/elasticsearch/pull/77036/files#r701072274
|
@elasticmachine update branch |
|
@elasticmachine update branch |
| reservedRealmHasher.hash(BOOTSTRAP_ELASTIC_PASSWORD.get(settings)); | ||
| autoconfigHashExists = AUTOCONFIG_BOOTSTRAP_ELASTIC_PASSWORD_HASH.exists(settings); | ||
| bootstrapPasswordExists = BOOTSTRAP_ELASTIC_PASSWORD.exists(settings); | ||
| final char[] hash = (bootstrapPasswordExists || autoconfigHashExists == false) ? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I find nested ternary operators really hard to read, can we write this in a clearer way please?
| final char[] hash = (bootstrapPasswordExists || autoconfigHashExists == false) ? | ||
| (BOOTSTRAP_ELASTIC_PASSWORD.get(settings).length() == 0 ? | ||
| new char[0] : reservedRealmHasher.hash(BOOTSTRAP_ELASTIC_PASSWORD.get(settings))) : | ||
| AUTOCONFIG_BOOTSTRAP_ELASTIC_PASSWORD_HASH.get(settings).getChars(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The only way for BOOTSTRAP_ELASTIC_PASSWORD.get(settings).length() == 0 to be true is if both bootstrap.password and keystore.seed are missing, which is not a proper state.
Can't we simplify this to just :
if (BOOTSTRAP_ELASTIC_PASSWORD.exists(settings)) {
hash = reservedRealmHasher.hash(BOOTSTRAP_ELASTIC_PASSWORD.get(settings);
} else {
hash = AUTOCONFIG_BOOTSTRAP_ELASTIC_PASSWORD_HASH.get(settings).getChars();
}?
| final char[] hash = (bootstrapPasswordExists || autoconfigHashExists == false) ? | ||
| (BOOTSTRAP_ELASTIC_PASSWORD.get(settings).length() == 0 ? | ||
| new char[0] : reservedRealmHasher.hash(BOOTSTRAP_ELASTIC_PASSWORD.get(settings))) : | ||
| AUTOCONFIG_BOOTSTRAP_ELASTIC_PASSWORD_HASH.get(settings).getChars(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Still, I think Albert's comment was about the fact that you try to use only bootstrapUserInfo here in both cases but this is unncessary complexity since below when you actually going to check against it, you will run the same checks about bootstrap.password and autoconfiguration.password_hash again.
You could have an autoconfiguredElasticUserInfo ReservedUserInfovariable here that gets populated with AUTOCONFIG_BOOTSTRAP_ELASTIC_PASSWORD_HASH.get(settings).getChars() and below in getUserInfo you can call verifyPassword on that when needed. Or maybe not even that and use a local ReservedUserInfo variable just there if you need it?
| if ((bootstrapPasswordExists == false && autoconfigHashExists) && username.equals(ElasticUser.NAME) | ||
| && bootstrapUserInfo.verifyPassword(credentials)) { | ||
| nativeUsersStore.createReservedUserAndGetUserInfo(username, bootstrapUserInfo.passwordHash, | ||
| WriteRequest.RefreshPolicy.IMMEDIATE, listener); | ||
| } else { | ||
| listener.onResponse(getDefaultUserInfo(username)); | ||
| } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the main problem with leaving this to the caller of getUserInfo is that it has no way of knowing if the ReservedUserInfo it got is from the keystore or the index (i.e. differentiate between the 1st authentication with that password and subsequent ones )
What if we move the call to just re-read about the need to return a custom exceptionnativeUsersStore.storeAutoConfigElasticUser here ?
...in/java/org/elasticsearch/xpack/core/security/authc/DefaultAuthenticationFailureHandler.java
Show resolved
Hide resolved
| Fields.ENABLED.getPreferredName(), | ||
| true, Fields.TYPE.getPreferredName(), RESERVED_USER_TYPE) | ||
| .setRefreshPolicy(refresh).request(), | ||
| listener.<IndexResponse>delegateFailure((l, indexResponse) -> getReservedUserInfo(username, l)), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Apologies for jumping in but I had a similar comment when doing my review and I can't find where above (^^) this comment was addressed/explained
| } | ||
| } | ||
| assertTrue(securityIndexCreated); | ||
| securityIndexExistsAuthenticate(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why are we doing this call to another method that is actually the same method as we are in right now ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just validating that now when the index exists we can authenticate (not sure it is necessary...)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd simply do
ClusterHealthResponse response = client()
.filterWithHeader(singletonMap("Authorization", basicAuthHeaderValue(ElasticUser.NAME,
new SecureString("password1".toCharArray()))))
.admin()
.cluster()
.prepareHealth().setWaitForEvents(Priority.LANGUID).setWaitForGreenStatus()
.get();
once more here
| boolean securityIndexNotCreated = true; | ||
| for (Map.Entry<String, ClusterIndexHealth> indexEntry: response.getIndices().entrySet()) { | ||
| if (indexEntry.getKey().startsWith(".security")) { | ||
| securityIndexNotCreated = false; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I find this confusing ( the double negation ) , I think what you do above in the previous test is easier to read
| builder.put(LicenseService.SELF_GENERATED_LICENSE_TYPE.getKey(), "trial"); | ||
| builder.put("transport.type", "security4"); | ||
| builder.put("path.home", customSecuritySettingsSource.nodePath(0)); | ||
| builder.setSecureSettings(new MockSecureSettings()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: I'd create the MockSecureSettings first , then call setString, then call builder.setSecureSettings
| } | ||
| ((MockSecureSettings) builder.getSecureSettings()).setString("bootstrap.password", BOOTSTRAP_PASSWORD.toString()); | ||
| // Verify `autoconfiguration.password_hash` is not used when `bootstrap.password` set | ||
| ((MockSecureSettings) builder.getSecureSettings()).setString("autoconfiguration.password_hash", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If the intention here is to see that all other tests that implement SecurityIntegTestCase don't fail , now that you added autoconfiguration.passowrd_hash then I don'think this is the way to do this.
Instead of polluting the secure settings of any node in any test and check implicitly that nothing breaks, add a test case in the newly introduced integ test that ensures that autoconfiguration.password_hash is not used when bootstrap.password is set
| @Override | ||
| public Settings nodeSettings() { | ||
| Settings customSettings = customSecuritySettingsSource.nodeSettings(0, Settings.EMPTY); | ||
| Settings.Builder builder = Settings.builder().put(customSettings, false); // handle secure settings separately |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"handle secure settings separately" == don't bring in bootstrap.password
|
@elasticmachine update branch |
|
@elasticmachine update branch |
| import static org.hamcrest.Matchers.notNullValue; | ||
|
|
||
| public class PasswordHashAndBootstrpPwdIntegTests extends SecuritySingleNodeTestCase { | ||
| public class PasswordHashAndBootstrapPwdInterTests extends SecuritySingleNodeTestCase { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
PasswordHashAndBootstrapPasswordIntegTests
| } | ||
| } | ||
|
|
||
| private void getAutoconfiguredUser(final String username, SecureString credentials, ActionListener<ReservedUserInfo> listener) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Naming is hard.. does this get the/an auto configured user ? Don't have a good suggestion
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You are right, not necessary autoconfigured. May be getAutoconfiguredOrDefaultUser
...st/java/org/elasticsearch/xpack/security/authc/esnative/PasswordHashPromotionIntegTests.java
Show resolved
Hide resolved
jkakavas
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM , I left some comments for consideration but won't block. Feel free to merge this once CI is green
| if (docType.equals(RESERVED_USER_TYPE)) { | ||
| createReservedUser(username, request.passwordHash(), request.getRefreshPolicy(), listener); | ||
| createOrUpdateReservedUser(username, request.passwordHash(), request.getRefreshPolicy(), | ||
| UPDATE, listener::onFailure, listener); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This needs to be INDEX, not UPDATE
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just figured it out!
jkakavas
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
https://github.com/elastic/elasticsearch/pull/77036/files#r703651072 needs to be addressed before merging
`createOrUpdateReservedUser` to INDEX. More cleanup and renaming
|
@elasticmachine run elasticsearch-ci/part-2 |
2 similar comments
|
@elasticmachine run elasticsearch-ci/part-2 |
|
@elasticmachine run elasticsearch-ci/part-2 |
|
@elasticmachine run elasticsearch-ci/part-2-fips |
password won't work when both bootstrap.password and keystore.seed are missing
jkakavas
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, lets hold off merging for now as we discussed
| char[] passwordHash, | ||
| DocWriteRequest.OpType opType, | ||
| RefreshPolicy refresh, | ||
| Consumer<Exception> consumer, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we need this ? We only ever pass the onFailure of the ActionListener we also pass as a parameter, so we can just simply leave this as it was ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Happy to push this change if needed, no worries
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please do! I merged the conflict, but there are some failures I don't have time to look at (my flight is in 2hr ))).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we need this ? We only ever pass the onFailure of the ActionListener we also pass as a parameter, so we can just simply leave this as it was ?
I use it for the special error handling to return INTERNAL_SERVER_ERROR, look at storeAutoconfiguredElasticUser
|
@albertzaharovits we didn't have to merge this before you return so feel free to take another look when you're back and I can make all necessary changes and amendments while Lyudmila is away |
| listener.onFailure(new ElasticsearchStatusException(e.getMessage(), INTERNAL_SERVER_ERROR, e.getCause())); | ||
| } | ||
| }, listener.delegateFailure((l, v) -> l.onResponse(elasticUserInfo.deepClone()))); | ||
| } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The logic here is wrong, I did some testing.
Firstly, the branching on the instance type should be on the listener::onFailure argument, not on the consumer. But even then, intercepting VersionConflictEngineException exceptions in order to return 401 instead of 500 is not correct. It is possible that multiple concurrent requests on the same node try to set the auto configured elastic user password simultaneously, and only one will succeed, but the others will be returned with 401 which is wrong (should signal authn success).
This is tricky I need to think some more for the proper fix.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
From #77036 (comment) , I think we should return 503 (slightly more suggestive than 500, I think) when the promised password cannot be persisted. This is what we set about to do, though back then I haven't considered that any other failures return 401.
Also, in particular, returning 401 because the elastic user becomes present in the .security index in the meantime, is technically wrong because it assumes the update must be with a different password, which might be true in practice, but only because of reserved realm caching, which means that the update comes from a different node, or the user, which probably use a different password to set for elastic, but not necessarily.
| } else if (t instanceof ElasticsearchStatusException && ((ElasticsearchStatusException) t).status() == INTERNAL_SERVER_ERROR) { | ||
| ese = internalServerError(message, t, args); | ||
| containsNegotiateWithToken = false; | ||
| } else { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seeing this code, I'm having second thoughts about the plan here.
I didn't realize we return 401 as a catch all error case.
From this perspective, it makes little sense to return 401 if, eg, the index cannot be created, but return 500 if the elastic password had already been changed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
From this perspective, it makes little sense to return 401 if, eg, the index cannot be created, but return 500 if the elastic password had already been changed.
It's the other way round, right ?
We can simplify this very much by just returning 401 in all of our cases too. What we were trying to capture was a difference between:
- "we checked, this is the wrong password - not the one in the keystore" (401)
- "we checked, this is the wrong password - it's the one in the local keystore but something else has set it on the index" (401)
- "we checked, this is the correct password but we can't persist it and some other node might set it to something else later, so next time you call it might be a 401 so it doesn't make sense to return 200" (500)
The contract to the user is that
- if they successfully authenticate once then they can successfully authenticate with these credentials until something (not ES internal) changes the password.
- If they get a 401 then the password is wrong and they should not try it again.
but maybe we're thinking this too much from the user's perspective? There is no mandate that a 401 is permanent or that the user should not retry the request etc. We were trying to use this to signal our logic.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We can simplify this very much by just returning 401 in all of our cases too. What we were trying to capture was a difference between:
- "we checked, this is the wrong password - not the one in the keystore" (401)
- "we checked, this is the wrong password - it's the one in the local keystore but something else has set it on the index" (401)
- "we checked, this is the correct password but we can't persist it and some other node might set it to something else later, so next time you call it might be a 401 so it doesn't make sense to return 200" (500)
Thanks for writing this down!
I think this captures the intent when we set about to code this.
On point 2) above. That something else that already set the password could be an identical process racing to set the password after a successful validation of the hash in the keystore. The racing process could be on the same node, writing the same password hash in the index, or on a different node, potentially writing a different one, or it can even be a password change by the user through the API or the cmd line tool. In this general case (without knowing who set the elastic user in the index in the meantime) returning 401 or 200 are both wrong (the only two error codes we can currently return for authn failures). Authentication has to be retried, given the new state where there is an elastic user in the .security index (which would make authn validate that instead of the keystore value).
We either retry it internally or we signal it to the client to retry it themself.
Point 3) sounds right to me, apart that maybe we should return 503. But I missed that there are a bunch of other error conditions that should probably be 503 and are now 401, eg shards unavailable for the .security index (though those are "read" ops and we're going to return 503 for "write" ones).
Overall, after this conversation, I still think returning 503 if the "promised" password cannot be persisted (because the index or document cannot be created, or because they have been created in the meantime by something else) is the best course of action, because this "write" op during authentication is the only exception (which is not the approach this PR implements, FWIW).
I can also concede to returning 401 as well, given that the other "reads" that fail for system reasons also return 401.
|
|
||
| private void getAutoconfiguredOrDefaultUser(final String username, SecureString credentials, | ||
| ActionListener<ReservedUserInfo> listener) { | ||
| if (autoconfigured && username.equals(ElasticUser.NAME) && bootstrapUserInfo.verifyPassword(credentials)) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This condition here makes it so that the auto configured elastic password is actually checked twice (second time in ReservedRealm#doAuthenticate) when wrong.
Not a big issue in practice, but it shows that the control flow is not polished.
|
|
||
|
|
||
| private void getUserInfo(final String username, ActionListener<ReservedUserInfo> listener) { | ||
| private void getUserInfo(final String username, @Nullable SecureString credentials, ActionListener<ReservedUserInfo> listener) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I still don't like that a "get" does a "set", and that it takes a credentials parameter for it, which can be null but then the callees can't deal with a null parameter: this code will fail the lookup of the elastic user if it hasn't been created yet.
|
Closed in favor of #78306 . |
In package installations, we will be generating the password of
the elastic user on installation and we will be storing the hash
of it to the elasticsearch.keystore.
This change will
time of the first successful authentication using the
the "promised" password
autoconfiguration.password_hash setting and proceed with the normal
authentication path
autoconfiguration.password_hash setting and proceed with the normal
authentication path
HTTP_SERVER_ERROR to indicate that the cluster is probably not ready yet
Resolves: #75704