Thorough rewrite and optimization of integration tests #82

hemidactylus · 2024-09-26T15:09:50Z

This PR completely restructures the integration tests for this project.

This was a much-needed improvement: the test suite had grown in a scarcely-controlled way, becoming bloated, very long-lasting and unstable (mostly due to the unnecessarily high number of collections being created and dropped during testing). Additionally, some of the tests were using obsolete constructs and duplicated assets.

The gist

This PR implements an extremely parsimonious usage of collections thanks to a hierarchy of fixtures (and ample usage of setup_mode). Whenever possible, collections are simply truncated and re-used between tests. Some collections, which remain unavoidably per-test-function, are marked as such ("ephemeral..."). This is the case mostly for "DDL-related" tests, for instance when testing deprecations and errors due to indexing mismatches and such. For the case of vector-store tests, these are grouped in a separate module to skip them easily while developing.

Numbers

The general idea is that there should never be more than 3 collections in the database, at any given time. (nevertheless, the collection names involved are way more, to avoid running into metadata-cache issues when dropping and re-creating them.)

Before this PR, the whole suite took about 40+ minutes to run (... and often fail). Now it takes 13 minutes. Specifically, the vector store tests alone went from 19 minutes down to about 7.

Additional notes

The setup_mode is tested scarcely in itself (perhaps the only "loss of coverage" of this PR); however, it is implicitly tested essentially in all collection-related fixtures.
Care has been taken to ensure that the test suite can run smoothly on DSE/HCD (with the obvious exception of the "core clients warning" tests which are skipped in that case).
The Graph Vector Store tests, in the autodetect (flat-document!) case, are made to use specific collections, to ensure that the metadata keys encoding the graph structure are indexed. This is a corner case to keep in mind also outside of testing: in actual usage, when promoting an "autodetect(flat) vector store" to a graph store, it is responsibility of the user to ensure that the collection's indexing settings allow for such promotion.
the test module for the two cache objects (regular and semantic) have been split in two modules.

cbornet · 2024-09-27T08:21:00Z

libs/astradb/tests/integration_tests/conftest.py

+
+@pytest.fixture
+def vector_store_d2(
+    empty_collection_d2: Collection,  # noqa: ARG001


NIT: Is it possible to use @pytest.mark.usefixtures("empty_collection_d2") and get rid of the noqa ?

Good catch - filing for a later improvement since it's merged now (saw the comment only now, sorry)

I merged since it was NIT.
I'm doing a PR rn.

Stefano Lottini added 16 commits September 24, 2024 00:37

complete removal of SomeEmbeddings for tests

88fab28

test_vectorstore_autodetect brought to rationality

ce1bb9e

test_graphvectorstore is nice now

00db524

halfway through test_vectorstore.py

6af1232

tests of from_ methods of vectorstore are now good

2cae6fc

most test_vectorstore done; missing only indexing and coreclients_init

0a4dc15

all of graph/vectorstores brought to order

767cce7

graph/vstore tests mostly hcd-compatible (wip)

5f445ae

wip on fixing the hcd/apikey header thing

153583e

completed rewrite of tests for graph/vectorstores

f686979

chat message histories tested nicely

959d03e

deep restructuring of the caches testing

69e9f68

test_document_loaders under control

21a5fae

further improvement test document loader

b3a6682

with test_storage it seems everything is done now.

8d00948

tiny docstr edit

a201567

hemidactylus requested review from cbornet and kerinin September 26, 2024 15:09

Stefano Lottini added 2 commits September 26, 2024 17:24

make openai key into a fixture to heal compile test

12c2c9c

clean info on IT prereqs

e789540

hemidactylus mentioned this pull request Sep 26, 2024

astradb[minor]: update dependencies for compatibility with langchain-core 0.3 #71

Merged

cbornet reviewed Sep 27, 2024

View reviewed changes

cbornet self-requested a review September 27, 2024 08:22

cbornet approved these changes Sep 27, 2024

View reviewed changes

cbornet merged commit 3eb73d7 into main Sep 27, 2024
13 checks passed

cbornet deleted the SL-optimize-ci branch September 27, 2024 08:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Thorough rewrite and optimization of integration tests #82

Thorough rewrite and optimization of integration tests #82

Uh oh!

hemidactylus commented Sep 26, 2024

Uh oh!

cbornet Sep 27, 2024 •

edited

Loading

Uh oh!

hemidactylus Sep 30, 2024

Uh oh!

cbornet Sep 30, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Thorough rewrite and optimization of integration tests #82

Thorough rewrite and optimization of integration tests #82

Uh oh!

Conversation

hemidactylus commented Sep 26, 2024

The gist

Numbers

Additional notes

Uh oh!

cbornet Sep 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hemidactylus Sep 30, 2024

Choose a reason for hiding this comment

Uh oh!

cbornet Sep 30, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cbornet Sep 27, 2024 •

edited

Loading