feat: implement Approximate Nearest Neighbor support for DDL (CREATE TABLE, CREATE VECTOR INDEX)#124
Conversation
dfeab0a to
2ef461a
Compare
36c552c to
64358ab
Compare
7e5279a to
44d0996
Compare
|
/gcbrun |
gauravpurohit06
left a comment
There was a problem hiding this comment.
Added two more comments, based on it... you also need to update function signature throughout the file not mentioning it explicitly.
a791690 to
8ea75e1
Compare
This change introduces new nox directives: * blacken: `nox -s blacken` * format: `nox -s format` to apply formatting to files * lint: `nox -s lint` to flag linting issues * unit: to run unit tests locally which are the basis to enable scalable development and continuous testing as I prepare to bring in Approximate Nearest Neighors (ANN) functionality into this package. Also while here, fixed a typo in the README.rst file that didn't have the correct import path.
This change adds ANN distance strategies for GoogleSQL semantics. While here started unit tests to effectively test out components without having to have a running Cloud Spanner instance. Updates googleapis#94
bc0b254 to
d654428
Compare
d654428 to
430d14d
Compare
0f2cd8e to
8be267d
Compare
b8948a3 to
66930b4
Compare
gauravpurohit06
left a comment
There was a problem hiding this comment.
@odeke-em, Other than granular comments there are inconsistencies in return type here through the implementation and also use of None keyword. Please correct it.
|
/gcbrun |
90220f1 to
5b8cfd3
Compare
5b8cfd3 to
130bc46
Compare
gauravpurohit06
left a comment
There was a problem hiding this comment.
in public apis... decide whether to use knn vs ann... Mention it explicitly what to do.
|
@gauravpurohit06 I humbly and highly recommend that we get this PR in as is, then I can send more given firstly how massive it is, I actually had to delete code out of other helpers to keep in manageable, plus your time constraint. Getting 90% in before you are out of the office goes a much longer way than our 5+ week long PR. Sending smaller PRs for the other parts is planned as I mentioned a couple of times offline and allows for much more effective testing too. |
|
/gcbrun |
|
@gauravpurohit06 kindly help me run gcbrun. |
|
/gcbrun |
This change adds ANN distance strategies for GoogleSQL semantics.
While here started unit tests to effectively test out components
without having to have a running Cloud Spanner instance.
Implements Data Definition Language (DDL) functionality for:
Updates #94