fixed db seed error by dwnoble · Pull Request #2014 · datacommonsorg/data

dwnoble · 2026-05-16T00:11:24Z

This pull request updates the database seeding logic and its corresponding test to add two new fields, name and value, to the seeded Node table entries. The changes ensure that these fields are populated with appropriate values during the seeding process and that the tests verify their correctness.

gemini-code-assist

Code Review

This pull request updates the seed_database function in spanner_client.py to include name and value columns when inserting seed nodes into the Spanner Node table. Corresponding unit tests in spanner_client_test.py were updated to verify these new fields. The review feedback suggests refactoring the candidates dictionary using a comprehension to reduce string redundancy and improving the test assertions by using a loop for better readability and efficiency.

gemini-code-assist · 2026-05-16T00:24:44Z

            candidates = {
-                "StatisticalVariable": ["StatisticalVariable", ["Class"], spanner.COMMIT_TIMESTAMP],
-                "StatVarGroup": ["StatVarGroup", ["Class"], spanner.COMMIT_TIMESTAMP],
-                "StatVarObservation": ["StatVarObservation", ["Class"], spanner.COMMIT_TIMESTAMP],
-                "Topic": ["Topic", ["Class"], spanner.COMMIT_TIMESTAMP],
-                "c/g/Root": ["c/g/Root", ["StatVarGroup"], spanner.COMMIT_TIMESTAMP],
+                "StatisticalVariable": ["StatisticalVariable", "StatisticalVariable", "StatisticalVariable", ["Class"], spanner.COMMIT_TIMESTAMP],
+                "StatVarGroup": ["StatVarGroup", "StatVarGroup", "StatVarGroup", ["Class"], spanner.COMMIT_TIMESTAMP],
+                "StatVarObservation": ["StatVarObservation", "StatVarObservation", "StatVarObservation", ["Class"], spanner.COMMIT_TIMESTAMP],
+                "Topic": ["Topic", "Topic", "Topic", ["Class"], spanner.COMMIT_TIMESTAMP],
+                "c/g/Root": ["c/g/Root", "c/g/Root", "c/g/Root", ["StatVarGroup"], spanner.COMMIT_TIMESTAMP],
            }


The candidates dictionary contains a lot of repeated strings. To improve maintainability and reduce redundancy, you could generate this dictionary programmatically. This would make it easier to add or modify fields in the future.

By defining the unique parts (like the types) separately and then building the full dictionary with a comprehension, the code becomes cleaner and less error-prone for future modifications.

Suggested change

-            candidates = {
-                "StatisticalVariable": ["StatisticalVariable", "StatisticalVariable", "StatisticalVariable", ["Class"], spanner.COMMIT_TIMESTAMP],
-                "StatVarGroup": ["StatVarGroup", "StatVarGroup", "StatVarGroup", ["Class"], spanner.COMMIT_TIMESTAMP],
-                "StatVarObservation": ["StatVarObservation", "StatVarObservation", "StatVarObservation", ["Class"], spanner.COMMIT_TIMESTAMP],
-                "Topic": ["Topic", "Topic", "Topic", ["Class"], spanner.COMMIT_TIMESTAMP],
-                "c/g/Root": ["c/g/Root", "c/g/Root", "c/g/Root", ["StatVarGroup"], spanner.COMMIT_TIMESTAMP],
-            }
+            base_nodes = {
+                "StatisticalVariable": ["Class"],
+                "StatVarGroup": ["Class"],
+                "StatVarObservation": ["Class"],
+                "Topic": ["Class"],
+                "c/g/Root": ["StatVarGroup"],
+            }
+            candidates = {
+                subj: [subj, subj, subj, types, spanner.COMMIT_TIMESTAMP]
+                for subj, types in base_nodes.items()
+            }
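The comprehension suggested above can be tried in isolation. The following is a minimal sketch, not the code in spanner_client.py: `COMMIT_TIMESTAMP` is a placeholder string standing in for `spanner.COMMIT_TIMESTAMP` so the snippet runs without the Spanner client installed, and `build_candidates` is a hypothetical helper name that does not appear in the PR.

```python
# Placeholder standing in for spanner.COMMIT_TIMESTAMP (assumption, so the
# sketch runs without google-cloud-spanner installed).
COMMIT_TIMESTAMP = "<commit-timestamp>"

# Only the parts that differ between rows: subject id -> types.
BASE_NODES = {
    "StatisticalVariable": ["Class"],
    "StatVarGroup": ["Class"],
    "StatVarObservation": ["Class"],
    "Topic": ["Class"],
    "c/g/Root": ["StatVarGroup"],
}


def build_candidates(base_nodes, commit_timestamp=COMMIT_TIMESTAMP):
    """Expand {subject: types} into rows of [subject, name, value, types, timestamp].

    Hypothetical helper: name and value are both set to the subject id,
    mirroring the rows shown in the diff.
    """
    return {
        subj: [subj, subj, subj, types, commit_timestamp]
        for subj, types in base_nodes.items()
    }


candidates = build_candidates(BASE_NODES)
```

With this shape, adding another seeded column means changing one row template rather than five literal rows.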

gemini-code-assist · 2026-05-16T00:24:44Z

        actual_subjects = [val[0] for val in kwargs['values']]
+        actual_names = [val[1] for val in kwargs['values']]
+        actual_values = [val[2] for val in kwargs['values']]
        self.assertEqual(actual_subjects, expected_subjects)
+        self.assertEqual(actual_names, expected_subjects)
+        self.assertEqual(actual_values, expected_subjects)


These assertions can be made more concise and robust by iterating through the values and checking the properties for each row within a loop. This makes the test easier to read and extend if more fields need to be checked against the subject_id in the future. It also avoids iterating over the list of values multiple times.

Suggested change

-        actual_subjects = [val[0] for val in kwargs['values']]
-        actual_names = [val[1] for val in kwargs['values']]
-        actual_values = [val[2] for val in kwargs['values']]
-        self.assertEqual(actual_subjects, expected_subjects)
-        self.assertEqual(actual_names, expected_subjects)
-        self.assertEqual(actual_values, expected_subjects)
+        actual_subjects = []
+        for val in kwargs['values']:
+            subject_id, name, value, _, _ = val
+            self.assertEqual(name, subject_id)
+            self.assertEqual(value, subject_id)
+            actual_subjects.append(subject_id)
+        self.assertEqual(actual_subjects, expected_subjects)
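The loop-based assertions can be exercised against a mock in a self-contained way. This is a sketch under assumptions, not the test in spanner_client_test.py: `seed` is a hypothetical stand-in for `seed_database`, the `"<commit-timestamp>"` string replaces `spanner.COMMIT_TIMESTAMP`, and the mocked `insert` call is given only `table` and `values` keyword arguments for brevity.

```python
import unittest
from unittest import mock


def seed(batch):
    """Hypothetical stand-in for seed_database: inserts two seed rows."""
    subjects = ["StatisticalVariable", "Topic"]
    # Row layout assumed from the diff: [subject_id, name, value, types, timestamp].
    rows = [[s, s, s, ["Class"], "<commit-timestamp>"] for s in subjects]
    batch.insert(table="Node", values=rows)


class SeedRowsTest(unittest.TestCase):
    def test_name_and_value_match_subject(self):
        batch = mock.Mock()
        seed(batch)

        # call_args unpacks into (positional args, keyword args).
        _, kwargs = batch.insert.call_args
        expected_subjects = ["StatisticalVariable", "Topic"]

        actual_subjects = []
        for subject_id, name, value, _types, _timestamp in kwargs["values"]:
            # Each row is checked once, in a single pass over the values.
            self.assertEqual(name, subject_id)
            self.assertEqual(value, subject_id)
            actual_subjects.append(subject_id)
        self.assertEqual(actual_subjects, expected_subjects)


# Run the single test case programmatically and keep the result object.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(SeedRowsTest)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

Unpacking each row into five names also makes the test fail loudly if the row width changes, which the index-based version would not notice.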

fixed db seed error

bae47f4

dwnoble requested a review from clincoln8 May 16, 2026 00:11

dwnoble enabled auto-merge (squash) May 16, 2026 00:11

clincoln8 approved these changes May 16, 2026

dwnoble merged commit 547de2d into datacommonsorg:master May 16, 2026
9 checks passed

gemini-code-assist Bot reviewed May 16, 2026

dwnoble merged 1 commit into datacommonsorg:master from dwnoble:seed-fixes


2 participants
