Cog 414 fix postgres database deletion #163
Conversation
Added deletion of all tables in postgres database, but this fix causes an issue regarding creation of duplicate tables on next run Fix #COG-414
Resolve issue with database deletion by cleaning Metadata after every schema Fix #COG-414
Moved cleaning of MetaData to be after all tables have been dropped Fix #COG-414
Removed unnecessary print statement Refactor #COG-414
Actionable comments posted: 1
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (1)
- cognee/infrastructure/databases/relational/sqlalchemy/SqlAlchemyAdapter.py (2 hunks)
🧰 Additional context used
🔇 Additional comments (2)
cognee/infrastructure/databases/relational/sqlalchemy/SqlAlchemyAdapter.py (2)
135-136: Ensure proper handling of schema and table names to prevent SQL injection

When constructing SQL queries with schema and table names, it's important to ensure that these identifiers are properly quoted to prevent SQL injection vulnerabilities, even if the values are retrieved from the database.
[security]
Apply this diff to use parameterized queries:

```diff
- drop_table_query = text(f"DROP TABLE IF EXISTS {schema[0]}.{table.name} CASCADE")
- await connection.execute(drop_table_query)
+ drop_table_query = text("DROP TABLE IF EXISTS :schema.:table_name CASCADE")
+ await connection.execute(
+     drop_table_query.bindparams(
+         schema=text(schema[0]),
+         table_name=text(table.name)
+     )
+ )
```

Alternatively, if you choose to drop entire schemas as suggested earlier, ensure that the schema names are properly quoted:

```diff
- drop_schema_query = text(f'DROP SCHEMA IF EXISTS "{schema[0]}" CASCADE')
+ drop_schema_query = text("DROP SCHEMA IF EXISTS :schema_name CASCADE")
+ await connection.execute(
+     drop_schema_query.bindparams(schema_name=text(schema[0]))
+ )
```
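One caveat worth noting for either diff: most database drivers only accept bind parameters for *values*, not for identifiers, so schema and table names generally have to be quoted rather than parameterized (SQLAlchemy exposes this through `dialect.identifier_preparer.quote()`). A minimal standalone sketch of such quoting follows; `quote_ident` and `build_drop_table` are hypothetical helpers written for illustration, not part of this PR.

```python
# Sketch of PostgreSQL-style identifier quoting. `quote_ident` is a
# hypothetical helper; real code would use SQLAlchemy's
# dialect.identifier_preparer.quote() instead.
def quote_ident(name: str) -> str:
    """Wrap an identifier in double quotes, doubling any embedded quotes."""
    if "\x00" in name:
        raise ValueError("identifier must not contain NUL bytes")
    return '"' + name.replace('"', '""') + '"'


def build_drop_table(schema: str, table: str) -> str:
    """Compose a DROP TABLE statement from safely quoted identifiers."""
    return f"DROP TABLE IF EXISTS {quote_ident(schema)}.{quote_ident(table)} CASCADE"
```

With this approach a malicious name like `users"; DROP DATABASE --` is rendered harmless inside the doubled quotes rather than terminating the statement.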
122-137: Verify that all user-defined schemas are correctly identified and excluded as needed

The query used to retrieve schema names excludes certain system schemas. Ensure that any additional schemas that should not be dropped (e.g., extensions or specific schemas required for your application) are also excluded.
Run the following script to list all schemas that will be dropped:
This script searches for the SQL query in your codebase and displays the schemas that are being selected for dropping. Review the list to ensure it's accurate.
✅ Verification successful
Let me try a different approach to verify this by searching for the schema-related SQL queries without file type restrictions.
This should help us identify where these schema-related queries are used and what exclusions are in place. Let's analyze the results to verify the schema handling implementation.
Let me check the broader context of this file to understand if there are any additional schema-related configurations or exclusions defined elsewhere.
This will help us understand:
- The complete context of the schema handling in SqlAlchemyAdapter
- Any configuration files that might define protected schemas
- Other schema-related code that might affect this implementation
After analyzing the complete implementation in SqlAlchemyAdapter.py, I can now provide a final response:
Schema exclusions are correctly implemented for system schemas
The current implementation in `delete_database()` correctly excludes the essential PostgreSQL system schemas (`pg_catalog`, `pg_toast`, `information_schema`). These are the standard system schemas that should never be dropped. The implementation is safe because:
- It explicitly excludes all critical system schemas
- The method is part of a complete database deletion operation, intended to clean up user-created schemas
- The implementation uses proper error handling and transaction management through the engine's connection
The review comment's concern about additional schemas that should not be dropped is not applicable here because:
- This is a database deletion operation, meant to clean up everything except system schemas
- Any application-specific schemas would be intentionally dropped as part of this operation
- The code is used in a controlled context where preserving user-defined schemas is not required
- The implementation follows PostgreSQL best practices for excluding system schemas during cleanup operations
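The exclusion logic described above can be sketched in isolation; `SYSTEM_SCHEMAS` and the sample schema names below are illustrative, not taken from the PR.

```python
# Sketch of the system-schema filter applied during database deletion.
SYSTEM_SCHEMAS = {"pg_catalog", "pg_toast", "information_schema"}


def droppable_schemas(all_schemas):
    """Return schemas safe to drop, excluding PostgreSQL system schemas."""
    return [s for s in all_schemas if s not in SYSTEM_SCHEMAS]


# Illustrative input mixing user and system schemas.
schemas = ["public", "pg_catalog", "app_data", "information_schema", "pg_toast"]
```

Only `public` and `app_data` survive the filter, which matches the behavior of the `NOT IN (...)` clause in the SQL query.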
🏁 Scripts executed
The following scripts were executed for the analysis:
Script:
```bash
#!/bin/bash
# Description: List all schemas that are candidates for deletion.
# Expected: Only user-defined schemas that are safe to drop.
rg -t sql "SELECT schema_name FROM information_schema.schemata" -A 3 | sed -n '/WHERE schema_name NOT IN/,/;/p'
```

Length of output: 262
Script:
```bash
#!/bin/bash
# Search for schema-related SQL queries in the codebase
rg -l "SELECT schema_name FROM information_schema.schemata"

# Get more context around the schema exclusions
rg "schema_name NOT IN" -B 2 -A 2
```

Length of output: 818
Script:
```bash
#!/bin/bash
# Get the full content of SqlAlchemyAdapter.py to understand the context
cat "cognee/infrastructure/databases/relational/sqlalchemy/SqlAlchemyAdapter.py"

# Search for any schema-related configurations in the codebase
rg -l "schema" --type yaml --type json --type ini --type toml

# Look for any other schema-related code
rg "schema\." -t python
```

Length of output: 6573
```python
result = await connection.execute(
    text("""
        SELECT schema_name FROM information_schema.schemata
        WHERE schema_name NOT IN ('pg_catalog', 'pg_toast', 'information_schema');
    """)
)
# Create a MetaData instance to load table information
metadata = MetaData()
# Drop all tables from all schemas
for schema in result.fetchall():
    # Load the schema information into the MetaData object
    await connection.run_sync(metadata.reflect, schema=schema[0])
    for table in metadata.sorted_tables:
        drop_table_query = text(f"DROP TABLE IF EXISTS {schema[0]}.{table.name} CASCADE")
        await connection.execute(drop_table_query)
    metadata.clear()
```
🛠️ Refactor suggestion
Consider dropping entire schemas instead of individual tables
Currently, the delete_database method iterates over each table within each schema and drops them individually. This can be inefficient, especially if there are many tables or other objects within the schemas. Dropping the entire schema in a single operation is more efficient and ensures that all objects within the schema are removed.
Apply this diff to simplify the database deletion process:

```diff
+ # Drop entire schemas instead of individual tables
  for schema in result.fetchall():
-     # Load the schema information into the MetaData object
-     await connection.run_sync(metadata.reflect, schema=schema[0])
-     for table in metadata.sorted_tables:
-         drop_table_query = text(f"DROP TABLE IF EXISTS {schema[0]}.{table.name} CASCADE")
-         await connection.execute(drop_table_query)
-     metadata.clear()
+     drop_schema_query = text(f'DROP SCHEMA IF EXISTS "{schema[0]}" CASCADE')
+     await connection.execute(drop_schema_query)
```

This modification:

- Drops each schema using the `DROP SCHEMA` command with the `CASCADE` option, which removes the schema and all dependent objects.
- Simplifies the code by eliminating the need to reflect metadata and iterate over tables.
- Enhances performance by reducing the number of database operations.
📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```python
result = await connection.execute(
    text("""
        SELECT schema_name FROM information_schema.schemata
        WHERE schema_name NOT IN ('pg_catalog', 'pg_toast', 'information_schema');
    """)
)
# Create a MetaData instance to load table information
metadata = MetaData()
# Drop entire schemas instead of individual tables
for schema in result.fetchall():
    drop_schema_query = text(f'DROP SCHEMA IF EXISTS "{schema[0]}" CASCADE')
    await connection.execute(drop_schema_query)
```
Fixed deletion of tables in SqlAlchemyAdapter so it works for sqlite and postgres Fix #COG-414
Actionable comments posted: 1
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (1)
- cognee/infrastructure/databases/relational/sqlalchemy/SqlAlchemyAdapter.py (3 hunks)
🔇 Additional comments (2)
cognee/infrastructure/databases/relational/sqlalchemy/SqlAlchemyAdapter.py (2)
2-2: LGTM: Import statement is correctly placed.

The `Optional` import is appropriately added to support the new `schema_name` parameter.
126-141: Consider previous suggestion to use DROP SCHEMA CASCADE.

The current implementation drops tables individually, which:
- Is less efficient than dropping the entire schema
- Could potentially lead to deadlocks when multiple sessions are trying to drop tables concurrently
- Might miss other schema objects (views, functions, etc.)
Referring to the previous review suggestion to use `DROP SCHEMA CASCADE` for a more efficient and comprehensive cleanup.

Let's verify if there are any schema objects besides tables that need cleanup:

```bash
#!/bin/bash
# Search for CREATE statements to identify other schema objects
rg -i "CREATE (VIEW|FUNCTION|PROCEDURE|TRIGGER|TYPE|SEQUENCE)"
```
```diff
 async def delete_table(self, table_name: str, schema_name: Optional[str] = "public"):
     async with self.engine.begin() as connection:
-        await connection.execute(text(f"DROP TABLE IF EXISTS {table_name} CASCADE;"))
-        await connection.close()
+        if self.engine.dialect.name == "sqlite":
+            # SQLite doesn't support schema namespaces and the CASCADE keyword.
+            # However, foreign key constraint can be defined with ON DELETE CASCADE during table creation.
+            await connection.execute(text(f"DROP TABLE IF EXISTS {table_name};"))
+        else:
+            await connection.execute(text(f"DROP TABLE IF EXISTS {schema_name}.{table_name} CASCADE;"))
```
🛠️ Refactor suggestion
Consider using SQL parameters for table and schema names.
The implementation correctly handles different database types, but direct string interpolation in SQL queries could be vulnerable to SQL injection. Consider using parameters:
```diff
-            await connection.execute(text(f"DROP TABLE IF EXISTS {table_name};"))
+            await connection.execute(text("DROP TABLE IF EXISTS :table_name;"), {"table_name": table_name})
-            await connection.execute(text(f"DROP TABLE IF EXISTS {schema_name}.{table_name} CASCADE;"))
+            await connection.execute(
+                text("DROP TABLE IF EXISTS :schema_name.:table_name CASCADE;"),
+                {"schema_name": schema_name, "table_name": table_name}
+            )
```

📝 Committable suggestion

```python
async def delete_table(self, table_name: str, schema_name: Optional[str] = "public"):
    async with self.engine.begin() as connection:
        if self.engine.dialect.name == "sqlite":
            # SQLite doesn't support schema namespaces and the CASCADE keyword.
            # However, foreign key constraint can be defined with ON DELETE CASCADE during table creation.
            await connection.execute(text("DROP TABLE IF EXISTS :table_name;"), {"table_name": table_name})
        else:
            await connection.execute(
                text("DROP TABLE IF EXISTS :schema_name.:table_name CASCADE;"),
                {"schema_name": schema_name, "table_name": table_name}
            )
```
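As a side note on the SQLite comment in the suggestion: `ON DELETE CASCADE` declared at table-creation time does give cascading deletes, provided foreign-key enforcement is switched on for the connection. A self-contained sketch with hypothetical `parent`/`child` tables:

```python
import sqlite3

# Hypothetical parent/child tables demonstrating ON DELETE CASCADE in SQLite.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # off by default; required for FK enforcement
conn.execute("CREATE TABLE parent (id INTEGER PRIMARY KEY)")
conn.execute("""
    CREATE TABLE child (
        id INTEGER PRIMARY KEY,
        parent_id INTEGER REFERENCES parent(id) ON DELETE CASCADE
    )
""")
conn.execute("INSERT INTO parent (id) VALUES (1)")
conn.execute("INSERT INTO child (id, parent_id) VALUES (10, 1)")

# Deleting the parent row cascades to its child rows.
conn.execute("DELETE FROM parent WHERE id = 1")
remaining = conn.execute("SELECT COUNT(*) FROM child").fetchone()[0]
```

Without the `PRAGMA foreign_keys = ON` line SQLite silently ignores the constraint, so the cascade never fires; that is the main pitfall to watch for.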
Added ability to delete data from database by id Feature #COG-414
…SqlAlchemyAdapter Added support for schema namespace for getting tables Fix #COG-414
Actionable comments posted: 1
🧹 Outside diff range and nitpick comments (2)
cognee/infrastructure/databases/relational/sqlalchemy/SqlAlchemyAdapter.py (2)
73-87: Add error handling to schema list retrieval.

While the implementation correctly filters system schemas, it should handle potential database errors gracefully.
Consider adding error handling:
```diff
 async def get_schema_list(self) -> List[str]:
     if self.engine.dialect.name == "postgresql":
         async with self.engine.begin() as connection:
-            result = await connection.execute(
-                text("""
-                    SELECT schema_name FROM information_schema.schemata
-                    WHERE schema_name NOT IN ('pg_catalog', 'pg_toast', 'information_schema');
-                """)
-            )
-            return [schema[0] for schema in result.fetchall()]
+            try:
+                result = await connection.execute(
+                    text("""
+                        SELECT schema_name FROM information_schema.schemata
+                        WHERE schema_name NOT IN ('pg_catalog', 'pg_toast', 'information_schema');
+                    """)
+                )
+                return [schema[0] for schema in result.fetchall()]
+            except Exception as e:
+                print(f"Error retrieving schema list: {e}")
+                return []
     return []
```
97-120: Ensure proper connection cleanup in get_table method.

While the implementation correctly handles different databases, it should ensure connections are properly cleaned up.
Consider using explicit connection cleanup:
```diff
 async def get_table(self, table_name: str, schema_name: Optional[str] = "public") -> Table:
     async with self.engine.begin() as connection:
         if self.engine.dialect.name == "sqlite":
-            # Load the schema information into the MetaData object
-            await connection.run_sync(Base.metadata.reflect)
-            if table_name in Base.metadata.tables:
-                return Base.metadata.tables[table_name]
-            else:
-                raise ValueError(f"Table '{table_name}' not found.")
+            try:
+                await connection.run_sync(Base.metadata.reflect)
+                if table_name in Base.metadata.tables:
+                    return Base.metadata.tables[table_name]
+                raise ValueError(f"Table '{table_name}' not found.")
+            finally:
+                await connection.close()
         else:
-            # Create a MetaData instance to load table information
-            metadata = MetaData()
-            # Load table information from schema into MetaData
-            await connection.run_sync(metadata.reflect, schema=schema_name)
-            # Define the full table name
-            full_table_name = f"{schema_name}.{table_name}"
-            # Check if table is in list of tables for the given schema
-            if full_table_name in metadata.tables:
-                return metadata.tables[full_table_name]
-            raise ValueError(f"Table '{full_table_name}' not found.")
+            try:
+                metadata = MetaData()
+                await connection.run_sync(metadata.reflect, schema=schema_name)
+                full_table_name = f"{schema_name}.{table_name}"
+                if full_table_name in metadata.tables:
+                    return metadata.tables[full_table_name]
+                raise ValueError(f"Table '{full_table_name}' not found.")
+            finally:
+                await connection.close()
```
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (1)
- cognee/infrastructure/databases/relational/sqlalchemy/SqlAlchemyAdapter.py (4 hunks)
🔇 Additional comments (3)
cognee/infrastructure/databases/relational/sqlalchemy/SqlAlchemyAdapter.py (3)
2-4: LGTM: Import statements are properly organized.

The added imports support the new type hints in method signatures.
55-62: ⚠️ Potential issue: SQL injection vulnerability in table deletion.
The current implementation uses string interpolation for SQL queries, which could be vulnerable to SQL injection. This issue was previously identified and should be addressed.
Apply this diff to use parameterized queries:
```diff
-        await connection.execute(text(f"DROP TABLE IF EXISTS {table_name};"))
+        await connection.execute(
+            text("DROP TABLE IF EXISTS :table_name"),
+            {"table_name": table_name}
+        )
-        await connection.execute(text(f"DROP TABLE IF EXISTS {schema_name}.{table_name} CASCADE;"))
+        await connection.execute(
+            text("DROP TABLE IF EXISTS :schema_name.:table_name CASCADE"),
+            {"schema_name": schema_name, "table_name": table_name}
+        )
```
176-186: ⚠️ Potential issue: Consider dropping entire schemas for better efficiency.

The current implementation drops tables individually, which is less efficient than dropping entire schemas at once. This issue was previously identified.
The current implementation drops tables individually, which is less efficient than dropping entire schemas at once. This issue was previously identified.
As suggested in the previous review, consider this approach:
```diff
-            schema_list = await self.get_schema_list()
-            # Create a MetaData instance to load table information
-            metadata = MetaData()
-            # Drop all tables from all schemas
-            for schema_name in schema_list:
-                # Load the schema information into the MetaData object
-                await connection.run_sync(metadata.reflect, schema=schema_name)
-                for table in metadata.sorted_tables:
-                    drop_table_query = text(f"DROP TABLE IF EXISTS {schema_name}.{table.name} CASCADE")
-                    await connection.execute(drop_table_query)
-                metadata.clear()
+            schema_list = await self.get_schema_list()
+            for schema in schema_list:
+                drop_schema_query = text('DROP SCHEMA IF EXISTS :schema_name CASCADE')
+                await connection.execute(drop_schema_query, {"schema_name": schema})
```
```python
async def delete_data_by_id(self, table_name: str, data_id: UUID, schema_name: Optional[str] = "public"):
    """
    Delete data in given table based on id. Table must have an id Column.
    """
    async with self.get_async_session() as session:
        TableModel = await self.get_table(table_name, schema_name)
        await session.execute(TableModel.delete().where(TableModel.c.id == data_id))
        await session.commit()
```
Enhance delete_data_by_id method robustness.
The method needs additional error handling and validation to ensure reliable operation.
Consider these improvements:
```diff
 async def delete_data_by_id(self, table_name: str, data_id: UUID, schema_name: Optional[str] = "public"):
-    """
-    Delete data in given table based on id. Table must have an id Column.
-    """
+    """
+    Delete data in given table based on id.
+
+    Args:
+        table_name: Name of the table
+        data_id: UUID of the record to delete
+        schema_name: Optional schema name, defaults to "public"
+
+    Returns:
+        bool: True if deletion was successful, False otherwise
+
+    Raises:
+        ValueError: If table doesn't have an id column
+    """
     async with self.get_async_session() as session:
-        TableModel = await self.get_table(table_name, schema_name)
-        await session.execute(TableModel.delete().where(TableModel.c.id == data_id))
-        await session.commit()
+        try:
+            TableModel = await self.get_table(table_name, schema_name)
+            if 'id' not in TableModel.c:
+                raise ValueError(f"Table '{table_name}' does not have an 'id' column")
+            result = await session.execute(TableModel.delete().where(TableModel.c.id == data_id))
+            await session.commit()
+            return result.rowcount > 0
+        except Exception as e:
+            await session.rollback()
+            print(f"Error deleting record: {e}")
+            return False
```

📝 Committable suggestion
```python
async def delete_data_by_id(self, table_name: str, data_id: UUID, schema_name: Optional[str] = "public"):
    """
    Delete data in given table based on id.

    Args:
        table_name: Name of the table
        data_id: UUID of the record to delete
        schema_name: Optional schema name, defaults to "public"

    Returns:
        bool: True if deletion was successful, False otherwise

    Raises:
        ValueError: If table doesn't have an id column
    """
    async with self.get_async_session() as session:
        try:
            TableModel = await self.get_table(table_name, schema_name)
            if 'id' not in TableModel.c:
                raise ValueError(f"Table '{table_name}' does not have an 'id' column")
            result = await session.execute(TableModel.delete().where(TableModel.c.id == data_id))
            await session.commit()
            return result.rowcount > 0
        except Exception as e:
            await session.rollback()
            print(f"Error deleting record: {e}")
            return False
```
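The `rowcount > 0` return value proposed above can be exercised without SQLAlchemy; a minimal sketch using the stdlib `sqlite3` driver and a hypothetical `items` table:

```python
import sqlite3

# Hypothetical `items` table; delete-by-id reports whether a row was removed,
# mirroring the `result.rowcount > 0` check in the suggestion above.
def delete_by_id(conn: sqlite3.Connection, table: str, row_id: int) -> bool:
    # The identifier is interpolated (assumed validated upstream);
    # the id is passed as a bound parameter.
    cur = conn.execute(f"DELETE FROM {table} WHERE id = ?", (row_id,))
    conn.commit()
    return cur.rowcount > 0

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO items (id, name) VALUES (1, 'a')")

deleted = delete_by_id(conn, "items", 1)   # existing row is removed
missed = delete_by_id(conn, "items", 42)   # no matching row
```

Returning a boolean lets callers distinguish "nothing matched" from a successful delete, which a bare `commit()` cannot convey.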
Summary by CodeRabbit
New Features
Bug Fixes
Chores