11

我正在使用烧瓶、sqlalchemy 和烧瓶-sqlalchemy。我想用 gin 和 to_tsvector 在 postgres 中创建一个完整的测试搜索索引。目前,我正在尝试以下方法。我认为它最接近我想要表达的东西,但不起作用。

from sqlalchemy.ext.declarative import declared_attr
from sqlalchemy.schema import Index
from sqlalchemy.sql.expression import func

from app import db


class Post(db.Model):

    id = db.Column(db.Integer, primary_key=True)
    added = db.Column(db.DateTime, nullable=False)
    pub_date = db.Column(db.DateTime, nullable=True)
    content = db.Column(db.Text)

    @declared_attr
    def __table_args__(cls):
        return (Index('idx_content', func.to_tsvector("english", "content"), postgresql_using="gin"), )

这会引发以下错误...

Traceback (most recent call last):
  File "./manage.py", line 5, in <module>
    from app import app, db
  File "/vagrant/app/__init__.py", line 36, in <module>
    from pep.models import *
  File "/vagrant/pep/models.py", line 8, in <module>
    class Post(db.Model):
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/flask_sqlalchemy.py", line 477, in __init__
    DeclarativeMeta.__init__(self, name, bases, d)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/ext/declarative/api.py", line 48, in __init__
    _as_declarative(cls, classname, cls.__dict__)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/ext/declarative/base.py", line 222, in _as_declarative
    **table_kw)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/schema.py", line 326, in __new__
    table._init(name, metadata, *args, **kw)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/schema.py", line 393, in _init
    self._init_items(*args)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/schema.py", line 63, in _init_items
    item._set_parent_with_dispatch(self)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/events.py", line 235, in _set_parent_with_dispatch
    self._set_parent(parent)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/schema.py", line 2321, in _set_parent
    ColumnCollectionMixin._set_parent(self, table)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/schema.py", line 1978, in _set_parent
    self.columns.add(col)
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/sql/expression.py", line 2391, in add
    self[column.key] = column
  File "/home/vagrant/.virtualenvs/pep/local/lib/python2.7/site-packages/sqlalchemy/sql/expression.py", line 2211, in __getattr__
    key)
AttributeError: Neither 'Function' object nor 'Comparator' object has an attribute 'key'

我也试过

return (Index('idx_content', "content", postgresql_using="gin"), )

但是,它不能作为 postgres (至少 9.1,因为这是我运行的)期望 to_tsvector 被调用。此行创建 SQL;

CREATE INDEX content_index ON post USING gin (content)

而不是我想要的;

CREATE INDEX content_index ON post USING gin(to_tsvector('english', content))

我打开了一张票,因为我认为这可能是一个错误/限制。http://www.sqlalchemy.org/trac/ticket/2605

4

3 回答 3

4

现在我已经添加了以下几行来手动完成,但如果有的话,我更喜欢“正确的”SQLAlchemy 方法。

create_index = DDL("CREATE INDEX idx_content ON pep USING gin(to_tsvector('english', content));")
event.listen(Pep.__table__, 'after_create', create_index.execute_if(dialect='postgresql'))

关于 SQLAlchemy 错误跟踪器有一些有趣的讨论。看起来这是当前索引定义的限制。基本上,我的要求是允许索引是表达式,而不仅仅是列名,但目前不支持。此票正在跟踪此功能请求:http ://www.sqlalchemy.org/trac/ticket/695 。但是,这正在等待开发人员采取行动并完成工作(并且已经有一段时间了)。

于 2012-11-13T13:14:10.243 回答
3

在我创建一些单列和多列 tsvector GIN 索引时遇到了这个老问题。对于正在寻找一种使用列名的字符串表示来创建这些索引的简单方法的任何人,这里是使用 SQLAlchemytext()构造的一种方法。

from sqlalchemy import Column, Index, Integer, String, text
from sqlalchemy.ext.declarative import declarative_base
from sqlalchemy.sql import func


Base = declarative_base()

def to_tsvector_ix(*columns):
    s = " || ' ' || ".join(columns)
    return func.to_tsvector('english', text(s))

class Example(Base):
    __tablename__ = 'examples'

    id = Column(Integer, primary_key=True)
    atext = Column(String)
    btext = Column(String)

    __table_args__ = (
        Index(
            'ix_examples_tsv',
            to_tsvector_ix('atext', 'btext'),
            postgresql_using='gin'
            ),
        )
于 2019-02-04T22:13:48.100 回答
1

因此,在 sqlalchemy 0.9 及更高版本中,这是可行的:

class Content(Base, ):
    __tablename__ = 'content'

    id = sa.Column(sa.Integer, primary_key=True)

    description = sa.Column(sa.UnicodeText, nullable=False, server_default='')
    @declared_attr
    def __table_args__(cls):
        return (sa.Index('idx_content',
                     sa.sql.func.to_tsvector("english", cls.description),
                     postgresql_using="gin"), )

值得注意的是,与第一个示例的不同之处在于直接引用列名,而不是在引号中提供列名,因为这不起作用。

于 2014-04-10T08:39:20.960 回答