ANN: Sequel 3.1.0 Released

nerdmaster · June 4, 2009, 6:39pm

Sequel is a lightweight database access toolkit for Ruby.

Sequel provides thread safety, connection pooling and a concise DSL
for constructing database queries and table schemas.
Sequel also includes a lightweight but comprehensive ORM layer for
mapping records to Ruby objects and handling associated records.
Sequel supports advanced database features such as prepared
statements, bound variables, stored procedures, master/slave
configurations, and database sharding.
Sequel makes it easy to deal with multiple records without having
to break your teeth on SQL.
Sequel currently has adapters for ADO, Amalgalite, DataObjects,
DB2, DBI, Firebird, Informix, JDBC, MySQL, ODBC, OpenBase, Oracle,
PostgreSQL and SQLite3.

Sequel 3.1.0 has been released and should be available on the gem
mirrors. The 3.1.0 release adds numerous improvements:

New Plugins

3 new plugins were added that implement features supported by
DataMapper: identity_map, tactical_eager_loading, and
lazy_attributes. These plugins don’t add any real new features,
since you could do everything they allow before simply by being
a little more explicit in your Sequel code. However, some people
prefer a less explicit approach that uses a bit more magic, and
now Sequel can accomodate them.

The identity_map plugin allows you to create a 1-1
correspondence of model objects to database rows via a temporary
thread-local identity map. It makes the following statment true:

Sequel::Model.with_identity_map do
Album.filter{(id > 0) & (id < 2)}.first.object_id ==
Album.first(:id=>1).object_id
end

As the code above implies, you need to use the with_identity_map
method with a block to use the identity mapping feature.

By itself, identity maps don’t offer much, but Sequel uses them
as a cache when looking up objects by primary key or looking up
many_to_one associated objects. Basically, it can be used as a
performance enhancer, and it also allows the support of the
later plugins.

The identity_map plugin is expected to be most useful in web
applications. With that in mind, here’s a Rack middleware that
wraps each request in a with_identity_map call, so the
identity_map features are available inside the web app:

Sequel::Model.plugin :identity_map
class SequelIdentityMap
def initialize(app)
@app = app
end
def call(env)
Sequel::Model.with_identity_map{@app.call(env)}
end
end
The tactical_eager_loading plugin allows you to eagerly load an
association for all models retrieved in the same group whenever
one of the models accesses the association:

2 queries total

Album.filter{id<100}.all do |a|
a.artists
end

In order for this correctly, you must use Dataset#all to load the
records, you cannot iterate over them via Dataset#each. This is
because eager loading requires that you have all records in
advance, and when using Dataset#each you cannot know about later
records in the dataset.

Before, you could just be explicit about the associations you
needed and make sure to eagerly load them using eager before
calling Dataset#all.
The lazy_attributes plugin builds on the identity_map and
tactical_eager_loading plugins and allows you to create
attributes that are lazily loaded from the database:

Album.plugin :lazy_attributes, :review

This will remove the :review attribute from being selected by
default. If you try to access the attribute after it is selected,
it’ll retrieve the value from the database. If the object was
retrieved with a group of other objects and an identity map is in
use, it’ll retrieve the lazy attribute for the entire group of
objects at once, similar to the tatical_eager_loading plugin:

2 queries total

Sequel::Model.with_identity_map do
Album.filter{id<100}.all do |a|
a.review
end
end

Before, you could just set the default selected columns for a model
to not include the lazy attributes, and just use select_more to
add them to any query where the resulting model objects will
need the attributes.
A many_through_many plugin was also added. This very powerful
plugin allows you to create associations to multiple objects through
multiple join tables. Here are some examples:

Assume the following many to many associations:

Artist.many_to_many :albums
Album.many_to_many :tags

Same as Artist.many_to_many :albums

Artist.many_through_many :albums,
[[:albums_artists, :artist_id, :album_id]]

All tags associated to any album this artist is associated to

Artist.many_through_many :tags,
[[:albums_artists, :artist_id, :album_id],
[:albums, :id, :id],
[:albums_tags, :album_id, :tag_id]]

All artists associated to any album this artist is associated to

Artist.many_through_many :artists,
[[:albums_artists, :artist_id, :album_id],
[:albums, :id, :id],
[:albums_artists, :album_id, :artist_id]]

All albums by artists that are associated to any album this

artist is associated to

Artist.many_through_many :artist_albums,
[[:albums_artists, :artist_id, :album_id],
[:albums, :id, :id],
[:albums_artists, :album_id, :artist_id],
[:artists, :id, :id],
[:albums_artists, :artist_id, :album_id]]

Basically, for each join table between this model and the
associated model, you use an array with a join table name, left key
name (key closer to this model), and right key name (key closer to
the associated model).

In usual Sequel fashion, this association type works not just
for single objects, but it can also be eagerly loaded via eager or
eager_graph. There are numerous additional configuration options,
please see the RDoc for details.

many_through_many should be able to handle any ActiveRecord has_many
:through association, as well as many associations that aren’t
handled by any other ruby ORM.

New bin/sequel Features

The bin/sequel command line tool now supports the following options:

-C: Copies one database to another. You must specify two database
arguments. Works similar to Taps, copying the table schema, then
the table data, then creating the indexes.
-d: Dump the schema of the database in the database-independent
migration format.
-D: Dump the schema of the database in the database-specific
migration format.
-h: Display the help
-t: Output the full backtrace if an exception is raised

The bin/sequel tool is now better about checking which options can
be used together. It also now supports using the -L option multiple
times and having it load model files from multiple directory trees.

New Features

Dataset#qualify_to and #qualify_to_first_source were added. They
allow you to qualify unqualified columns in the current dataset
to the given table or the first source. This can be used to join
a dataset that has unqualified columns to a new table which has
columns with the same name.

For example, take this dataset:

ds = DB[:albums].select(:name).order(:name).filter(:id=>1)

SELECT name FROM albums WHERE (id = 1) ORDER BY name

Let’s say you want to join it to the artists table:

ds2 = ds.join(:artists, :id=>:artist_id)

SELECT name FROM albums

INNER JOIN artists ON (artists.id = albums.artist_id)

WHERE (id = 1) ORDER BY name

That’s going to give you an error, as the artists table already has
columns named id and name. This new feature allows you to do the
following:

ds2 = ds.qualify_to_first_source.join(:artists, :id=>:artist_id)

SELECT albums.name FROM albums

INNER JOIN artists ON (artists.id = albums.artist_id)

WHERE (albums.id = 1) ORDER BY albums.name

By doing this, all unqualified columns are qualified, so you get
a usable query. This is expected to be most useful for users that
have a default order or filter on their models and want to join
the model to another table. Before you had to replace the filters,
selection, etc. manually, or use qualified columns by default even
though the weren’t needed in most cases.
Savepoints are now supported using SQLite and MySQL, assuming you
are using a database version that supports them. You need to
pass the :savepoint option to Database#transaction to use a
savepoint.
Model plugins can now depend on other plugins, simply by calling
the Model.plugin method inside the plugin’s apply method:

module LazyAttributes
def self.apply(model)
model.plugin :tactical_eager_loading
end
Model.plugin now takes a block with is passed to the plugin’s
apply and configure method (see Backwards Compatibility section for
more information on the configure method).
You can see which plugins are loaded for a model by using
Model.plugins.
You can use Sequel.extension method to load extensions:

Sequel.extension :pagination, :query

This will only load extensions that ship with Sequel, unlike the
Model.plugin method which will also load external plugins.
You can now use Database#create_table? to create the table if it
doesn’t already exist (a very common need, it seems). The schema
plugin now supports Model.create_table? as well.
#sql_subscript is now an allowed method on most SQL expression
objects that Sequel generates. Also, arguments to #sql_subscript
can now be other expressions instead of just integers.
Associations can now take a :cartesian_product_number option, which
can be used to tell Sequel whether to turn on duplicate object
detection when eagerly loading objects through eager_graph. This
number should be 0 if the association can never create multiple
rows for each row in the current table, 1 if it can create multiple
rows in the each row in the current table, and 2 if the association
itself causes a cartesian product.
On MySQL, Dataset#insert_ignore now affects #insert as well as
multi_insert and import.
Database#create_table now supports an :ignore_index_errors option,
and Database#add_index now supports an :ignore_errors option.
These are used by the schema_dumper when dumping an database
schema to be restored on another database type, since indexes
aren’t usually required for proper operation and some indexes
can’t be transferred.
The ADO adapter now takes a :provider option, which can be used
to set the provider.
The ADO adapter now takes a :command_timeout option, which tells
the connection how long to wait before giving up and raising an
exception.
The Sequel.amalgalite adapter method was added. Like the
Sequel.sqlite method, you can call it with no arguments to get
an in memory database.

Other Improvements

MySQL “commands out of sync” errors should no longer occur unless
you are nesting queries (calling Dataset#each inside Dataset#each).
A bug dating at least to 2007 and possibly since the initial
creation of the Sequel MySQL adapter was the cause. Before, SQL
that caused a result set that was sent using a method where Sequel
doesn’t yield a result set would cause the “commands out of sync”
error on the following query. For example, the following code
would cause the error:

DB << “SHOW DATABASES”

If for some reason a “commands out of sync” error does occur,
Sequel will disconnect the connection from the connection pool,
so it won’t continually stay in the pool and raise errors every
time it is used.
The schema_dumper extension is much better about parsing defaults
from the database. It can now correctly parse most defaults on
MySQL, SQLite, and PostgreSQL databases. It no longer includes
defaults that it can’t parse to a ruby object unless a database-
specific dump is requested.
The schema_dumper extension now dumps tables in alphabetical order.
Ordered and limited datasets are now handled correctly when using
union, intersect, and except. Also, union, intersect, and except
now always return a from_self dataset, so further limiting,
filtering, and ordering of them now works as expected.
Dataset#graph now works correctly with a complex dataset without
having to use from_self. Before, code like the following didn’t
do what was expected:

DB[:albums].
graph(DB[:artists].filter{name > ‘M’}, :id=>:artist_id)

Before, the filter on DB[:artists] would be dropped. Now, Sequel
correctly uses a subselect.
You can now specify serialization formats per column in the
serialization plugin, either by calling the plugin multiple
times or by using the new serialize_attributes method:

Album.plugin :serialization
Album.serialize_attributes :marshal, :review
Album.serialize_attributes :yaml, :name
Album.serialization_map #{:name=>:yaml, :review=>:marshal}

The public API for the serialization plugin is still backwards
compatible, but the internals have changed slightly to support
this new feature.
You can now use eager_graph to eagerly load associations for models
that lack primary keys.
The :eager_graph association option now works when lazily-loading
many_to_many associations.
Dataset#add_graph_aliases now works correctly even if
set_graph_aliases hasn’t been used.
The PostgreSQL Database#tables method now assumes the public schema
if a schema is not given and there is no default_schema.
The PostgreSQL Database#indexes method no longer returns partial
indexes or functional indexes.
The MySQL Database#indexes method no longer returns indexes on
partial columns (prefix indexes).
Default values for String :text=>true and File columns on MySQL
are ignored, since MySQL doesn’t support them. They are not
ignored if you use text and blob, since then you are using the
database-specific syntax and Sequel doesn’t do translation when
the database-specific syntax is used.
On PostgreSQL, attempting the reset the primary key sequence for a
table without a primary key no longer causes an error.
Using a placeholder string in an association’s :condition option
now works correctly (e.g. :conditions=>[‘n = ?’, 1])
An error is no longer raised if you attempt to load a plugin that
has a DatasetMethods module but no public dataset methods.
The check for dataset[n] where n is an integer was fixed. It now
raises an error inside of returning a limited dataset.
On PostgreSQL, Dataset#insert with static SQL now works correctly.
A reflection.rdoc file was added giving an overview of Sequel’s
reflection support.
The Migrator now works correctly with file names like
001_12312412_file_name.rb.
The association code now requires the classes match when looking
for a reciprocal association.
An unlikely threading bug (race condition) was possible when using
the validation_class_methods plugin. The plugin was refactored and
now uses a mutex to avoid the issue. One of the refactoring changes
makes it so that you can no longer use a class level vaildation
inside a Class.new block (since inherited isn’t called until the
block finishes).
The exception messages when Sequel.string_to_* fail have been fixed.
The String :text=>true generic database type has been fixed when
using the Firebird adapter.

Backwards Compatibility

A plugin’s apply method is now only called the first time a plugin
is loaded. Plugins can now have a configure method that is called
every time the plugin is loaded, and is always called after the
instance methods, class methods, and dataset method submodules have
been added to the model. This is different from apply, which is
called before the submodules are loaded.

If you are a plugin author, please check your implementation to
make sure this doesn’t cause problems for you. If you have
questions, please post on the Sequel mailing list.

This new plugin feature will make certain things a lot easier, and
it should be mostly backwards compatible. However, if a plugin
was previously expected to be loaded multiple times with the apply
method called each time, it will no longer work correctly.
The plugin_opts methods defined now include multiple args in an
array if multiple args are given. Before, the plugin_opts methods
just returned the first argument.
Database#table_exists? no longer checks the cached schema
information. By default, it will always do a database query
(unless overridden in an adapter). This shouldn’t affect the
results, but if were using the method a lot and expecting it to
use cached information, it doesn’t have the same performance
characteristics.
The internal storage of the :select option for datasets have
changed. You can no longer use a hash as a way of aliasing
columns. Dataset#select now does the translation from the hash to
SQL::AliasedExpression instances. Basically, if you were using
Dataset#clone directly with a :select option with hashes for
aliasing, you should switch to using Dataset#select or changing
the hashes to AliasedExpressions yourself.

Thanks,
Jeremy

{Website}[http://sequel.rubyforge.org]
{Source code}[GitHub - jeremyevans/sequel: Sequel: The Database Toolkit for Ruby]
{Bug tracking}[Google Code Archive - Long-term storage for Google Code Project Hosting.]
{Google group}[http://groups.google.com/group/sequel-talk]
{RDoc}[http://sequel.rubyforge.org/rdoc]

ANN: Sequel 3.1.0 Released

New Plugins

2 queries total

2 queries total

Assume the following many to many associations:

Same as Artist.many_to_many :albums

All tags associated to any album this artist is associated to

All artists associated to any album this artist is associated to

All albums by artists that are associated to any album this

artist is associated to

New bin/sequel Features

New Features

SELECT name FROM albums WHERE (id = 1) ORDER BY name

SELECT name FROM albums

INNER JOIN artists ON (artists.id = albums.artist_id)

WHERE (id = 1) ORDER BY name

SELECT albums.name FROM albums

INNER JOIN artists ON (artists.id = albums.artist_id)

WHERE (albums.id = 1) ORDER BY albums.name

Other Improvements

Backwards Compatibility