This connector is not in the active development and de facto deprecated. Consider using the modern alternatives: cartridge-java and cartridge-springdata. It is based on Netty framework and supports single node/cluster queries.
You may be still interesting in tarantool-java connector if you're using Tarantool via JDBC.
To get the Java connector for Tarantool 1.6.9, visit this GitHub page.
- Getting started
- Spring NamedParameterJdbcTemplate usage example
- JDBC
- Cluster support
- Logging
- Building
- Where to get help
- Add a dependency to your
pom.xmlfile (look for a last version on the releases page):
<dependency>
<groupId>org.tarantool</groupId>
<artifactId>connector</artifactId>
<version>1.9.4</version>
</dependency>- Configure
TarantoolClientConfig:
TarantoolClientConfig config = new TarantoolClientConfig();
config.username = "test";
config.password = "test";- Create a client:
TarantoolClient client = new TarantoolClientImpl("host:3301", config);using TarantoolClientImpl(String, TarantoolClientConfig) is equivalent to:
SocketChannelProvider socketChannelProvider = new SingleSocketChannelProviderImpl("host:3301")
TarantoolClient client = new TarantoolClientImpl(socketChannelProvider, config);You could implement your own SocketChannelProvider. It should return
a connected SocketChannel. Feel free to implement get(int retryNumber, Throwable lastError)
using your appropriate strategy to obtain the channel. The strategy can take into
account current attempt number (retryNumber) and the last transient error occurred on
the previous attempt.
The TarantoolClient will be closed if your implementation of a socket
channel provider raises exceptions. However, throwing a SocketProviderTransientException
or returning null value are handled by the client as recoverable errors. In these cases,
the client will make next attempt to obtain the socket channel. Otherwise, you will need
a new instance of client to recover. Hence, you should only throw an error different
to SocketProviderTransientException in case you have met unrecoverable error.
Below is an example of SocketChannelProvider implementation that tries
to connect no more than 3 times, two seconds for each attempt at max.
SocketChannelProvider socketChannelProvider = new SocketChannelProvider() {
@Override
public SocketChannel get(int retryNumber, Throwable lastError) {
if (retryNumber > 3) {
throw new RuntimeException("Too many attempts");
}
SocketChannel channel = null;
try {
channel = SocketChannel.open();
channel.socket().connect(new InetSocketAddress("localhost", 3301), 2000);
return channel;
} catch (IOException e) {
if (channel != null) {
try {
channel.close();
} catch (IOException ignored) { }
}
throw new SocketProviderTransientException("Couldn't connect to server", e);
}
}
};Same behaviour can be achieved using built-in SingleSocketChannelProviderImpl:
TarantoolClientConfig config = new TarantoolClientConfig();
config.connectionTimeout = 2_000; // two seconds timeout per attempt
config.retryCount = 3; // three attempts at max
SocketChannelProvider socketChannelProvider = new SingleSocketChannelProviderImpl("localhost:3301")
TarantoolClient client = new TarantoolClientImpl(socketChannelProvider, config);SingleSocketChannelProviderImpl implements ConfigurableSocketChannelProvider that
makes possible for the client to configure a socket provider.
Notes:
TarantoolClientis thread-safe and asynchronous, so you should use one client inside the whole application.TarantoolClientdoes not support name resolution for fields, indexes, spaces and so on. We highly recommend to use server-side Lua when working with named items. For example, you could create a data access object (DAO) with simple CRUD functions. If, for some reason, you do need client name resolution, you could create a function that returns necessary name-to-ID mappings.
TarantoolClient provides four interfaces to execute queries:
SyncOps- returns the operation resultAsyncOps- returns the operation result as aFutureComposableAsyncOps- return the operation result as aCompletionStageFireAndForgetOps- returns the query ID
Feel free to override any method of TarantoolClientImpl. For example, to hook
all the results, you could override this:
protected void complete(TarantoolPacket packet, CompletableFuture<?> future);The client configuration options are represented through the TarantoolClientConfig class.
Supported options are follow:
usernameis used to authenticate and authorize an user in a Taratool server instance. Default value isnullthat means client will attempt to auth as a guest.passwordis used to authenticate an user in a Taratool server instance. Default value isnull.defaultRequestSizeused to be an initial binary buffer size in bytes to send requests. Default value is4096(4 KB).predictedFuturesis used to initialize an initial capacity of hash map which stores response futures. The client is asynchronous under the hood even though it provides a synchronous operations usingjava.concurrent.CompletableFuture. Default value is(1024 * 1024) / 0.75) + 1.writerThreadPrioritydescribes a priority of writer thread. Default value isThread.NORM_PRIORITY(5).readerThreadPrioritydescribes a priority of reader thread. Default value isThread.NORM_PRIORITY(5).sharedBufferSizesets a shared buffer size in bytes (place where client collects requests when socket is busy on write). Default value is8 * 1024 * 1024(8 MB).directWriteFactoris used as a factor to calculate a threshold whether request will be accommodated in the shared buffer. If the request size exceedsdirectWriteFactor * sharedBufferSizerequest is sent directly. Default value is0.5.writeTimeoutMillissets the max time in ms to perform writing and send the bytes. Default value is 60 * 1000 (1 minute).useNewCallconfigures whether client has to use new CALL request signature or old one used to be active in Tarantool 1.6. Default value istrue.initTimeoutMillissets a max time in ms to establish connection to the server Default values is60 * 1000L(1 minute).connectionTimeoutis a hint and can be passed to the socket providers which implementConfigurableSocketChannelProviderinterface. This hint should be interpreter as a connection timeout in ms per attempt where0means no limit. This options restricts a time budget to perform one connection attempt, whileinitTimeoutMillislimits an overall time to obtain a connection. Default value is2 * 1000(2 seconds).retryCountis a hint and can be passed to the socket providers which implementConfigurableSocketChannelProviderinterface. This hint should be interpreter as a maximal number of attempts to connect to Tarantool instance. Default value is3.operationExpiryTimeMillisis a default request timeout in ms. Default value is1000(1 second).
Each operation that requires space or index to be executed, can work with
number ID as well as string name of a space or an index.
Assume, we have my_space space with space ID 512 and its primary index
primary with index ID 0. Then, for instance, select operations can be
performed using their names:
client.syncOps().select(512, 0, Collections.singletonList(1), 0, 1, Iterator.EQ);
// or using more convenient way
client.syncOps().select("my_space", "primary", Collections.singletonList(1), 0, 1, Iterator.EQ);Because iproto has not yet supported string spaces and indexes, a client caches current server schema in memory. The client relies on protocol SCHEMA_ID and sends each request with respect to cached schema version. The schema is used primarily to resolve string names of spaces or indexes against its integer IDs.
- Just after a (re-)connection to the Tarantool instance. The client cannot guarantee that new instance is the same and has same schema, thus, the client drops the cached schema and fetches new one.
- Receiving a schema version error as a response to our request. It's possible some request can be rejected by server because of schema mismatching between client and server. In this case the schema will be reloaded and the refused request will be resent using the updated schema version.
- Sending a DDL request and receiving a new version in a response.
- Sending a request against a non-existent space/index name. The client cannot exactly know whether name was not found because of it does not exist or it has not the latest schema version. A ping request is sent in the case to check a schema version and then a client will reload it if needed. The original request will be retried if a space / an index name will be found in a new schema.
- Each schema reloading requires at least two extra requests to fetch spaces and indexes metadata respectively. There is also a ping request followed by reloading of the schema to check whether the client has outdated version (see point 4 in Schema update).
- In some circumstance, requests can be rejected several times until both client's and server's versions matches. It may take significant amount of time or even be a cause of request timeout.
- The client guarantees an order of synchronous requests per thread. Other cases such as asynchronous or multi-threaded requests may be out of order before the execution.
The JDBC driver uses TarantoolClient implementation to provide a communication with server.
To configure socket channel provider you should implements SocketChannelProvider and add
socketChannelProvider=abc.xyz.MySocketChannelProvider to connect url.
For example:
jdbc:tarantool://localhost:3301?user=test&password=test&socketProvider=abc.xyz.MySocketProvider
Here is an example how you can use the driver covered by Spring DriverManagerDataSource:
NamedParameterJdbcTemplate template = new NamedParameterJdbcTemplate(new DriverManagerDataSource("jdbc:tarantool://localhost:3301?user=test&password=test"));
RowMapper<Object> rowMapper = new RowMapper<Object>() {
@Override
public Object mapRow(ResultSet resultSet, int i) throws SQLException {
return Arrays.asList(resultSet.getInt(1), resultSet.getString(2));
}
};
try {
System.out.println(template.update("drop table hello_world", Collections.<String, Object>emptyMap()));
} catch (Exception ignored) {
}
System.out.println(template.update("create table hello_world(hello int not null PRIMARY KEY, world varchar(255) not null)", Collections.<String, Object>emptyMap()));
Map<String, Object> params = new LinkedHashMap<String, Object>();
params.put("text", "hello world");
params.put("id", 1);
System.out.println(template.update("insert into hello_world(hello, world) values(:id,:text)", params));
System.out.println(template.query("select * from hello_world", rowMapper));
System.out.println(template.query("select * from hello_world where hello=:id", Collections.singletonMap("id", 1), rowMapper));For more implementation details, see API documentation.
Statement and PreparedStatement objects can be used to submit batch
updates.
For instance, using Statement object:
Statement statement = connection.createStatement();
statement.addBatch("INSERT INTO student VALUES (30, 'Joe Jones')");
statement.addBatch("INSERT INTO faculty VALUES (2, 'Faculty of Chemistry')");
statement.addBatch("INSERT INTO student_faculty VALUES (30, 2)");
int[] updateCounts = stmt.executeBatch();or using PreparedStatement:
PreparedStatement stmt = con.prepareStatement("INSERT INTO student VALUES (?, ?)");
stmt.setInt(1, 30);
stmt.setString(2, "Michael Korj");
stmt.addBatch();
stmt.setInt(1, 40);
stmt.setString(2, "Linda Simpson");
stmt.addBatch();
int[] updateCounts = stmt.executeBatch();The connector uses a pipelining when it performs a batch request. It means each query is asynchronously sent one-by-one in order they were specified in the batch.
There are a couple of caveats:
-
JDBC spec recommends that auto-commit mode should be turned off to prevent the driver from committing a transaction when a batch request is called. The connector is not support transactions and auto-commit is always enabled, so each statement from the batch is executed in its own transaction.
-
DDL operations aren't transactional in Tarantool. In this way, a batch like this can produce an undefined behaviour (i.e. second statement can fail with an error that
studenttable does not exist).
statement.addBatch("CREATE TABLE student (id INT PRIMARY KEY, name VARCHAR(100))");
statement.addBatch("INSERT INTO student VALUES (1, 'Alex Smith')");-
If
vinylstorage engine is used an execution order of batch statements is not specified. NOTE: This behaviour is incompatible with JDBC spec in the sentence "Batch commands are executed serially (at least logically) in the order in which they were added to the batch" -
The driver continues processing the remaining commands in a batch once execution of a command fails.
To be more fault-tolerant the connector provides cluster extensions. In
particular TarantoolClusterClient and built-in RoundRobinSocketProviderImpl
used as a default SocketProvider implementation. When currently connected
instance is down then the client will try to reconnect to the first available
instance using strategy defined in a socket provider. You need to supply
a list of nodes which will be used by the cluster client to provide such
ability. Also you can prefer to use a discovery mechanism
in order to dynamically fetch and apply the node list.
This cluster-aware provider uses addresses pool to connect to DB server. The provider picks up next address in order the addresses were passed.
Similar to SingleSocketChannelProviderImpl this RR provider also
relies on two options from the config: TarantoolClientConfig.connectionTimeout
and TarantoolClientConfig.retryCount but in a bit different way.
The latter option says how many times the provider should try to establish a
connection to one instance before failing an attempt. The provider requires
positive retry count to work properly. The socket timeout is used to limit
an interval between connections attempts per instance. In other words, the provider
follows a pattern connection should succeed after N attempts with M interval between
them at max.
- Configure
TarantoolClusterClientConfig:
TarantoolClusterClientConfig config = new TarantoolClusterClientConfig();
// fill other settings
config.operationExpiryTimeMillis = 2000;
config.executor = Executors.newSingleThreadExecutor();- Create an instance of
TarantoolClusterClientImpl. You need to provide an initial list of nodes:
String[] nodes = new String[] { "myHost1:3301", "myHost2:3302", "myHost3:3301" };
TarantoolClusterClient client = new TarantoolClusterClient(config, nodes);- Work with the client using same API as defined in
TarantoolClient:
client.syncOps().insert(23, Arrays.asList(1, 1));Auto-discovery feature allows a cluster client to fetch addresses of cluster nodes to reflect changes related to the cluster topology. To achieve this you have to create a Lua function on the server side which returns a single array result. Client periodically polls the server to obtain a fresh list and apply it if its content changes.
- On the server side create a function which returns nodes:
tarantool> function get_cluster_nodes() return { 'host1:3301', 'host2:3302', 'host3:3301' } endYou need to pay attention to a function contract we are currently supporting:
- The client never passes any arguments to a discovery function.
- A discovery function must return an array of strings (i.e
return {'host1:3301', 'host2:3301'}). - Each string should satisfy the following pattern
host[:port](or more strictly/^[^:]+(:\d+)?$/- a mandatory host containing any string and an optional colon followed by digits of the port). Also, the port must be in a range between 1 and 65535 if one is presented. - A discovery function may return multi-results but the client takes
into account only first of them (i.e.
return {'host:3301'}, discovery_delay, where the second result is unused). Even more, any extra results are reserved by the client in order to extend its contract with a backward compatibility. - A discovery function should NOT return no results, empty result, wrong type result, and Lua errors. The client discards such kinds of results but it does not affect the discovery process for next scheduled tasks.
- On the client side configure discovery settings in
TarantoolClusterClientConfig:
TarantoolClusterClientConfig config = new TarantoolClusterClientConfig();
// fill other settings
config.clusterDiscoveryEntryFunction = "get_cluster_nodes"; // discovery function used to fetch nodes
config.clusterDiscoveryDelayMillis = 60_000; // how often client polls the discovery server - Create a client using the config made above.
TarantoolClusterClient client = new TarantoolClusterClient(config);
client.syncOps().insert(45, Arrays.asList(1, 1));- You need to set not empty value to
clusterDiscoveryEntryFunctionto enable auto-discovery feature. - There are only two cases when a discovery task runs: just after initialization of the cluster
client and a periodical scheduler timeout defined in
TarantoolClusterClientConfig.clusterDiscoveryDelayMillis. - A discovery task always uses an active client connection to get the nodes list. It's in your responsibility to provide a function availability as well as a consistent nodes list on all instances you initially set or obtain from the task.
- Every address which is unmatched with
host[:port]pattern will be filtered out from the target addresses list. - If some error occurs while a discovery task is running then this task will be aborted without any after-effects for next task executions. These cases, for instance, are a wrong function result (see discovery function contract) or a broken connection. There is an exception if the client is closed then discovery process will stop permanently.
- It's possible to obtain a list which does NOT contain the node we are currently connected to. It leads the client to try to reconnect to another node from the new list. It may take some time to graceful disconnect from the current node. The client does its best to catch the moment when there are no pending responses and perform a reconnection.
In addition to the options for the standard client, cluster config provides some extra options:
executordefines an executor that will be used as a thread of execution to retry writes. Default values isnullwhich means the cluster client will use a single thread executor.clusterDiscoveryEntryFunctionis a name of the stored function to be used to fetch list of instances. Default value isnull(not set).clusterDiscoveryDelayMillisdescribes how often in ms to poll the server for a new list of cluster nodes. Default value is60 * 1000(1 minute).
The connector uses its own logging facade to abstract from any logging libraries which can be used inside the apps where the connector attached. At the moment, the facade supports JUL as a default logging system, SLF4J facade, and Logback directly via SLF4J interface.
The logging facade offers several ways in integrate its internal logging with foreign one in order:
-
Using system property
org.tarantool.logging.provider. Supported values are jdk and slf4j for the java util logging and SLF4J/Logback respectively. For instance, usejava -Dorg.tarantool.logging.provider=slf4j <...>. -
Using Java SPI mechanism. Implement your own provider org.tarantool.logging.LoggerProvider To register your provider save
META-INF.services/org.tarantool.logging.LoggerProviderfile with a single line text contains a fully-qualified class name of the provider implementation.
cat META-INF/services/org.tarantool.logging.LoggerProvider
org.mydomain.MySimpleLoggerProvider-
CLASSPATH exploring. Now, the connector will use SLF4J if Logback is also in use.
-
If nothing is successful JUL will be used by default.
| Logger name | Level | Description |
|---|---|---|
| o.t.c.TarantoolClusterStoredFunctionDiscoverer | WARN | prints out invalid discovery addresses |
| o.t.TarantoolClusterClient | TRACE | prints out request retries after transient errors |
| o.t.TarantoolClientImpl | WARN | prints out reconnect issues |
To run unit tests use:
./mvnw clean testTo run integration tests use:
./mvnw clean verifyGot problems or questions? Post them on
Stack Overflow with the
tarantool and java tags, or use these tags to search the existing knowledge
base for possible answers and solutions.