Refactor `tiny_tds` to avoid sharing `DBPROCESS` #571

andyundso · 2024-12-20T09:17:31Z

Background

In order for tiny_tds to communicate with a MSSQL server using FreeTDS, they provide a DBPROCESS struct to do so in C land. The interaction with DBPROCESS require to follow an exact sequence:

Open a connection using dbopen.
Prepare a command buffer using dbcmd.
Send it to the server using dbsqlsend.
Acknowledge that the server acknowledge using dbsqlok.
Gather metadata using dbresults.
Fetch each row using dbnextrow or cancel the running results (dbcancel)
Close the client using dbclose if not cancelled.

Our insert and do method, currently implemented on the Result class, perform the entire sequence. However, with execute, this is intentionally not done to allow lazy-loading of results from the server. This can lead to errors, some intended, others not:

You get an error message if you try to make another query without requesting all results first (intended error). Although it can have unintended side-effects. For example, if you call find on Result, it will abort the each early, therefore not all results are consumed and you cannot start a new query - you have to initialise a new Client.
There is a scenario with threads where you force a crash, see the following Ruby code:

it 'raises error when query cannot be sent' do
  client = new_connection
  assert_client_works(client)
    
  thread1 = Thread.new do
    client.execute("WaitFor Delay '00:00:10'")
  end

  assert thread1.alive?    
  thread2 = Thread.new do
    assert_raises(TinyTds::Error) { client.execute("WaitFor Delay '00:00:02'") }
  end
    
  thread1.join
  thread2.join
end

This will result in the following crash:

ruby: mem.c:1202: tds_free_connection: Assertion `conn->in_net_tds == NULL' failed.
Aborted (core dumped)
rake aborted!

Technically, DBPROCESS as well as some metadata of ours (like dbsqlsent) is part of the client instance in C land. If the garbage collector decided to sweep away the Client instance, the results can no longer be consumed. A reproduction of this is provided in Segementation Fault when reading from a closed connection #435.

results = TinyTds::Client.new(opts).execute('SELECT 42 as answer_to_life')
GC.start
puts results.to_a

This will yield a segmentation fault, since the Client instance is unreachable from the point of view of the garbage collector, and it gets deallocated.

You can force a segmentation fault by closing the client before consuming the result. Closing the client deallocates the DBPROCESS as well as the metadata. See Segementation Fault when reading from a closed connection #435 for the reproduction script.

Proposed solution

The proposed solution in this PR removes the lazy-loading functionality of tiny_tds. insert, do and execute are moved to the Client class. The C code for the Result class is removed entirely, thus leading the Client class to have sole control over all C data structures. Result is now a PORO holding the results rows as well as couple of metadata, like fields.

Comment

I am leaving this in draft state for now. It would require the release of a new major, which we only just did. I also did not check all of the code yet - I am sure the implementation is not fully complete yet (e.g. return code is missing). I also need to test a couple of edge-cases with threads and garbage collector.

andyundso added 4 commits December 18, 2024 13:42

Move insert to client class

d05d70d

Move do to client class

0598fde

Refactor execute to fetch an entire result object

62fd9cd

Ensure test database data is loaded before running tests

1a3773a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor `tiny_tds` to avoid sharing `DBPROCESS` #571

Refactor `tiny_tds` to avoid sharing `DBPROCESS` #571

andyundso commented Dec 20, 2024

Refactor tiny_tds to avoid sharing DBPROCESS #571

Are you sure you want to change the base?

Refactor tiny_tds to avoid sharing DBPROCESS #571

Conversation

andyundso commented Dec 20, 2024

Background

Proposed solution

Comment

Refactor `tiny_tds` to avoid sharing `DBPROCESS` #571

Refactor `tiny_tds` to avoid sharing `DBPROCESS` #571