Disk space full during insert, what happens?
The actual query wasn't executed in an explicit transaction though, can that explain the existence of the target table?
Yes, exactly so.
If you do a simple select into
outside of an explicit transaction
, there are two transactions
in autocommit mode: the first creates the table
and the second fills it up.
You can prove it to yourself this way:
In a dedicated database
on a test server in simple recovery model
, first make a checkpoint
and ensure that the log contains only a few rows (3 in case of 2016) related to checkpoint
. Then run a select into
of one row and check the log
again, looking for a begin tran
associated with select into
:
checkpoint;
select *
from sys.fn_dblog(null, null);
select 'a' as col
into dbo.t3;
select *
from sys.fn_dblog(null, null)
where Operation = 'LOP_BEGIN_XACT'
and [Transaction Name] = 'SELECT INTO';
You'll get 2 rows, showing you had 2 transactions
.
Are the assumptions I sketched here correct? Is this a likely scenario to have happened?
Yes, they are correct.
The insert
part of select into
was rolled back
, but it does not release any data space. You can verify this by executing sp_spaceused
; you'll see plenty of unallocated space
.
If you want the database to release this unallocated space you should shrink
your data file(s).
You're correct, the SELECT...INTO
command is not atomic. This wasn't documented at the time of the original post, but is now called out specifically on the SELECT - INTO Clause (Transact-SQL) page on MS Docs (yay open source!):
The
SELECT...INTO
statement operates in two parts - the new table is created, and then rows are inserted. This means that if the inserts fail, they will all be rolled back, but the new (empty) table will remain. If you need the entire operation to succeed or fail as a whole, use an explicit transaction.
I'll create a database that uses the full recovery model. I'll give it a fairly small log file, and then tell it that the log file can't autogrow:
CREATE DATABASE [SelectIntoTestDB]
ON PRIMARY
(
NAME = N'SelectIntoTestDB',
FILENAME = N'C:\Program Files\Microsoft SQL Server\MSSQL14.SQL2017\MSSQL\DATA\SelectIntoTestDB.mdf',
SIZE = 8192KB,
FILEGROWTH = 65536KB
)
LOG ON
(
NAME = N'SelectIntoTestDB_log',
FILENAME = N'C:\Program Files\Microsoft SQL Server\MSSQL14.SQL2017\MSSQL\DATA\SelectIntoTestDB_log.ldf',
SIZE = 8192KB,
FILEGROWTH = 0
)
And then I'll try to insert all the posts from my copy of the StackOverflow2010 database. This should write a bunch of stuff to the log file.
USE [SelectIntoTestDB];
GO
SELECT *
INTO dbo.Posts
FROM StackOverflow2010.dbo.Posts;
This resulted in the following error after running for 4 seconds:
Msg 9002, Level 17, State 4, Line 1
The transaction log for database 'SelectIntoTestDB' is full due to 'ACTIVE_TRANSACTION'.
But there is an empty Posts table in my new database:
So, as you suspected, the CREATE TABLE
succeeded, but the INSERT
portion was all rolled back. A workaround would be to use an explicit transaction (which you already noted in your question).