How do I convert a django QuerySet to numpy record array?

If you want to get all of your objects and create a numpy array with objects as elements of array:

import numpy as np
qs = MyModel.objects.all()
numpy_array = np.array(list(qs))

According to my work, I use something as below:

import numpy as np    
qs = MyModel.objects.values_list('id','first_name','last_name').filter(gender='male').order_by('id')
numpy_array = np.array(list(qs))

Rows of array corresponds to records and columns of array corresponds to values that I defined above (id, first name, last name).


import numpy as np
qs = MyModel.objects.all()
vlqs = qs.values_list()
r = np.core.records.fromrecords(vlqs, names=[f.name for f in MyModel._meta.fields])

This uses the QuerySet iterator directly and avoids the time-and-garbage-wasting step of creating a python list. It also uses MyModel._meta.fields to get the actual field names from the model, as explained at Get model's fields in Django

If you just want a single field (e.g. the 'votes' field of the model) extracted into a one-dimensional array, you can do:

vlqs = qs.values_list('votes', flat=True)
votes = np.fromiter(vlqs, numpy.dtype('int_'))

This is like asking "how do I convert the contents of my fridge into dinner?". It depends on what you have in your fridge and what you'd like to eat. The short answer (equivalent to saying "by cooking") is to iterate over the queryset, constructing objects of whatever composite data types you'd like to instantiate the array with (generally an iterable and a dictionary). The long answer depends on what you'd actually like to accomplish.

Tags:

Django

Numpy