Hi dear all,
I have got questionnaire data (SPSS format) with over 2,000 columns. A few questions:
1. Is this usual to have thousands of variables? (in the questionnaire some questions are multiple choice with a few hundreds of options. I don't know how they carried out the survey! Must be a very long page in IE...)
My answer currently is Yes.
2. How can I store this into database (e.g. SQL Server supports 1024 columns only)?
SPSS file looks like:
UID, BoughtA, BoughtB, BoughtC, BoughtD........HateWalmart, HateBestBuy,HateTesco
ResponderA,1,0,0,1................1,1,1
ResponderB,0,0,0,0...............1,0,1
My plan is to transform it to a single table:
UID, Question, Answser
ResponderA,Bought,A
ResponderA,Bought,D
ResponderA,Hate,Walmart
.......
ResponderB,Hate, BestBuy
There are lots of columns are negative (didn't bought, didn't go to), does this need any special consideration?
3. Is it possible to find/write a de-restructure node to consolidate the data to fewer columns?
My guess is: Possibly not if the questionnaire have multiple choice questions.
Thanks guys!