My Friend,
It sounds like you’re a bitter overwhelmed now.
Here is my advice for what its worth:
1. Your data has to be checked for the standard things, i.e. missing values, bad data, and other problems with the data. If you do this first, it makes the process a whole lot easier than to find out that your model is screwed up after you find it later.
2. What do you want to find out from your data? What are you trying to model? Ask yourself these questions. Don’t worry about the modeling procedure yet, you have to define the variable or information you want to model. Write down the model on paper to see if it makes sense? Example: I want to know why some individuals buy certain products when they received a direct mail piece and why some do not. My variable here is Yes = 1, they purchased, No=0, they did not. If you can think through this information that’s half the battle.
3. It sound like you may not have a lot of modeling in your back ground, don’t take that as a negative, so I would start with a decision tree model. There easy to understand compared to some of the other models in PASW. If you understand regression then start there.
4. Define the model.
5. Last, start reading blogs, i.e. Tim Mann has a very good blog. Don’t worry if the information is over head, start reading. It doesn’t come by osmosis.
Triener
A raccoon in the wilderness of data.