Data Mining

Download Free Data Mining Source Code In C/C++, C#, Visual Basic, Visual Basic.NET, Java,
and other programming languages
Welcome to Data Mining Sign in | Join | Help
in Search

Data Mining Source Code Newsletter

Business Analyst Training
Live, Online, Video Courses
Instructor-Led + Hands-On
BusinessAnalystBootCamp.Com

SQL + Database Training
Live, Online, Video Classes
Instructor-Led + Hands-On
SQLBootCamp.Com

Software Developer Training
Live, Online, Video Courses
Instructor-Led + Hands-On
SoftwareDevelperBootCamp.Com

IT CAREER COACH
Hands-On Experience Coaching
IT Skills Training
IT-Career-Coach.NET

IT Professional Newsletter
"Free" IT Career Success Tips
How To Accelerate Your Career
IT Career Newsletter

Ask IT Career Questions
"ASK" A Burning IT Career
Question Or Get Answers
Ask A Burning IT Question Now!

Announcing The Data Mining Source Code Newsletter!

Subscribe By Email | Subscribe By RSS Feed

cross-validation procedure

Last post 02-07-2010, 21:59 by TimManns. 1 replies.
Sort Posts: Previous Next
  •  02-01-2010, 5:31 9637

    cross-validation procedure

    Hi all/

    I working on the pasw13. i need to split a data set (of 13,000 posts). The partition node  dose not allow to part the data set according to the cross-validation methodology. (in this procedure you need to split the data set to 5 up to 10 parts...).  Does anybody have an idea, how can we do this?

    Thank's Dan.

  •  02-07-2010, 21:59 9683 in reply to 9637

    Re: cross-validation procedure

    Not sure I am following your question.  Do you simply want to split data into equal sized pieces numbered 5 to 10?

    You can use the random() function to generate a random number, and then a simple select node to pick records.  

    For example, in a derive node type;

    (random0(5) + 5)

    -> this will craete a random number from 0 to 5 and then just add 5 (so random number will be between 5 and 10)

    Then you will have random allocation of a value 5-to-10 on each record.  You can then select records with a simple select node.

    Cheers

    Tim

Announcing The Data Mining Source Code Newsletter!

Subscribe By Email | Subscribe By RSS Feed