Cut the document into equal pieces
Hello, how can I cut the documents into equal parts (e.g. 10 parts) and save them as separate documents? I cannot use regex (and so the Cut Document operator), since I have several documents of different...
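Outside RapidMiner, the splitting step itself can be sketched in a few lines of Python. This is only an illustration under assumptions the post does not state: the document is available as a plain string, and "equal" means equal character counts. The function name split_into_parts is hypothetical and is not a RapidMiner operator.

```python
def split_into_parts(text, n_parts):
    """Split text into n_parts contiguous pieces whose lengths differ by at most one character."""
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")
    # base = minimum piece length; extra = number of pieces that get one more character
    base, extra = divmod(len(text), n_parts)
    parts = []
    start = 0
    for i in range(n_parts):
        end = start + base + (1 if i < extra else 0)
        parts.append(text[start:end])
        start = end
    return parts


if __name__ == "__main__":
    for index, piece in enumerate(split_into_parts("abcdefghijk", 3), start=1):
        print(index, piece)
```

Because the piece boundaries depend only on the length of each document, no regular expression is needed and documents of different lengths are handled the same way. Splitting on character counts can cut words in half; splitting a token list instead of a string would avoid that.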
Retrieving HTML Pages Requiring Login
Hello, I am trying to download a series of HTML pages that require a user login using the Get Pages operator. I tried the solution proposed in the post below, but the data I need did not come through...
About RapidMiner Studio using Radoop - Hive problem
Dear All, I get an error regarding RapidMiner Radoop. I have downloaded and installed the Radoop extension into the latest version of RapidMiner Studio. I can see all the extensions installed. I click the...
Polynomial regression gives wrong results (?)
Hi RapidMiner, I'm trying to use Polynomial Regression with a dataset generated from the function y = 2*x^2 + 3*x + 1, and test the model with the same dataset, but the prediction results look like a...
Is assigning custom weights to attributes and using those custom weights in...
I want to assign/set a custom weight to a binominal attribute, so that this attribute's impact on my prediction is much higher. I already know how to set custom weights with "weight by user...
Evaluating my models
Hello. I'm trying to evaluate my models. I've built 3 models: a k-NN, a local polynomial regression and a deep learning model. I'm trying to predict the shaft power of a vessel with different variations of...
Process documents loop
Hello everyone, I'm trying to perform sentiment analysis on a bunch of data and I want to test different parameters of the Process Documents from Data operator. It is obvious that I can't manually test all...
Tokenize operator of Text processing extension stopped working after...
This is a screenshot of the results tab, when putting a breakpoint after the 'Tokenize' operator. The input is a text attribute called 'description' and the output is the same plain attribute instead...
Model Management / Model Comparison / Model Tracking
Today we released a new extension as part of the RapidMiner Labs initiative to provide an easy way to compare models, track model performance over a period of time, and automatically replace models if...
Round real values to the n-th decimal?
Hi, is there any operator that can round or cut off a real number at the n-th decimal, e.g. at the 5th digit after the decimal point, and do this for one or any number of columns?
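The difference between rounding and cutting off at the n-th decimal can be illustrated outside RapidMiner with Python's standard decimal module. This is a minimal sketch rather than a RapidMiner solution. The helper names round_to and truncate_to are made up for this example, and it assumes half-up rounding is what is wanted.

```python
from decimal import Decimal, ROUND_DOWN, ROUND_HALF_UP


def _quantum(n):
    # Decimal("1E-n"): the smallest step to keep, e.g. 0.00001 for n = 5
    return Decimal(1).scaleb(-n)


def round_to(value, n):
    """Round value to n decimals, with ties going away from zero (assumed rounding mode)."""
    return Decimal(str(value)).quantize(_quantum(n), rounding=ROUND_HALF_UP)


def truncate_to(value, n):
    """Cut value off after n decimals without rounding (truncation toward zero)."""
    return Decimal(str(value)).quantize(_quantum(n), rounding=ROUND_DOWN)


if __name__ == "__main__":
    column = [1.234567, 2.5, -1.234567]
    print([round_to(v, 5) for v in column])
    print([truncate_to(v, 5) for v in column])
```

Applying either helper to every value of a column, as in the list comprehensions above, covers the "one or any number of columns" part of the question.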
R Script - gridSVG
Hello, It's been a little while since I used R and now I've forgotten how to export plots from R as SVG/PNG so that they can easily be displayed in RM Server without me needing to write to disk. My...
Using MetaCost and Naive Bayes together for Prediction
I have been working on a prediction model for customer churn. My attributes are polynominal and integer, and the label is binominal. I have nested the operators as such: X-Validation -> MetaCost ->...
Rounding and display in 2 decimal places
Hi, I am calculating some numeric values and noticed that the display is either integer or 3 decimals. Can someone kindly advise which operator I can use for the below? 1) Round to 2 decimal places and...
How to cut a string or digit from right side
Hello, I tried different ways to cut a string or digits from the right side, but I am unable to get it to work. For example, I have a column A. I need five digits like column B. Please help me out...
Create macros dynamically according to data
Hi RapidMiner, I'm doing predictive maintenance, I have a data table containing a list of fault types and their corresponding percentages (as below). What I want is to create a macro for each fault...
Writing a RapidMiner process to disk
I'd like to write my RapidMiner processes to disk (to add to source control). I was able to do this with the web apps using Open File (choosing the web app from the repository), Read Document (as...
Logistic Regression - Regularization issue
I have a data set x1 & x2 which cannot be separated by a straight line through the plot, so I need to map the features into polynomial terms of x1 & x2 up to the sixth power. I created the model...
Website-Content into one cell
Hello everyone, I want to use text mining methods on the lyrics of a website. What I have now is: Artist Song Lyrics The...
How can I compare decision tree and linear regression using Cross-Validated or...
Attached are some relevant pictures of my setup and stats on the target variable: 1) The setup of my .rmp, 2) Picture of the histogram of the target variable, 3) Plot of CO2 (target variable) vs....
Parallelization
Hi...so I know the old parallel processing extension is no longer functional and I understand that you're working on parallelization of other operators. I have a simple problem - I have 50,000...