classification

Visualizing Weka classification tree

做~自己de王妃 提交于 2019-12-17 22:38:26
问题 This question was migrated from Cross Validated because it can be answered on Stack Overflow. Migrated 7 years ago . I am using few data sets available online and trying to visualize tree. However, it does not let me visualize tree option at all. Could anyone please guide me how to get the tree diagram in weka by using data sets available online? 回答1: Look here http://maya.cs.depaul.edu/classes/ect584/weka/classify.html, for example. First you have to fit your decision tree (I used the J48

One-class classification with SVM in R

北战南征 提交于 2019-12-17 18:42:10
问题 I'm using the package e1071 in R in order to build a one-class SVM model. I don't know how to do that and I neither find any example on the Internet. Could someone give an example code to characterize, for example, the class "setosa" in the "iris" dataset with a one-class classification model and then test all the examples in the same dataset (in order to check what examples belong to the characterization of the "setosa" class and what examples not)? 回答1: I think this is what you want:

Recommended anomaly detection technique for simple, one-dimensional scenario?

被刻印的时光 ゝ 提交于 2019-12-17 17:32:11
问题 I have a scenario where I have several thousand instances of data. The data itself is represented as a single integer value. I want to be able to detect when an instance is an extreme outlier. For example, with the following example data: a = 10 b = 14 c = 25 d = 467 e = 12 d is clearly an anomaly, and I would want to perform a specific action based on this. I was tempted to just try an use my knowledge of the particular domain to detect anomalies. For instance, figure out a distance from the

How to perform logistic regression using vowpal wabbit on very imbalanced dataset

夙愿已清 提交于 2019-12-17 17:25:27
问题 I am trying to use vowpal wabbit for logistic regression. I am not sure if this is the right syntax to do it For training, I do ./vw -d ~/Desktop/new_data.txt --passes 20 --binary --cache_file cache.txt -f lr.vw --loss_function logistic --l1 0.05 For testing I do ./vw -d ~/libsvm-3.18_test/matlab/new_data_test.txt --binary -t -i lr.vw -p predictions.txt -r raw_score.txt Here is a snippet from my train data -1:1.00038 | 110:0.30103 262:0.90309 689:1.20412 1103:0.477121 1286:1.5563 2663:0.30103

What are advantages of Artificial Neural Networks over Support Vector Machines? [closed]

左心房为你撑大大i 提交于 2019-12-17 17:23:20
问题 As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance. Closed 7 years ago . ANN (Artificial Neural Networks) and SVM (Support Vector Machines) are two popular strategies for supervised machine learning and

How to create a class for non classified object in tensorflow?

一曲冷凌霜 提交于 2019-12-17 17:23:05
问题 Hi i have build my CNN with two classes dogs and cats, i have trained this and now i am able to classify dog and cat image. But what about if i want to introduce a class for new unclassified object? For example if i feed my network with a flower image's the network give me a wrong classification. I want to build my network with a third class for new unclassified object. But how can i build this third class. Which images i have to use to get class for new object that are different from dogs or

Difference between classification and clustering in data mining? [closed]

杀马特。学长 韩版系。学妹 提交于 2019-12-17 15:04:39
问题 Closed . This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this post. Closed 6 months ago . Can someone explain what the difference is between classification and clustering in data mining? If you can, please give examples of both to understand the main idea. 回答1: In general, in classification you have a set of predefined classes and want to know which class a new object

How to create a dendrogram with colored branches?

微笑、不失礼 提交于 2019-12-17 09:54:03
问题 This question was migrated from Cross Validated because it can be answered on Stack Overflow. Migrated 6 years ago . I would like to create a dendrogram in R which has colored branches, like the one shown below. So far I used following commands to create a standard dendrogram: d <- dist(as.matrix(data[,29])) # find distance matrix hc <- hclust(d) # apply hirarchical clustering plot(hc,labels=data[,1], main="", xlab="") # plot the dendrogram How should I modify this code to obtain desired

Recognise an arbitrary date string

前提是你 提交于 2019-12-17 09:47:17
问题 I need to be able to recognise date strings. It doesn't matter if I can not distinguish between month and date (e.g. 12/12/10), I just need to classify the string as being a date, rather than converting it to a Date object. So, this is really a classification rather than parsing problem. I will have pieces of text such as: "bla bla bla bla 12 Jan 09 bla bla bla 01/04/10 bla bla bla" and I need to be able to recognise the start and end boundary for each date string within. I was wondering if

Finding K-nearest neighbors and its implementation

房东的猫 提交于 2019-12-17 02:13:27
问题 I am working on classifying simple data using KNN with Euclidean distance. I have seen an example on what I would like to do that is done with the MATLAB knnsearch function as shown below: load fisheriris x = meas(:,3:4); gscatter(x(:,1),x(:,2),species) newpoint = [5 1.45]; [n,d] = knnsearch(x,newpoint,'k',10); line(x(n,1),x(n,2),'color',[.5 .5 .5],'marker','o','linestyle','none','markersize',10) The above code takes a new point i.e. [5 1.45] and finds the 10 closest values to the new point.