image-recognition

Finding repeating patterns/images in images

魔方 西西 submitted on 2019-12-02 20:37:46
I've been looking around for papers (or any information) on this topic. To avoid a misunderstanding: I'm not talking about finding a supplied pattern in multiple locations. Repeating patterns can also be understood to mean repeating images. The definition of pattern here isn't abstract. Imagine, for instance, a brick wall. The wall is composed of individual bricks. A picture of the wall is composed of the repeating image of a brick. The solution should preferably find the largest repeating pattern. Large in this context can be defined in two ways: pixel area or number of repetitions. In the above example

How to locate multiple objects in the same image?

99封情书 submitted on 2019-12-02 19:41:20
I am new to TensorFlow. I am currently working through the "Convolutional Neural Network" classification examples on the TensorFlow website, which explain how to classify input images into pre-defined classes, but I can't figure out how to locate multiple objects in the same image. For example, given an input image with a cat and a dog, I want my graph to report in its output that both of them, "a cat and a dog", are in the image. Great question. Detecting multiple objects in the same image is essentially a "segmentation problem". Two nice and popular algorithms

free OCR processing API in PHP/jQuery/JavaScript [closed]

本小妞迷上赌 submitted on 2019-12-02 18:25:02
Is there a free OCR API that is well documented for PHP, jQuery or JavaScript? I'm programming a web app for iOS and I'd like to use the new photo upload API in iOS 6 Safari for OCR image processing. Thanks a ton! :) (Edit: Oh, and a Google Goggles-like API for image recognition would also suffice; it doesn't have to be OCR, though I would prefer it. :) Tesseract is really simple to use. Someone has even written a PHP wrapper for it so you won't have to deal with the exec() command. Have a look at it here: http://github.com/thiagoalessio/tesseract-ocr-for-php You can try http

What are good features for classifying photos of clothing? [closed]

天大地大妈咪最大 submitted on 2019-12-02 17:19:07
I want to build a clothing classifier that takes a photo of an item of clothing and classifies it as 'jeans', 'dress', 'trainers' etc. Some examples: These images are from retailer websites, so are typically taken from the same angle, typically on a white or pale background -- they tend to be very similar. I have a set of several thousand images whose category I already know, which I can use to train a machine-learning algorithm. However, I'm struggling for ideas of what features I should use. The features I have so far: def get_aspect_ratio(pil_image): _, _, width, height = pil_image.getbbox(

Alternatives to face.com API

陌路散爱 submitted on 2019-12-02 15:16:27
Sadly, the face.com API is being shut down due to its acquisition by Facebook. Are there any decent alternatives out there? I'm looking to check whether a given image contains a face, plus demographic information about it. bcmoney Sad news indeed. If you have a good reason and already have an account, they may extend you until October 2012: http://developers.face.com/extension-request/ For everyone else, here's a list of potential alternatives I've compiled: OpenCV: http://opencv.willowgarage.com/ (C/C++, but at least it's open source, standalone and downloadable) Viewdle (Objective-C)... bought/axed by

Get time from the clock image in Matlab - Image Processing

夙愿已清 submitted on 2019-12-02 10:20:25
clock http://img710.imageshack.us/img710/2623/circlock.jpg I do not know how to locate the tips of the clock hands. Once their coordinates are known, I can work out the time shown on the clock. Please help me solve this problem. Thank you very much! Test points on a circle of radius M around the centre of the clock. (You could use a for loop to vary the angle from -pi to pi.) First set M so that the long hand touches the circle but the short hand does not. You will find a region of black points, and these reveal the angle of the long hand. Then repeat with smaller M, so your circle
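The first step of that answer (sampling a circle that only the long hand reaches) can be sketched in plain Python. This is a minimal illustration, not the answerer's code: the image is assumed to be a list of rows of 0/1 values with 1 meaning a dark pixel, the clock centre is assumed to be known, and the names dark_angles_on_circle and mean_angle are hypothetical. Angles are measured in image coordinates, where y grows downward.

```python
import math

def dark_angles_on_circle(image, center, radius, steps=360):
    """Return the angles (radians, from -pi to pi) where the circle hits a dark pixel."""
    cx, cy = center
    hits = []
    for i in range(steps):
        angle = -math.pi + 2 * math.pi * i / steps
        x = int(round(cx + radius * math.cos(angle)))
        y = int(round(cy + radius * math.sin(angle)))
        # Skip sample points that fall outside the image.
        if 0 <= y < len(image) and 0 <= x < len(image[0]) and image[y][x]:
            hits.append(angle)
    return hits

def mean_angle(angles):
    """Circular mean, so hits on both sides of +/-pi average correctly."""
    return math.atan2(sum(math.sin(a) for a in angles),
                      sum(math.cos(a) for a in angles))

# Synthetic 41x41 test image (an assumption for the demo): a long hand along
# angle 0 and a short hand along angle pi/2, both starting at the centre.
image = [[0] * 41 for _ in range(41)]
for r in range(19):
    image[20][20 + r] = 1      # long hand, 18 pixels
for r in range(9):
    image[20 + r][20] = 1      # short hand, 8 pixels

# Radius 15 is longer than the short hand, so only the long hand is hit.
hits = dark_angles_on_circle(image, center=(20, 20), radius=15)
long_hand_angle = mean_angle(hits)
```

On a real photograph the image would first need thresholding to black and white, and the radius would have to be chosen from the actual hand lengths.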

detecting shapes in a bitmap

不问归期 submitted on 2019-12-02 03:22:35
Question: I am looking for a good algorithm to do the following: I have a black-and-white image comprised solely of polygonal shapes, which I got by scanning a document. I would like to find all the shapes in this image, smooth the edges, and in the end get the coordinates of the smoothed shapes. Are there any good algorithms for doing the search and smoothing to get perfect shapes in the end? Answer 1: I would start with connected component labeling in order to separate the different polygons. A simple solution for smoothing
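Connected component labeling, the first step suggested in that answer, can be sketched as a breadth-first flood fill. This is a minimal plain-Python illustration under stated assumptions: the image is a list of rows of 0/1 values with 1 as foreground, pixels are joined by 4-connectivity, and the function name label_components is hypothetical. Image libraries provide faster equivalents.

```python
from collections import deque

def label_components(image):
    """Label 4-connected foreground regions of a binary image.

    Returns (labels, count): labels has the same shape as image, with 0 for
    background and 1..count identifying each connected component.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    labels = [[0] * width for _ in range(height)]
    count = 0
    for y in range(height):
        for x in range(width):
            if not image[y][x] or labels[y][x]:
                continue  # background, or already part of a labeled component
            count += 1
            labels[y][x] = count
            queue = deque([(y, x)])
            while queue:
                cy, cx = queue.popleft()
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and image[ny][nx] and not labels[ny][nx]):
                        labels[ny][nx] = count
                        queue.append((ny, nx))
    return labels, count

shapes = [
    [1, 1, 0, 0],
    [1, 0, 0, 1],
    [0, 0, 1, 1],
]
labels, count = label_components(shapes)
```

Each label then selects the pixels of one polygon, which can be processed separately in the later smoothing step.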

How add new class in saved keras sequential model

纵然是瞬间 submitted on 2019-12-02 02:01:28
Question: I have a 10-class dataset on which I got 85% accuracy, and I get the same accuracy from the saved model. Now I want to add a new class to the saved model. How can I do that? I tried deleting the last layer and retraining, but the model overfits, and at prediction time every image gets the same result (the newly added class). This is what I did: model.pop() base_model_layers = model.output pred = Dense(11, activation='softmax')(base_model_layers) model = Model(inputs=model.input, outputs=pred) # compile and fit step

How to get a position of custom object on image using vision recognition api

夙愿已清 submitted on 2019-12-01 21:14:10
I know there are many vision recognition APIs, such as Clarifai, Watson, Google Cloud Vision and Microsoft Cognitive Services, that recognize image content. The response of these services is simple JSON that contains different tags, for example { man: 0.9969295263290405, portrait: 0.9949591159820557, face: 0.9261120557785034 } The problem is that I need to know not only what is in the image but also the position of that object. Some of those APIs have such a feature, but only for face detection. So does anyone know if there is such an API, or do I need to train my own Haar cascades on OpenCV

InvalidArgumentError: Expected dimension in the range [-1, 1) but got 1

左心房为你撑大大i submitted on 2019-12-01 16:31:26
I'm not sure what this error means. It occurs when I try to calculate acc: acc = accuracy.eval(feed_dict = {x: batch_images, y: batch_labels, keep_prob: 1.0}) I've tried looking up solutions, but I couldn't find any online. Any ideas on what's causing my error? Here's a link to my full code. I had a similar error, but the problem for me was that I was trying to use argmax on a one-dimensional vector. The shape of my label was (50,) and I was trying to do a tf.argmax(y,1) on that when evaluating. The solution reference is Tensorflow: I get something wrong in accuracy The source code
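The point in that answer is that an argmax along axis 1 only makes sense for two-dimensional data: labels of shape (50,) are already class indices and have no axis 1 to reduce. The plain-Python sketch below illustrates the distinction without TensorFlow. The helper names argmax_rows and accuracy are hypothetical, and this is not the code from the question.

```python
def argmax_rows(matrix):
    """Index of the largest value in each row (an argmax along axis 1 of 2-D data)."""
    return [max(range(len(row)), key=row.__getitem__) for row in matrix]

def accuracy(logits, labels):
    """Fraction of rows whose predicted class matches the label.

    labels may be one-hot rows (2-D) or plain class indices (1-D); only the
    one-hot form needs an argmax along axis 1.
    """
    predicted = argmax_rows(logits)
    if labels and isinstance(labels[0], (list, tuple)):
        expected = argmax_rows(labels)   # one-hot rows: reduce along axis 1
    else:
        expected = list(labels)          # class indices: there is no axis 1
    matches = sum(p == e for p, e in zip(predicted, expected))
    return matches / len(expected)

logits = [[0.1, 0.9], [0.8, 0.2], [0.3, 0.7]]
index_labels = [1, 0, 0]                   # 1-D labels, like shape (50,) in the answer
one_hot_labels = [[0, 1], [1, 0], [1, 0]]  # 2-D labels, where an axis-1 argmax is valid

acc_from_indices = accuracy(logits, index_labels)
acc_from_one_hot = accuracy(logits, one_hot_labels)
```

In a TensorFlow graph the corresponding fix would presumably be to compare the axis-1 argmax of the predictions against the 1-D labels directly, rather than applying an argmax to the labels as well.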