識別圖片中是否有logo,實作方案主要有如下4種方案:
目錄
1.通過logo庫來判斷圖片中是否有logo庫中的logo
2.識別圖片中的文字,只要有文字,就假設有logo,再進行二次校驗
3.識別圖片中的文字,通過logo文字庫,來判斷是否是logo
4.直接識別logo,但需要通過大量帶logo圖片進行訓練,來實作識別的準確性
1.通過logo庫來判斷圖片中是否有logo庫中的logo
百度logo識別介面
介面地址參考:https://cloud.baidu.com/doc/IMAGERECOGNITION/s/Ok3bcxc59

阿里logo識別介面
阿里logo識別地址參考:https://help.aliyun.com/knowledge_detail/155012.html

缺點:這種方式依賴logo庫
2.識別圖片中的文字,只要有文字,就假設有logo,再進行二次校驗
識別圖片中的文字,目前不管自己用Java/Python寫代碼識別圖片中文字,還是呼叫第三方介面,技術都已比較成熟,
下面以其中一種方式舉例說明:
直接上代碼:
public static String generalBasic(String filePath,String accessToken) {
// 請求url
String url = "https://aip.baidubce.com/rest/2.0/ocr/v1/general_basic";
try {
byte[] imgData = FileUtil.readFileByBytes(filePath);
String imgStr = Base64Util.encode(imgData);
String imgParam = URLEncoder.encode(imgStr, "UTF-8");
String param = "image=" + imgParam;
String result = HttpUtil.post(url, accessToken, param);
return getWords(result);
} catch (Exception e) {
e.printStackTrace();
}
return null;
}
public static void main(String[] args) {
String filePath = "D:\\1.jpg";
String accessToken = "百度api的token";
String word=generalBasic(filePath,accessToken);
System.out.println(word);
}
其他Java類如下:
編碼處理類:
package com.example.demo.ocr.common;
/**
* Base64 工具類
*/
public class Base64Util {
private static final char last2byte = (char) Integer.parseInt("00000011", 2);
private static final char last4byte = (char) Integer.parseInt("00001111", 2);
private static final char last6byte = (char) Integer.parseInt("00111111", 2);
private static final char lead6byte = (char) Integer.parseInt("11111100", 2);
private static final char lead4byte = (char) Integer.parseInt("11110000", 2);
private static final char lead2byte = (char) Integer.parseInt("11000000", 2);
private static final char[] encodeTable = new char[]{'A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M', 'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z', 'a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z', '0', '1', '2', '3', '4', '5', '6', '7', '8', '9', '+', '/'};
public Base64Util() {
}
public static String encode(byte[] from) {
StringBuilder to = new StringBuilder((int) ((double) from.length * 1.34D) + 3);
int num = 0;
char currentByte = 0;
int i;
for (i = 0; i < from.length; ++i) {
for (num %= 8; num < 8; num += 6) {
switch (num) {
case 0:
currentByte = (char) (from[i] & lead6byte);
currentByte = (char) (currentByte >>> 2);
case 1:
case 3:
case 5:
default:
break;
case 2:
currentByte = (char) (from[i] & last6byte);
break;
case 4:
currentByte = (char) (from[i] & last4byte);
currentByte = (char) (currentByte << 2);
if (i + 1 < from.length) {
currentByte = (char) (currentByte | (from[i + 1] & lead2byte) >>> 6);
}
break;
case 6:
currentByte = (char) (from[i] & last2byte);
currentByte = (char) (currentByte << 4);
if (i + 1 < from.length) {
currentByte = (char) (currentByte | (from[i + 1] & lead4byte) >>> 4);
}
}
to.append(encodeTable[currentByte]);
}
}
if (to.length() % 4 != 0) {
for (i = 4 - to.length() % 4; i > 0; --i) {
to.append("=");
}
}
return to.toString();
}
}
讀圖片類:
/**
* 根據檔案路徑讀取byte[] 陣列
*/
public static byte[] readFileByBytes(String filePath) throws IOException {
File file = new File(filePath);
if (!file.exists()) {
throw new FileNotFoundException(filePath);
} else {
ByteArrayOutputStream bos = new ByteArrayOutputStream((int) file.length());
BufferedInputStream in = null;
try {
in = new BufferedInputStream(new FileInputStream(file));
short bufSize = 1024;
byte[] buffer = new byte[bufSize];
int len1;
while (-1 != (len1 = in.read(buffer, 0, bufSize))) {
bos.write(buffer, 0, len1);
}
byte[] var7 = bos.toByteArray();
return var7;
} finally {
try {
if (in != null) {
in.close();
}
} catch (IOException var14) {
var14.printStackTrace();
}
bos.close();
}
}
}
HttpUtil請求類:
/**
* http 工具類
*/
public class HttpUtil {
public static String post(String requestUrl, String accessToken, String params)
throws Exception {
String contentType = "application/x-www-form-urlencoded";
return HttpUtil.post(requestUrl, accessToken, contentType, params);
}
public static String post(String requestUrl, String accessToken, String contentType, String params)
throws Exception {
String encoding = "UTF-8";
if (requestUrl.contains("nlp")) {
encoding = "GBK";
}
return HttpUtil.post(requestUrl, accessToken, contentType, params, encoding);
}
public static String post(String requestUrl, String accessToken, String contentType, String params, String encoding)
throws Exception {
String url = requestUrl + "?access_token=" + accessToken;
return HttpUtil.postGeneralUrl(url, contentType, params, encoding);
}
public static String postGeneralUrl(String generalUrl, String contentType, String params, String encoding)
throws Exception {
URL url = new URL(generalUrl);
// 打開和URL之間的連接
HttpURLConnection connection = (HttpURLConnection) url.openConnection();
connection.setRequestMethod("POST");
// 設定通用的請求屬性
connection.setRequestProperty("Content-Type", contentType);
connection.setRequestProperty("Connection", "Keep-Alive");
connection.setUseCaches(false);
connection.setDoOutput(true);
connection.setDoInput(true);
// 得到請求的輸出流物件
DataOutputStream out = new DataOutputStream(connection.getOutputStream());
out.write(params.getBytes(encoding));
out.flush();
out.close();
// 建立實際的連接
connection.connect();
// 獲取所有回應頭欄位
Map<String, List<String>> headers = connection.getHeaderFields();
// 遍歷所有的回應頭欄位
// for (String key : headers.keySet()) {
// System.err.println(key + "--->" + headers.get(key));
// }
// 定義 BufferedReader輸入流來讀取URL的回應
BufferedReader in = null;
in = new BufferedReader(
new InputStreamReader(connection.getInputStream(), encoding));
String result = "";
String getLine;
while ((getLine = in.readLine()) != null) {
result += getLine;
}
in.close();
// System.err.println("result:" + result);
return result;
}
}
獲取百度token:
import org.json.JSONObject;
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URL;
import java.util.List;
import java.util.Map;
/**
* 獲取token類
*/
public class AuthService {
public static void main(String[] args){
String token=getAuth();
System.out.println(token);
}
/**
* 獲取權限token
* @return 回傳示例:
* {
* "access_token": "24.460da4889caad24cccdb1fea17221975.2592000.1491995545.282335-1234567",
* "expires_in": 2592000
* }
*/
public static String getAuth() {
// 官網獲取的 API Key 更新為你注冊的
String clientId = "您的API Key";
// 官網獲取的 Secret Key 更新為你注冊的
String clientSecret = "您的Secret Key";
return getAuth(clientId, clientSecret);
}
/**
* 獲取API訪問token
* 該token有一定的有效期,需要自行管理,當失效時需重新獲取.
* @param ak - 百度云官網獲取的 API Key
* @param sk - 百度云官網獲取的 Securet Key
* @return assess_token 示例:
* "24.460da4889caad24cccdb1fea17221975.2592000.1491995545.282335-1234567"
*/
public static String getAuth(String ak, String sk) {
// 獲取token地址
String authHost = "https://aip.baidubce.com/oauth/2.0/token?";
String getAccessTokenUrl = authHost
// 1. grant_type為固定引數
+ "grant_type=client_credentials"
// 2. 官網獲取的 API Key
+ "&client_id=" + ak
// 3. 官網獲取的 Secret Key
+ "&client_secret=" + sk;
try {
URL realUrl = new URL(getAccessTokenUrl);
// 打開和URL之間的連接
HttpURLConnection connection = (HttpURLConnection) realUrl.openConnection();
connection.setRequestMethod("GET");
connection.connect();
// 獲取所有回應頭欄位
Map<String, List<String>> map = connection.getHeaderFields();
// 遍歷所有的回應頭欄位
for (String key : map.keySet()) {
System.err.println(key + "--->" + map.get(key));
}
// 定義 BufferedReader輸入流來讀取URL的回應
BufferedReader in = new BufferedReader(new InputStreamReader(connection.getInputStream()));
String result = "";
String line;
while ((line = in.readLine()) != null) {
result += line;
}
/**
* 回傳結果示例
*/
System.err.println("result:" + result);
JSONObject jsonObject = new JSONObject(result);
String access_token = jsonObject.getString("access_token");
return access_token;
} catch (Exception e) {
System.err.printf("獲取token失敗!");
e.printStackTrace(System.err);
}
return null;
}
}
百度介面呼叫及簡單二次開發:
private static String getWords(String json) {
JSONObject obj=new JSONObject(json);
Integer words_result_num=obj.getInt("words_result_num");
if(words_result_num>0){
JSONArray list=obj.getJSONArray("words_result");
String words="";
for(int i=0;i<list.length();i++){
JSONObject obj2=list.getJSONObject(i);
words=words+("".equals(words)?"":",")+obj2.get("words");
//System.out.println(list.get(i));
//System.out.println(obj2.get("words"));
}
return words;
}
return "";
}
public static String generalBasic(String filePath,String accessToken) {
// 請求url
String url = "https://aip.baidubce.com/rest/2.0/ocr/v1/general_basic";
try {
byte[] imgData = FileUtil.readFileByBytes(filePath);
String imgStr = Base64Util.encode(imgData);
String imgParam = URLEncoder.encode(imgStr, "UTF-8");
String param = "image=" + imgParam;
String result = HttpUtil.post(url, accessToken, param);
return getWords(result);
} catch (Exception e) {
e.printStackTrace();
}
return null;
}
下面拿2張圖,識別出的logo舉例:

這張圖,識別出“hzfwjgs cn alibaba. com”,如下字串:


這張圖,識別出“分南寧市第一中學”,如下字串:

完整代碼地址 http://www.zrscsoft.com/sitepic/12130.html
3.識別圖片中的文字,通過logo文字庫,來判斷是否是logo
這種方式,是在方案2基礎上繼續延續的方案,
主要分三步實作:
1)、識別圖片中的文字
詳細見方案2,這里略,
2)、建立logo文字庫
建立logo文字庫,細節就不詳細說,舉個例子吧,
logo文字庫如“www”,“com”,“網”,“媒體”,“新聞”等等,
3)、使用logo文字庫來判斷圖片是否帶logo
根據圖片中的文字,檢索logo文字庫中的文字,例如圖片中帶“www”,“com”,“網”,“媒體”,“新聞”就判斷為是帶logo的圖片
完整專案地址:http://www.zrscsoft.com/sitepic/12130.html
4.直接識別logo,但需要通過大量帶logo圖片進行訓練,來實作識別的準確性
這種方式,目前還沒有找到實作方法,如果有實作,歡迎在評論區留言,
轉載請註明出處,本文鏈接:https://www.uj5u.com/houduan/262906.html
標籤:java
上一篇:Spring回圈依賴問題的解決
下一篇:Java開發的環境搭建
