Questions tagged [audio-fingerprinting]



132 questions

0 votes

0 answers

143 views

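audio - Audio fingerprinting: can I get exact audio timecodes using tools like Echoprint or AcoustID?

I understand that landmark-based fingerprinting uses time and frequency, but can I get the timecodes/offsets of the landmarks as output from the open-source tools? Or is there something else that can sync content to the audio?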
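To illustrate what "getting an offset" means in a landmark-based scheme, here is a minimal sketch of offset voting. It assumes each landmark has already been reduced to a `(hash, time)` pair with time in frames; the function name `estimate_offset` and that data layout are hypothetical and are not the output format of Echoprint or AcoustID.

```python
from collections import Counter


def estimate_offset(query_landmarks, reference_landmarks):
    """Vote for the time offset that aligns a query with a reference.

    Both arguments are lists of (hash, time) pairs (assumed layout).
    Returns (offset, votes), where offset = reference_time - query_time
    for the most frequently observed difference, or (None, 0) if no
    hash is shared.
    """
    # Index the reference: hash -> all times at which it occurs.
    ref_times = {}
    for h, t in reference_landmarks:
        ref_times.setdefault(h, []).append(t)

    # Every matching hash casts one vote for a candidate offset.
    votes = Counter()
    for h, t_query in query_landmarks:
        for t_ref in ref_times.get(h, []):
            votes[t_ref - t_query] += 1

    if not votes:
        return None, 0
    return votes.most_common(1)[0]


reference = [(1, 0), (2, 5), (3, 10), (4, 15)]
query = [(2, 0), (3, 5), (4, 10), (9, 3)]
print(estimate_offset(query, reference))
```

A true match tends to produce one dominant offset, while chance hash collisions scatter their votes, so the winning offset can serve as the sync point between the clip and the reference audio.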

2015-12-10T13:52:32.160

0 votes

0 answers

481 views

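solr - echoprint server - fingerprint search takes 2-3 seconds

We are facing high Solr query times for fingerprint matching. Here are our setup details:

echonest/echoprint-server runs on a single node (solr 1.0) on an Amazon EC2 m3.2xlarge instance with 30 GB RAM and 8 cores.
We ingested 2.5 million tracks, giving a solr 1.0 index of around 91 GB (segment count 19933333).
We applied the HashQueryComponent.java optimization: https://github.com/playax/echoprint-server/commit/706d26362bbe9141203b2b6e7846684e7a417616#diff-f9e19e870c128c0d64915f304cf43677
We also tried capturing statistics for the eval method; some loop iterations over the index reader's sequential sub-readers took more than 1 second to iterate over all the terms.

Any suggestions or pointers in the right direction would be very helpful.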

solr audio-fingerprinting echonest echoprint

2016-01-04T12:40:20.507

0 votes

1 answer

2219 views

machine-learning - How to train a machine learning algorithm using MFCC coefficient vectors?

For my final-year project I am trying to identify dog/bark/bird sounds in real time (by recording sound clips). I am using MFCCs as the audio features. So far I have extracted 12 MFCC values in total from a sound clip using the jAudio library. Now I am trying to train a machine learning algorithm (I have not decided on the algorithm yet, but it will most probably be an SVM). Each sound clip is around 3 seconds long. I need to clarify a few things about this process:

Do I have to train the algorithm using frame-based MFCCs (12 per frame) or overall clip-based MFCCs (12 per sound clip)?
To train the algorithm, do I have to treat the 12 MFCCs as 12 different attributes, or treat those 12 MFCCs as one attribute?

These are the overall MFCCs for the clip:

-9.598802712290967 -21.644963856237265 -7.405551798816725 -11.638107212413201 -19.441831623156144 -2.780967392843105 -0.5792847321137902 -13.14237288849559 -4.920408873192934 -2.7111507999281925 -7.336670942457227 2.4687330348335212

Any help in overcoming these problems would be really appreciated. I couldn't find much useful help on Google. :)
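One common way to reconcile the two options in the question (not necessarily the best one for this dataset) is to compute MFCCs per frame and then summarize each coefficient over the whole clip, so that every clip becomes one fixed-length row in which each statistic is its own attribute. The sketch below assumes the per-frame MFCCs are already available as lists of floats; `clip_features` is a hypothetical helper and is not part of jAudio.

```python
from statistics import fmean, pstdev


def clip_features(frames):
    """Summarize per-frame MFCC vectors into one clip-level vector.

    frames: list of equal-length lists, one MFCC vector per frame.
    Returns the per-coefficient means followed by the per-coefficient
    (population) standard deviations, so 12 MFCCs give 24 attributes.
    """
    if not frames:
        raise ValueError("need at least one frame")
    # Transpose: one tuple per coefficient, holding its value in every frame.
    columns = list(zip(*frames))
    means = [fmean(column) for column in columns]
    stds = [pstdev(column) for column in columns]
    return means + stds


# Two frames with two coefficients each, to keep the example small.
print(clip_features([[1.0, 2.0], [3.0, 6.0]]))
```

Each returned number would be a separate column of the training matrix handed to the classifier, with one row per labelled clip.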

machine-learning signal-processing audio-processing mfcc audio-fingerprinting

2016-02-07T12:05:57.167

0 votes

0 answers

1719 views

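java - How to compare two WAV audio files and generate a score?

What I tried:
I tried using the musicg API to compare (fingerprint) two audio songs (without background music).

The code is as follows:
file1.wav is the master file and file2.wav is the recorded file (or vice versa); both are the same song sung by the same person (without background music).

But even though both songs are sung by the same person in the same way, it does not compare the two files correctly and gives a very low score (0.099384).

How can I solve this?
Is there any other Java API that would help do the same thing?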

java audio audio-fingerprinting musicg

2016-02-07T15:06:52.810

0 votes

1 answer

960 views

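android - How to generate a key with KeyGenerator for the Fingerprint API in Android

I am trying to implement the Fingerprint API in my app. For this, I am following Google's Fingerprint Dialog sample.

It works fine if my app has compileSdkVersion=23 and minSdkVersion=23. However, my app's compileSdkVersion is 21 and its minSdkVersion is 14. For this I use FingerprintManagerCompat instead of FingerprintManager, which works fine, but the problem is with key generation.

The Keystore package and its classes cannot generate the key; all supported key generation algorithms are available only from API level 18+. Can anyone guide me on how to generate a key that supports lower versions, please?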

android android-6.0-marshmallow audio-fingerprinting key-generator

2016-02-11T14:43:01.113

0 votes

1 answer

508 views

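ios - Reading a file and performing fingerprinting with the Gracenote SDK (GNSDK)

I am trying to read a 16-bit, 16 kHz WAV file (20 seconds long) on iOS and perform fingerprinting. I tried following the guide "GNSDK-for-Mobile-iOS-Developers-Guide", but it has no Objective-C example on page 53 (C# only). My code reads in the whole file and then writes 2K chunks to the fingerprint write function (I also tried passing a pointer to the entire file). The variable "result" is never set to 1, which indicates that not enough data has been passed, but 20 seconds should be enough, right? Can anyone help? Thanks.

This is what I did: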

ios gracenote audio-fingerprinting

2016-02-16T15:20:00.660

0 votes

2 answers

663 views

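python - How to search an acoustid database of over 30 million rows more efficiently?

I am currently playing with an open-source music recognition project called acoustid. I have imported a table with over 30 million rows (300 GB of data), but simply selecting those rows takes a huge amount of time. Currently, selecting 200,000 rows can take 30 seconds.

The project provides acoustid-index, which indexes the rows by taking only the first 15 seconds of each fingerprint and storing that on disk, and then loads it into memory. https://bitbucket.org/acoustid/acoustid-index/overview

The only problem is that I don't know how to use it. The instructions are confusing. It seems to have been created for PostgreSQL. The server I am working on uses MySQL and Python. Can I still use it to index my database?

Any suggestions on how to use it to index the rows in my database? Are there other ways to make searches through this database more efficient?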
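To make the indexing idea concrete, the following is a minimal in-memory sketch of an inverted index over fingerprint hash values. It only illustrates the general technique; it is not acoustid-index itself, and the names `build_index` and `search` as well as the dict-of-lists input layout are hypothetical.

```python
from collections import Counter, defaultdict


def build_index(fingerprints):
    """Map each hash value to the set of track ids that contain it.

    fingerprints: dict of track_id -> list of integer hash values
    (for example, the hashes covering the first seconds of a track).
    """
    index = defaultdict(set)
    for track_id, hashes in fingerprints.items():
        for h in hashes:
            index[h].add(track_id)
    return index


def search(index, query_hashes, top_n=3):
    """Rank tracks by how many distinct query hashes they share."""
    hits = Counter()
    for h in set(query_hashes):
        for track_id in index.get(h, ()):
            hits[track_id] += 1
    return hits.most_common(top_n)


tracks = {"a": [1, 2, 3, 4], "b": [3, 4, 5, 6], "c": [7, 8]}
index = build_index(tracks)
print(search(index, [3, 4, 5]))
```

The point of the structure is that a lookup touches only the rows sharing at least one hash with the query, instead of scanning the full table; the short candidate list can then be fetched from the relational database by primary key for a detailed comparison.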

python mysql postgresql relational-database audio-fingerprinting

2016-03-15T21:36:35.457

0 votes

1 answer

1496 views

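python - Recording and recognizing music from a URL

I am using Dejavu, an open-source audio fingerprinting platform in Python, which can recognize music from disk and from the microphone. I have tested recognition from disk and it is great: 100% accuracy.

I am looking for help on adding a class "BroadcastRecognizer" that would recognize music from the URL of an online stream, for example an online radio station [ http://bbcmedia.ic.llnwd.net/stream/bbcmedia_radio1_mf_p]. Because the music in a radio stream changes constantly, I would like it to run recognition every 10 seconds.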
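The "every 10 seconds" part can be sketched independently of the recognizer: read the stream as bytes and cut it into fixed-size windows. The sketch below assumes a file-like object with a `read()` method and a roughly constant bitrate (for example, 128 kbit/s is about 16000 bytes per second); `iter_windows` is a hypothetical helper, and handing each window to Dejavu is left out because that step depends on how the recognizer accepts audio.

```python
import io


def iter_windows(stream, window_size, read_size=4096):
    """Yield consecutive byte windows of window_size from a stream.

    stream: any file-like object whose read(n) returns bytes and
    returns b"" at end of stream. A final shorter window is yielded
    if the stream ends in the middle of a window.
    """
    buffer = bytearray()
    while True:
        data = stream.read(read_size)
        if not data:
            break
        buffer.extend(data)
        # Emit every complete window currently held in the buffer.
        while len(buffer) >= window_size:
            yield bytes(buffer[:window_size])
            del buffer[:window_size]
    if buffer:
        yield bytes(buffer)


# Stand-in for a network stream: 25 bytes cut into 10-byte windows.
fake_stream = io.BytesIO(b"x" * 25)
print([len(window) for window in iter_windows(fake_stream, 10)])
```

For a real broadcast, `window_size` would be the assumed bytes per second multiplied by 10, and each yielded window would be decoded and passed to the recognition step.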

Here is recognize.py

Here is dejavu.py

python json numpy audio audio-fingerprinting

2016-05-28T08:58:56.853

0 votes

1 answer

324 views

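android - How Live Lyrics work in content recognition

Many song recognition platforms already offer a feature that shows live, synchronized lyrics for the recognized song. This involves displaying the lyrics in real time and highlighting the part of the lyrics currently being played in the song.

It may also involve using a corresponding lyrics file with timestamps to determine which line to highlight. Sample LRC file

How can the metadata in the recognition result be used to achieve this? Here is a sample recognition result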
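As a rough sketch of the synchronization step: if the recognition result gives the matched position within the track, the current playback position is that position plus the time elapsed since recognition, and the line to highlight is the last LRC entry whose timestamp is not later than it. The code below assumes simple `[mm:ss.xx]text` lines with one timestamp each; `parse_lrc` and `current_line` are hypothetical helpers, and no particular field name of the recognition result is assumed.

```python
import bisect
import re

# Matches "[mm:ss]" or "[mm:ss.xx]" followed by the lyric text.
LRC_LINE = re.compile(r"\[(\d+):(\d+(?:\.\d+)?)\](.*)")


def parse_lrc(text):
    """Return a sorted list of (seconds, lyric) from LRC text.

    Metadata tags such as [ar:Artist] are skipped because they do
    not start with a numeric timestamp.
    """
    entries = []
    for line in text.splitlines():
        match = LRC_LINE.match(line.strip())
        if match:
            minutes, seconds, lyric = match.groups()
            entries.append((int(minutes) * 60 + float(seconds), lyric.strip()))
    entries.sort()
    return entries


def current_line(entries, position_seconds):
    """Return the lyric active at position_seconds, or None before the first line."""
    times = [t for t, _ in entries]
    i = bisect.bisect_right(times, position_seconds) - 1
    return entries[i][1] if i >= 0 else None


lrc = "[ar:Artist]\n[00:01.00]first\n[00:05.50]second\n[01:00.00]third"
entries = parse_lrc(lrc)
print(current_line(entries, 6.0))
```

In an app, `position_seconds` would be recomputed on a timer as the matched offset plus the elapsed time, and the highlighted line updated whenever `current_line` returns a different value.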

android json python-2.7 audio-fingerprinting

2016-06-09T20:57:20.153

0 votes

2 answers

2247 views

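audio-fingerprinting - Open-source audio fingerprinting for broadcast mp3

I am creating an application to identify songs from several web radio feeds, and I need to parse the feeds to find the songs. I found a paid service (ACRCloud) that lets you do this, but the cost is high given the number of radio stations I want to analyze. So I searched a bit and found Echoprint. Can I use it for my purpose? Would it be suitable? Why can't I find any documentation... I don't know, maybe there is a better solution? Thanks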

audio-fingerprinting echoprint

2016-07-25T11:24:39.587
