我必须使用机器学习技术自动分析 json 日志文件,以便更好地了解用户行为并识别经常执行的操作以提高应用程序的可用性。日志文件包含文件系统事件、鼠标事件、键盘事件、应用程序事件(启动、终止、活动、停用……)等条目,这些条目在使用特定应用程序期间记录。
记录的条目如下所示:
{
"timestamp" : "20150427T100724.571637",
"entry" :
{
"eventType" : 1,
"eventTypeString" : "created",
"nodePath" : "/Applications/Firefox.app",
"nodeType" : 3,
"nodeTypeString" : "directory"
},
"entryType" : 4
},
{
"timestamp" : "20150427T100729.072595",
"entry" :
{
"eventType" : 2,
"eventTypeString" : "clickRight",
"mouseType" : 1,
"mouseTypeString" : "local",
"positionX" : 2517,
"positionY" : 166,
"scrollWheel" : 0.0
},
"entryType" : 1
},
{
"timestamp" : "20150427T100730.270000",
"entry" :
{
"eventType" : 2,
"eventTypeString" : "unmounted",
"volumeName" : "Firefox",
"volumePath" : "/Volumes/Firefox"
},
"entryType" : 9
},
{
"timestamp" : "20150427T100730.281202",
"entry" :
{
"eventType" : 2,
"eventTypeString" : "removed",
"nodePath" : "/Volumes/Firefox",
"nodeType" : 3,
"nodeTypeString" : "directory"
},
"entryType" : 4
},
{
"timestamp" : "20150427T100732.668000",
"entry" :
{
"eventType" : 4,
"eventTypeString" : "activated",
"launchDate" : "20150427T100732.668000",
"ownsMenuBar" : true,
"processId" : 7011,
"processName" : "TeamViewer"
},
"entryType" : 5
},
我已经进入了机器学习的话题,但我不知道该领域的哪个特定技术甚至子领域可以帮助我解决这个问题。也许有人有提示或想法在哪里可以找到正确的信息或解决方案的概念?