1

抽象的:

每个客户都有一个特定的 xml 广告提要(publisher_feed 表)。每次对该提要进行查询或点击时,都会记录下来(publisher_stats_raw 表)(每个查询/点击将有多行,具体取决于客户端传递的 subid(我们可以将点击汇总在一起))。第二天,我们从 API 中提取统计数据以获取前几天的收入数字(rev_stats 表)(每个收入统计数据可能有多行,具体取决于点击的国家/地区(我们可以将收入汇总在一起))。很难尝试将这三个表链接在一起以找到前一天每个客户端的平均 RPC。

表结构:

CREATE TABLE `publisher_feed` (
  `publisher_feed_id` int(10) unsigned NOT NULL AUTO_INCREMENT,
  `alias` varchar(45) DEFAULT NULL,
  `user_id` int(10) unsigned DEFAULT NULL,
  `remote_feed_id` int(10) unsigned DEFAULT NULL,
  `subid` varchar(255) DEFAULT '',
  `requirement` enum('tq','tier2','ron','cpv','tos1','tos2','tos3','pv1','pv2','pv3','ar','ht') DEFAULT NULL,
  `status` enum('enabled','disabled') DEFAULT 'enabled',
  `tq` decimal(4,2) DEFAULT '0.00',
  `clicklimit` int(11) DEFAULT '0',
  `prev_rpc` decimal(20,10) DEFAULT '0.0000000000',
  PRIMARY KEY (`publisher_feed_id`),
  UNIQUE KEY `alias_UNIQUE` (`alias`),
  KEY `publisher_feed_idx` (`remote_feed_id`),
  KEY `publisher_feed_user` (`user_id`),
  CONSTRAINT `publisher_feed_feed` FOREIGN KEY (`remote_feed_id`) REFERENCES `remote_feed` (`remote_feed_id`) ON DELETE NO ACTION ON UPDATE NO ACTION,
  CONSTRAINT `publisher_feed_user` FOREIGN KEY (`user_id`) REFERENCES `user` (`user_id`) ON DELETE NO ACTION ON UPDATE NO ACTION
) ENGINE=InnoDB AUTO_INCREMENT=124 DEFAULT CHARSET=latin1$$



CREATE TABLE `publisher_stats_raw` (
  `publisher_stats_id` int(10) unsigned NOT NULL AUTO_INCREMENT,
  `unique_data` varchar(350) NOT NULL,
  `publisher_feed_id` int(10) unsigned DEFAULT NULL,
  `date` date DEFAULT NULL,
  `subid` varchar(255) DEFAULT NULL,
  `queries` int(10) unsigned DEFAULT '0',
  `impressions` int(10) unsigned DEFAULT '0',
  `clicks` int(10) unsigned DEFAULT '0',
  `filtered` int(10) unsigned DEFAULT '0',
  `revenue` decimal(20,10) unsigned DEFAULT '0.0000000000',
  PRIMARY KEY (`publisher_stats_id`),
  UNIQUE KEY `unique_data_UNIQUE` (`unique_data`),
  KEY `publisher_stats_raw_remote_feed_idx` (`publisher_feed_id`)
) ENGINE=InnoDB AUTO_INCREMENT=472 DEFAULT CHARSET=latin1$$




CREATE TABLE `rev_stats` (
  `rev_stats_id` int(10) unsigned NOT NULL AUTO_INCREMENT,
  `date` date DEFAULT NULL,
  `remote_feed_id` int(10) unsigned DEFAULT NULL,
  `typetag` varchar(255) DEFAULT NULL,
  `subid` varchar(255) DEFAULT NULL,
  `country` varchar(2) DEFAULT NULL,
  `revenue` decimal(20,10) DEFAULT NULL,
  `tq` decimal(4,2) DEFAULT NULL,
  `finalized` int(11) DEFAULT '0',
  PRIMARY KEY (`rev_stats_id`),
  KEY `rev_stats_remote_feed_idx` (`remote_feed_id`),
  CONSTRAINT `rev_stats_remote_feed` FOREIGN KEY (`remote_feed_id`) REFERENCES `remote_feed` (`remote_feed_id`) ON DELETE NO ACTION ON UPDATE NO ACTION
) ENGINE=InnoDB AUTO_INCREMENT=58 DEFAULT CHARSET=latin1$$

语境:

每个 remote_feed 都有一个特定的 subid/typetag 给它。因此,我们需要将 publisher_feed 表中的 remote_feed_id 和 subid 列与收入统计表中的 remote_feed_id 和 typetag 列进行匹配。

我当前的非工作实施:

SELECT 
    pf.publisher_feed_id, psr.date, sum(clicks), sum(rs.revenue)
FROM 
    xml_network.publisher_feed pf
JOIN
    xml_network.publisher_stats_raw psr
ON
    psr.publisher_feed_id = pf.publisher_feed_id
JOIN
    xml_network.rev_stats rs
ON
    rs.remote_feed_id = pf.remote_feed_id
WHERE 
    pf.requirement = 'tq'
AND
    pf.subid = rs.typetag
AND
    psr.date <> date(curdate())
GROUP BY
    psr.date
ORDER BY
    psr.date DESC
LIMIT 1;

以上内容不断从 rev_stats 表中提取错误数据(提取正确统计信息的总和,但由于连接而重复它)。任何有关如何正确提取正确数据的帮助都会非常有帮助(我可以使用多个查询和 PHP 来获得正确的结果,但这有什么好玩的!)

4

1 回答 1

0

想出了一个方法来完成这个。它的 def 无论如何都不是一个快速的方法,需要 4 次选择才能完成,但它可以完美地工作 =)

SELECT 
        pf.publisher_feed_id, 

        round(
                (
                        SELECT 
                                SUM(rs.revenue)
                        FROM
                                xml_network.rev_stats rs
                        WHERE
                                rs.remote_feed_id = pf.remote_feed_id
                        AND
                                rs.typetag = pf.subid
                        AND
                                rs.date = subdate(current_date, 1)

                ),10)as revenue,
                (
                        SELECT 
                                MAX(rs.tq)
                        FROM
                                xml_network.rev_stats rs
                        WHERE
                                rs.remote_feed_id = pf.remote_feed_id
                        AND
                                rs.typetag = pf.subid
                        AND
                                rs.date = subdate(current_date, 1)

                ) as tq,
                (
                        SELECT 
                                SUM(psr.clicks)-SUM(psr.filtered)
                        FROM
                                xml_network.publisher_stats_raw psr
                        WHERE
                                psr.publisher_feed_id = pf.publisher_feed_id
                        AND
                                psr.date = subdate(current_date, 1)

                ) as clicks
FROM 
        xml_network.publisher_feed pf

WHERE 
        pf.requirement = 'tq';
于 2013-10-09T02:32:39.187 回答