我只想要一个建议或一种我可以处理情况的方法
我有一个模块,我曾经在其中爬取网站并让当前电影在附近的电影院放映。我有两张桌子 1) 一张是给电影的,另一张是给电影院看的,电影首先插入到电影表中。
现在我已经在每天早上的 cron Job 上设置了我的文件。所以在我的代码中,我首先删除两个表中的所有数据并插入新数据。但是这样一来,我通常会放弃最终用户对该特定电影给出的所有评分。
为了克服这种情况,我想到了一些解决方案
我创建了一个新查询
INSERT INTO jos_movie (movie_name, language, cast,movie_release,director,rating,rating_count,movie_ids)
SELECT * FROM (SELECT 'test','null','yahoo','Dec 21, 2012','himmat',250,230,'43677') AS tmp
WHERE NOT EXISTS (
SELECT movie_name FROM jos_movie WHERE movie_name = 'test')
同样,我也为电影院桌创建了相同的方法。
这样,它将检查并且不会覆盖表中的电影。但是这种方法存在一些问题。如果电影院所有者确实删除了该特定电影的节目,例如“测试”。然后通过上面的查询它不会删除那个。它会留在那里。
对不起我的主题行,因为我无法为这个问题考虑好的主题行。
那么我怎样才能获得一个结果,以便现有电影如果在表中就不会得到更新,如果它不在我的脚本的抓取结果数组中,就会被删除。
这是我的表格结果
这是电影表结果
这是电影院的桌子
这是我使用的代码。
$con=mysql_connect('localhost','test','test');
mysql_select_db('test',$con);
// Use cURL to get the RSS feed into a PHP string variable.
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL,'myrsslink.xml');
curl_setopt($ch, CURLOPT_HEADER, false);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
$xml = curl_exec($ch);
curl_close($ch);
$arrData = array();
// Create an array of item elements from the XML feed.
$news_items = element_set('item', $xml);
$del_movie = "delete from jos_movie";
mysql_query($del_movie);
$del_cinema = "delete from jos_cinema";
mysql_query($del_cinema);
foreach($news_items as $item) {
$title = value_in('title', $item);
$url = value_in('link', $item);
$cast = value_in('description', $item);
//curl_setopt($ch, CURLOPT_URL,$url);
//curl_setopt($ch, CURLOPT_HEADER, false);
//curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
//$html = curl_exec($ch);
$arrTitle = explode('-',$title);
$html = file_get_html($url);
$htmlShowTime = '';
// find all span tags with class=gb1 moviTimes moviTmngBox
foreach($html->find('ul[style=line-height:2em;]') as $e)
$htmlShowTime = $e->plaintext;
$movie_name = $arrTitle[0];
$apiKey = '30f44b6ef9472d414e50d2acaa058b60';
$url = sprintf('http://api.themoviedb.org/2.1/Movie.search/en/xml/%s/"%s"',$apiKey,rawurlencode(trim($movie_name)));
//$xml = simplexml_load_file("http://api.themoviedb.org/2.1/Movie.search/en/xml/accd3ddbbae37c0315fb5c8e19b815a5/"$movie_name"");
$xml = simplexml_load_file($url);
$movies = $xml->movies->movie;
foreach ($movies as $movie){
$arrMovie_id = $movie->id;
}
$arrStr = explode(':',$htmlShowTime);
$release = substr($arrStr[3],0,strlen($arrStr[3])-8);
$director = substr($arrStr[5],0,strlen($arrStr[5])-11);
$sql_movie = "insert into jos_movie(movie_name,language,cast,movie_release,director,rating,rating_count,movie_ids)values('$movie_name','null','$cast','$release','$director',250,230,'$arrMovie_id')";
//echo $sql.'<br>';
// echo $sql_movie;
mysql_query($sql_movie);
$sqlCount = 'select max(id) from jos_movie' or die("cannot select DB");
$data = mysql_query($sqlCount);
echo $data;
print_r($data);
$result = mysql_fetch_array($data);
$id = $result[0];
echo '<br>'.$id.'<br>';
//$id = mysql_insert_id();
//echo $id;
// find all span tags with class=gb1
foreach($html->find('div.moviTmngBox') as $e){
$tagTitle = $e->find('a',0);
$tagTime = $e->find('div.moviTimes',0);
$name = $tagTitle->title;
$time = $tagTime->innertext;
$trimName = '';
$temName = strtolower(str_replace(' ','',$name));
if(strpos($temName,'indraaudi1') !== false)
$trimName = 'Indra Audi 1' and $cinemaId = '1' and $long='32.726602' and $lat='74.857026';
elseif(strpos($temName,'indraaudi2') !== false)
$trimName = 'Indra Audi 2' and $cinemaId = '2'and $long='32.726602' and $lat='74.857026';
elseif(strpos($temName,'indraaudi3') !== false)
$trimName = 'Indra Audi 3'and $cinemaId = '3' and $long='32.726602' and $lat='74.857026';
elseif(strpos($temName,'apsra') !== false)
$trimName = 'Apsra' and $cinemaId = '4' and $long='32.700314' and $lat='74.858023';
else{
$trimName = trim(substr($name,18,strlen($name))) and $cinemaId = '5' and $long='32.7300' and $lat='74.8700' ;
}
//echo $tagTime->innertext.'<br/>';
$sql = "insert into jos_cinema(cinema_name,show_time,movie_id,cinemaId,logitude,latitude)values('$trimName','$time',$id,$cinemaId,$long,$lat)";
//echo $sql.'<br/>';
mysql_query($sql);
//$arrTem = array($tagTitle->title,$tagTime->innertext);
}
}//end rss feed loop
?>
请注意,我正在插入电影评分的默认值。
谢谢