Fixing DedeCms collection that does not auto-generate the summary and keywords - 狐狸库

DedeCMS 5.6 collected the content, but it did not auto-generate the summary or the keywords: both fields were empty.

Many people have probably hit the same problem. I could not find a good solution on the forums, so I edited the source file myself.

Here is my fix:

Suppose I am collecting from the site http://www.xxx.com

The meta tags on its home page are:

<meta http-equiv="Content-Type" content="text/html; charset=gb2312" />
<title>iphone苹果站长网</title>
<meta name=description content="苹果iPhone手机中文官网,为您提供苹果iPhone 4代手机最新报价，同时包括苹果iPhone手机图片、苹果iPhone手机参数、苹果iPhone手机主题、苹果iPhone手机软件、苹果iPhone手机游戏、苹果iPhone手机评测行情、苹果iphone视频、苹果iphone铃声、苹果iPhone手机论坛、苹果iPhone手机下载等信息。" />
<meta name=keywords content="苹果手机,iphone,苹果中文网,苹果手机官网,iphone4代,苹果iphone,iPhone手机,苹果手机报价,图片,论坛,游戏,电影,主题,铃声,铃声,下载,软件,应用" />

When this page is collected with DedeCMS, the description and keywords values are not picked up automatically.

Did you notice how this markup differs from that of other sites?

<meta name=description content=... /> 
<meta name=keywords content=... />

The values in name=description and name=keywords are not wrapped in double quotes. The usual form is name="description" and name="keywords".

This is probably a bug in the DedeCMS code that parses keywords and the summary: it does not account for tags written as <meta name=keywords content="..." />.
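To see the mismatch concretely, here is a minimal PHP sketch that runs two patterns against a tag with an unquoted name value. The sample string is a shortened stand-in for the keywords tag above. The first pattern follows the stock form that requires quotes around the name value, and the second is an illustrative variant that expects the unquoted form. The variable names are my own, not DedeCMS code.

```php
<?php
// Shortened stand-in for the keywords tag on the collected page (assumption).
$html = '<meta name=keywords content="苹果手机,iphone" />';

// Stock-style pattern: the name value must be wrapped in ' or ".
$quoted = "/<meta[\s]+name=['\"]keywords['\"] content=['\"](.*)['\"]/isU";

// Illustrative variant: the name value is written without quotes.
$unquoted = "/<meta[\s]+name=keywords content=['\"](.*)['\"]/isU";

// preg_match returns 1 on a match and 0 when nothing matches.
$quotedHit   = preg_match($quoted, $html, $m1);
$unquotedHit = preg_match($unquoted, $html, $m2);

var_dump($quotedHit, $unquotedHit, $m2[1] ?? null);
```

The quoted pattern finds nothing because the character right after name= is a letter rather than a quote. That is why the collected keywords and summary end up empty.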

The collection settings offer no option for entering keywords and a summary by hand, so the only way out was to modify the file myself.

1. Edit include/dedecollection.class.php and find this code:

// Automatically parse the keywords and summary
preg_match("/<meta[\s]+name=['\"]keywords['\"] content=['\"](.*)['\"]/isU",$this->tmpHtml,$inarr);
preg_match("/<meta[\s]+content=['\"](.*)['\"] name=['\"]keywords['\"]/isU",$this->tmpHtml,$inarr2);
if(!isset($inarr[1]) && isset($inarr2[1]))
{
    $inarr[1] = $inarr2[1];
}

Replace it with the following:

// Automatically parse the keywords and summary
preg_match("/<meta[\s]+name=['\"]keywords['\"] content=['\"](.*)['\"]/isU",$this->tmpHtml,$inarr);
preg_match("/<meta[\s]+content=['\"](.*)['\"] name=['\"]keywords['\"]/isU",$this->tmpHtml,$inarr2);
// Extra pattern for an unquoted name value: <meta name=keywords content="..." />
preg_match("/<meta[\s]+name=keywords content=['\"](.*)['\"]/isU",$this->tmpHtml,$inarr3);
if(!isset($inarr[1]) && isset($inarr2[1]))
{
    $inarr[1] = $inarr2[1];
}
if(!isset($inarr[1]) && isset($inarr3[1]))
{
    $inarr[1] = $inarr3[1];
}

After that change, one more place in the same file needs the same treatment. Find the description code:

preg_match("/<meta[\s]+name=['\"]description['\"] content=['\"](.*)['\"]/isU",$this->tmpHtml,$inarr);
preg_match("/<meta[\s]+content=['\"](.*)['\"] name=['\"]description['\"]/isU",$this->tmpHtml,$inarr2);
if(!isset($inarr[1]) && isset($inarr2[1]))
{
    $inarr[1] = $inarr2[1];
}

Replace it with the following:

preg_match("/<meta[\s]+name=['\"]description['\"] content=['\"](.*)['\"]/isU",$this->tmpHtml,$inarr);
preg_match("/<meta[\s]+content=['\"](.*)['\"] name=['\"]description['\"]/isU",$this->tmpHtml,$inarr2);
// Extra pattern for an unquoted name value: <meta name=description content="..." />
preg_match("/<meta[\s]+name=description content=['\"](.*)['\"]/isU",$this->tmpHtml,$inarr3);
if(!isset($inarr[1]) && isset($inarr2[1]))
{
    $inarr[1] = $inarr2[1];
}
if(!isset($inarr[1]) && isset($inarr3[1]))
{
    $inarr[1] = $inarr3[1];
}
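As an alternative to a separate pattern per quoting style, the quotes around the name value can be made optional with ['\"]?. The sketch below is only an illustration under that assumption. extractMetaContent is a hypothetical helper of my own, not part of DedeCMS, and it uses [^>]* instead of .* so that a match cannot run past the end of the tag. It needs PHP 7.1 or later for the nullable return type.

```php
<?php
/**
 * Hypothetical helper (not part of DedeCMS): returns the content value of
 * <meta name=...> for the given name, or null when no tag matches.
 */
function extractMetaContent(string $html, string $name): ?string
{
    // Escape the name so it is safe to embed in a /-delimited pattern.
    $n = preg_quote($name, '/');

    $patterns = [
        // name first; ['\"]? makes the quotes around the name value optional.
        "/<meta[\s]+name=['\"]?{$n}['\"]?[\s]+content=['\"]([^>]*)['\"]/isU",
        // content first; the lookahead requires the name value to end here.
        "/<meta[\s]+content=['\"]([^>]*)['\"][\s]+name=['\"]?{$n}['\"]?(?=[\s\/>])/isU",
    ];

    foreach ($patterns as $pattern) {
        if (preg_match($pattern, $html, $m)) {
            return trim($m[1]);
        }
    }
    return null;
}

// Usage with made-up sample markup.
$sample = "<title>t</title>\n"
        . "<meta name=description content=\"desc text\" />\n"
        . "<meta name=keywords content=\"a,b\" />";

$keywords    = extractMetaContent($sample, 'keywords');
$description = extractMetaContent($sample, 'description');
$reversed    = extractMetaContent('<meta content="x" name=keywords>', 'keywords');
$missing     = extractMetaContent($sample, 'author');
```

Applying this inside dedecollection.class.php would mean changing more than the two snippets above, so the three-pattern replacement remains the smaller change.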

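Note: this article was last updated on 2022/10/20 02:57:41. If any of its content or images no longer work, please leave a comment or contact the site administrator.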
