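Resource overview
This program uses ICTCLAS, the lexical analysis software from the Chinese Academy of Sciences, to segment the text and tag parts of speech as preparation for clustering. It then uses tf-idf values to select 30 feature terms and builds a feature vector for each document, which can be loaded into Weka for clustering.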
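The tf-idf step described above is not part of the snippet shown on this page, so the following is only a minimal sketch of how it might look, not the program's actual code. It assumes the documents are already segmented into words, uses tf = count / document length and idf = ln(N / df), and ranks terms by their summed tf-idf over the whole corpus. The names `tfidf`, `selectFeatures` and `toArff` are hypothetical, and the ARFF attributes are named `t0`, `t1`, ... instead of the words themselves to sidestep encoding issues with Chinese attribute names.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <map>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

using Doc = std::vector<std::string>;           // one document as a list of segmented words
using Weights = std::map<std::string, double>;  // term -> tf-idf weight within one document
using Scored = std::pair<std::string, double>;  // term and its corpus-wide score

// Assumed weighting: tf = count / document length, idf = ln(N / df).
std::vector<Weights> tfidf(const std::vector<Doc>& docs) {
    assert(!docs.empty());
    std::map<std::string, int> df;  // number of documents containing each term
    std::vector<std::map<std::string, int>> counts(docs.size());
    for (std::size_t i = 0; i < docs.size(); ++i) {
        for (const std::string& word : docs[i]) ++counts[i][word];
        for (const auto& kv : counts[i]) ++df[kv.first];
    }
    const double n = static_cast<double>(docs.size());
    std::vector<Weights> out(docs.size());
    for (std::size_t i = 0; i < docs.size(); ++i) {
        for (const auto& kv : counts[i]) {
            const double tf = static_cast<double>(kv.second) / docs[i].size();
            out[i][kv.first] = tf * std::log(n / df[kv.first]);
        }
    }
    return out;
}

// Rank terms by summed tf-idf over the corpus and keep the k best.
// Ties are broken alphabetically so the result is deterministic.
std::vector<std::string> selectFeatures(const std::vector<Weights>& weights, std::size_t k) {
    std::map<std::string, double> total;
    for (const Weights& doc : weights)
        for (const auto& kv : doc) total[kv.first] += kv.second;
    std::vector<Scored> ranked(total.begin(), total.end());
    std::sort(ranked.begin(), ranked.end(), [](const Scored& a, const Scored& b) {
        return a.second != b.second ? a.second > b.second : a.first < b.first;
    });
    std::vector<std::string> features;
    for (std::size_t i = 0; i < ranked.size() && i < k; ++i)
        features.push_back(ranked[i].first);
    return features;
}

// Render one dense numeric row per document in ARFF form.
std::string toArff(const std::vector<Weights>& weights,
                   const std::vector<std::string>& features) {
    std::ostringstream os;
    os << "@relation documents\n";
    for (std::size_t j = 0; j < features.size(); ++j)
        os << "@attribute t" << j << " numeric\n";
    os << "@data\n";
    for (const Weights& doc : weights) {
        for (std::size_t j = 0; j < features.size(); ++j) {
            const auto it = doc.find(features[j]);
            os << (j ? "," : "") << (it == doc.end() ? 0.0 : it->second);
        }
        os << "\n";
    }
    return os.str();
}
```

With the segmented documents in `docs`, calling `selectFeatures(tfidf(docs), 30)` would yield the 30 feature terms mentioned in the description, and the string returned by `toArff` could be saved as an `.arff` file and opened in Weka.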
Code snippet and file information
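//////////////////////////////////////////////////////////////////////
// About ICTCLAS: the Institute of Computing Technology Chinese Lexical Analysis System (ICTCLAS).
// Features: Chinese word segmentation; part-of-speech tagging; unknown-word recognition.
// Segmentation accuracy reaches 97.58% (973 Program expert evaluation result),
// recall for unknown-word recognition is above 90% throughout, and recall for Chinese personal names is close to 98%;
// processing speed is 31.5 Kbytes/s.
// Copyright: Copyright (c) 2002-2005 Institute of Computing Technology, Chinese Academy of Sciences. Holders of the work-for-hire copyright: Zhang Huaping (张华平), Liu Qun (刘群)
// License: Natural Language Processing Open Resource License 1.0
// Email: zhanghp@software.ict.ac.cn
// Homepage: www.nlp.org.cn; mtgroup.ict.ac.cn
// ICTCLAS_Win.cpp : Defines the class behaviors for the application.
//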
#include "stdafx.h"
#include "ICTCLAS_Win.h"
#include "ICTCLAS_WinDlg.h"
#ifdef _DEBUG
#define new DEBUG_NEW
#undef THIS_FILE
static char THIS_FILE[] = __FILE__;
#endif
/////////////////////////////////////////////////////////////////////////////
// CICTCLAS_WinApp
BEGIN_MESSAGE_MAP(CICTCLAS_WinApp, CWinApp)
    //{{AFX_MSG_MAP(CICTCLAS_WinApp)
        // NOTE - the ClassWizard will add and remove mapping macros here.
        //    DO NOT EDIT what you see in these blocks of generated code!
    //}}AFX_MSG
    ON_COMMAND(ID_HELP, CWinApp::OnHelp)
END_MESSAGE_MAP()
/////////////////////////////////////////////////////////////////////////////
// CICTCLAS_WinApp construction
CICTCLAS_WinApp::CICTCLAS_WinApp()
{
    // TODO: add construction code here
    // Place all significant initialization in InitInstance
}
/////////////////////////////////////////////////////////////////////////////
// The one and only CICTCLAS_WinApp object
CICTCLAS_WinApp theApp;
/////////////////////////////////////////////////////////////////////////////
// CICTCLAS_WinApp initialization
BOOL CICTCLAS_WinApp::InitInstance()
{
    AfxEnableControlContainer();
    // Standard initialization
    // If you are not using these features and wish to reduce the size
    // of your final executable, you should remove from the following
    // the specific initialization routines you do not need.
#ifdef _AFXDLL
    Enable3dControls();         // Call this when using MFC in a shared DLL
#else
    Enable3dControlsStatic();   // Call this when linking to MFC statically
#endif
    CICTCLAS_WinDlg dlg;
    m_pMainWnd = &dlg;
    int nResponse = dlg.DoModal();
    if (nResponse == IDOK)
    {
        // TODO: Place code here to handle when the dialog is
        // dismissed with OK
    }
    else if (nResponse == IDCANCEL)
    {
        // TODO: Place code here to handle when the dialog is
        // dismissed with Cancel
    }
    // Since the dialog has been closed, return FALSE so that we exit the
    // application, rather than start the application's message pump.
    return FALSE;
}
Type Size Date Time Name
----------- --------- ---------- ----- ----
Directory 0 2012-03-02 15:36 Free Software\
File 4042 2011-12-04 20:18 Free Software\author.html
Directory 0 2012-03-04 19:02 Free Software\Codes and Application\
Directory 0 2012-03-02 15:36 Free Software\Codes and Application\Data\
File 7544244 2011-12-04 20:18 Free Software\Codes and Application\Data\BigramDict.dct
File 1565689 2011-12-04 20:18 Free Software\Codes and Application\Data\coreDict.dct
File 10412 2011-12-04 20:18 Free Software\Codes and Application\Data\lexical.ctx
File 1032 2011-12-04 20:18 Free Software\Codes and Application\Data\nr.ctx
File 113780 2011-12-04 20:18 Free Software\Codes and Application\Data\nr.dct
File 408 2011-12-04 20:18 Free Software\Codes and Application\Data\ns.ctx
File 54278 2011-12-04 20:18 Free Software\Codes and Application\Data\ns.dct
File 408 2011-12-04 20:18 Free Software\Codes and Application\Data\tr.ctx
File 64000 2011-12-04 20:18 Free Software\Codes and Application\Data\tr.dct
File 23324 2012-03-03 16:05 Free Software\Codes and Application\ICTCLAS_Win.aps
File 2348 2012-03-04 19:01 Free Software\Codes and Application\ICTCLAS_Win.clw
File 2751 2011-12-04 20:18 Free Software\Codes and Application\ICTCLAS_WIN.cpp
File 6045 2011-12-04 20:18 Free Software\Codes and Application\ICTCLAS_Win.dsp
File 547 2012-02-11 14:55 Free Software\Codes and Application\ICTCLAS_WIN.dsw
File 1997 2011-12-04 20:18 Free Software\Codes and Application\ICTCLAS_WIN.h
File 607232 2012-03-04 19:02 Free Software\Codes and Application\ICTCLAS_WIN.ncb
File 55808 2012-03-04 19:02 Free Software\Codes and Application\ICTCLAS_WIN.opt
File 258 2012-03-03 16:05 Free Software\Codes and Application\ICTCLAS_Win.plg
File 8386 2012-02-29 20:15 Free Software\Codes and Application\ICTCLAS_Win.rc
File 18563 2012-03-02 16:53 Free Software\Codes and Application\ICTCLAS_WinDlg.cpp
File 3049 2012-02-29 20:23 Free Software\Codes and Application\ICTCLAS_WinDlg.h
File 570 2011-12-04 20:18 Free Software\Codes and Application\log.txt
File 3669 2011-12-04 20:18 Free Software\Codes and Application\ReadMe.txt
Directory 0 2012-03-02 15:36 Free Software\Codes and Application\Release\
File 8273 2012-02-11 14:55 Free Software\Codes and Application\Release\ContextStat.obj
File 18075 2012-02-11 14:55 Free Software\Codes and Application\Release\Dictionary.obj
File 5574 2012-02-11 14:55 Free Software\Codes and Application\Release\DynamicArray.obj
............ (entries for 105 more files omitted here)