# office-parse **Repository Path**: guoliben/office-parse ## Basic Information - **Project Name**: office-parse - **Description**: office文档解析 - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-04-29 - **Last Updated**: 2026-04-14 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # golang Ole2 文档解析 提取文本功能 文档解析, 文本内容提取,返回字符串 ## 支持类型 1. *.doc 2. *.xls 3. *.ppt ## 使用方式 ### 根据文件路径提取文本 office_parse.ParseOleDocFile office_parse.ParseOleXlsFile office_parse.ParseOlePptFile doctask.ProcessFile xlstask.ProcessFile ppttask.ProcessFile ### 根据文件路径提取文本 office_parse.ParseOleDocBuffer office_parse.ParseOleXlsBuffer office_parse.ParseOlePptBuffer doctask.ProcessBuffer xlstask.ProcessBuffer ppttask.ProcessBuffer ## 依赖 ### doc问题提取 无依赖 ### xls文本提取 extreme.xls extreme.ole ps:此包进行bug修复版本,解决加密和大文件crush问题 ### ppt文本提取 无依赖 ## demo ### main1.go package main import ( "fmt" office_parse "gitee.com/guoliben/office-parse" ) func main() { fmt.Println("------------") fmt.Println(office_parse.ParseOleDocFile("demo.doc")) fmt.Println(office_parse.ParseOleXlsFile("demo.xls")) fmt.Println(office_parse.ParseOlePptFile("demo.ppt")) fmt.Println("------------") } ### main2.go package main import ( "fmt" doctask "gitee.com/guoliben/office-parse/doc-parse" ppttask "gitee.com/guoliben/office-parse/ppt-parse" xlstask "gitee.com/guoliben/office-parse/xls-parse" ) func main() { fmt.Println("------------") fmt.Println(ppttask.ProcessFile("demo.ppt")) fmt.Println(xlstask.ProcessFile("demo.xls")) fmt.Println(doctask.ProcessFile("demo.doc")) fmt.Println("------------") }