Converting PDF to Word in C# and .NET

Converting PDF files to Word documents is a common requirement in many applications. Whether you need to edit the contents of a PDF file or only extract its text and images, converting it to a Word document gives you content you can work with directly.

PDF files are great for preserving the formatting and layout of a document, but they are not always easy to edit. Word documents, on the other hand, are designed for editing. By converting a PDF file to a Word document, you can:

  • Edit text and images in a document.
  • Reformat the document according to your needs.
  • Extract specific content for use in other documents.

In this article, we will look at how to convert a PDF file to a Word document in C# and .NET using the SautinSoft.PDF library, in the following steps:

  1. Add SautinSoft.PDF from NuGet.
  2. Load and convert the PDF file using a code snippet.
  3. Run the application to save the Word document and open it.


Complete code

using System;
using System.IO;
using System.Reflection;
using SautinSoft;
using SautinSoft.Pdf;
using SautinSoft.Pdf.Content;

namespace Sample
{
    class Sample
    {
        /// <summary>
        /// Convert PDF to DOCX.
        /// </summary>
        /// <remarks>
        /// Details: https://sautinsoft.com/products/pdf/help/net/developer-guide/convert-pdf-to-docx.php
        /// </remarks>
        static void Main(string[] args)
        {
            // Before starting this example, please get a free 100-day trial key:
            // https://sautinsoft.com/start-for-free/

            // Apply the key here:
            // PdfDocument.SetLicense("...");

            // Load a PDF document.
            using (var document = PdfDocument.Load(Path.GetFullPath(@"..\..\..\simple text.pdf")))
            {  
                // Save a PDF document to a DOCX file.
                document.Save("Output.docx");
            }

            // Open the result in the default application for .docx files.
            System.Diagnostics.Process.Start(new System.Diagnostics.ProcessStartInfo("Output.docx") { UseShellExecute = true });
        }
    }
}
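
The C# sample above uses a fixed input path and assumes the conversion succeeds. As a minimal sketch of a more defensive variant, the version below takes the paths from the command line, checks that the input exists, and reports failures. It relies only on the `PdfDocument.Load` and `Save` calls shown in the sample; the class name `SafeConvert`, the argument convention, and the exit codes are hypothetical choices made for illustration. Since the exception types thrown by the library are not documented here, the sketch catches the general `Exception`.

```csharp
using System;
using System.IO;
using SautinSoft.Pdf;

namespace Sample
{
    // Hypothetical name; not part of the SautinSoft samples.
    class SafeConvert
    {
        static int Main(string[] args)
        {
            // Assumed convention: args[0] is the input PDF,
            // args[1] (optional) is the output DOCX.
            if (args.Length < 1)
            {
                Console.Error.WriteLine("Usage: SafeConvert <input.pdf> [output.docx]");
                return 1;
            }

            string inputPath = Path.GetFullPath(args[0]);
            if (!File.Exists(inputPath))
            {
                Console.Error.WriteLine($"Input file not found: {inputPath}");
                return 2;
            }

            // Default output: same folder and file name as the input, with a .docx extension.
            string outputPath = args.Length > 1
                ? Path.GetFullPath(args[1])
                : Path.ChangeExtension(inputPath, ".docx");

            try
            {
                // Same two calls as in the sample above.
                using (var document = PdfDocument.Load(inputPath))
                {
                    document.Save(outputPath);
                }
            }
            catch (Exception ex)
            {
                // The library's specific exception types are not assumed here.
                Console.Error.WriteLine($"Conversion failed: {ex.Message}");
                return 3;
            }

            Console.WriteLine($"Saved: {outputPath}");
            return 0;
        }
    }
}
```

Returning a non-zero exit code on each failure path lets a script or build step that calls the converter detect the problem without parsing the console output.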


Option Infer On

Imports System
Imports System.IO
Imports System.Reflection
Imports SautinSoft
Imports SautinSoft.Pdf
Imports SautinSoft.Pdf.Content

Namespace Sample
	Friend Class Sample
		''' <summary>
		''' Convert PDF to DOCX.
		''' </summary>
		''' <remarks>
		''' Details: https://sautinsoft.com/products/pdf/help/net/developer-guide/convert-pdf-to-docx.php
		''' </remarks>
		Shared Sub Main(ByVal args() As String)
			' Before starting this example, please get a free license:
			' https://sautinsoft.com/start-for-free/

			' Apply the key here:
			' PdfDocument.SetLicense("...")

			' Load a PDF document.
			Using document = PdfDocument.Load(Path.GetFullPath("..\..\..\simple text.pdf"))
				' Save a PDF document to a DOCX file.
				document.Save("Output.docx")
			End Using

			' Open the result in the default application for .docx files.
			System.Diagnostics.Process.Start(New System.Diagnostics.ProcessStartInfo("Output.docx") With {.UseShellExecute = True})
		End Sub
	End Class
End Namespace



If you need a code sample or have a question, write to us at support@sautinsoft.ru.

Questions and suggestions are always welcome!

We have been developing .NET components since 2002. We know the PDF, DOCX, RTF, HTML, XLSX, and image formats. If you need help creating, modifying, or converting documents in various formats, we can help. We will write any code sample for you absolutely free.