2 posts tagged with "AI"

View all tags

PDF Craft: Convert scanned PDF books to EPUB

· 3 min read
Moskize91
Engineer of OOMOL

Do you also collect a lot of scanned PDF documents? Those academic papers, e-books or work materials, although the content is precious, are very difficult to read - rigid layout, unadjustable fonts, always need to zoom in and out when reading on mobile phones.

Now, these PDFs can be easily converted into comfortable EPUB format through pdf-craft. Just like organizing a pile of paper documents into a portable e-book, you can finally browse these contents in the most suitable way for you on your favorite EPUB reader: adjust the font size, switch to night mode, or even listen to AI reading.

pdf-craft is an open source library dedicated to processing scanned book PDFs. It can accurately identify text content, headers and footers, reference annotations, etc. in PDF files. It can maintain the coherence of cross-page content and restore the correct reading order. In addition, it will use LLM to build a complete EPUB contents structure.

It is very simple to use pdf-craft in oomol. First, create a blank project. Then type "pdf-craft" in the search box in the oomol store to find it.

Drag the “Analyse PDF” and “Generate EPUB” blocks onto the empty flow. Then, connect their output_dir and analysed_dir fields as shown in the figure.

Then, set the pdf field to the PDF source file to be processed, and then set the epub_file_path to the converted EPUB file path. Finally, click the Run button in the upper right corner to start the conversion.

AI translation to create bilingual EPUB e-books

· 4 min read
Moskize91
Engineer of OOMOL

Are you tired of the bad experience of messy formatting and no layout after e-book translation? EPUB Translator's intelligent translation technology not only perfectly preserves the exquisite layout of the original book, but also has a unique bilingual comparison function, making language learning more intuitive and efficient than ever before.

Whether reading original novels or studying professional literature, you can now get precisely aligned bilingual texts, with all illustrations, chapters and special formats fully preserved. The original sentence and the translation are presented side by side, helping you to easily compare and learn, just like having a translation mentor at your side at any time, making cross-language reading a truly pleasant learning journey.

epub-translator is an open source library that uses AI big models to automatically translate EPUB e-books and 100% retains the format, illustrations, table of contents and layout of the original book. It is encapsulated as a shared block in oomol and can be found in the oomol store.

First, enter the Community page in the left navigation bar and search for the "books rranslator" template. This preset template contains all the necessary operating environment and configuration. Click the "Use" button to quickly initialize a new project. The system will automatically create a workspace containing all the dependencies of epub-translator, and there is no need to manually install any components.

After the project is created, you will see a preset workflow interface. Click the source_file field on the block to upload the e-book file to be translated. The standard epub format is supported here.

Next, enter the key parameter setting stage. In the language selection drop-down menu, you can specify the target translation language in the language field, such as Simplified Chinese, English, or Japanese. Then, you need to configure which large language model to use for translation. We recommend DeepSeek Chat as the default translation engine, which excels in maintaining literary and technical accuracy. In addition, you can also add prompts in the prompt field to tell the large language model how to handle considerations such as character names and terminology.

The translation process is fully automated, and you can view the progress in real time in the log window. After processing, the system will generate a new bilingual e-book. When you open it, you will find that all the original format elements are perfectly preserved - illustrations remain in their original positions, special fonts are rendered correctly, and the directory structure is automatically generated in bilingual versions. What's more thoughtful is that the system automatically processes various metadata inside the e-book to ensure that the author, publisher and other information can be correctly displayed in the new language environment.