Text Copy Assistance Plugin for SAT

The "SAT Taisho Shinshū Daizōkyō Text Database Copy & Paste Assistance Plugin" is a PC browser plugin that formats text copied from the SAT Taishō Shinshū Daizōkyō Text Database by removing unrelated strings and organizing the text.

*Text Formatting: Removes unrelated strings from the main content and formats elements such as verses.
*Automatic Annotation: Automatically inserts information such as scripture titles and page numbers.
*Old Character Conversion (Variant Character Identification): Converts old characters into modern equivalents, improving readability.
*User Dictionary: Allows specific characters to be replaced with alternatives.
*N-gram Analysis: Analyzes the most frequently used strings in scriptures.

This plugin reduces the burden of quoting scriptures in writing. Additionally, its analysis tools can be used to identify themes in scriptures or compare different text versions.

Access the Chrome Web Store and select Install.

The plugin functionality can be accessed from the puzzle icon ( ) or by clicking the pinned TCAPSAT icon in the toolbar. The "Enabled/Disabled" button can be toggled to temporarily disable the functionality.

Users can choose from a variety of copy methods to suit their preferences.
The plugin functionality can be accessed from the puzzle icon ( ) or by clicking the pinned TCAPSAT icon in the toolbar.

Enabling the "Remove Periods" mode removes periods () from the text.

Enabling the "Remove Newlines" mode removes newlines from the text.

Note:

Please note that all newlines will be removed.
It is recommended to use this in combination with the "Sentence Breaks" mode.

The "Sentence Breaks" mode inserts line breaks at sentence boundaries.
By using it in conjunction with the "Remove Newlines" mode, unnecessary line breaks are removed while adding breaks at sentence boundaries.

Note:

The Taishō Shinshū Daizōkyō (The Taisho Tripitaka) is generally composed of lines with about 16 to 17 characters. However, there are some exceptions. For example:
(1) The number of periods results in a line being fewer than 16 characters.
(2) Characters not present in Unicode are provided as images.

Therefore, this feature treats lines with 14 characters or fewer (excluding periods and images) as sentence breaks and adds line breaks.
(No line breaks are added for lines with 15 to 17 characters.)
Additionally, if the first character is a full-width space, a line break will be added to that line (from version 1.01.01 onward).
In most cases, this setting will work, but if there are exceptions, please adjust the line breaks manually.

The "Remove Spaces on Verse" mode removes spaces before and after verses and shifts the text by one character for copying.
When used in conjunction with the "Remove Newlines" mode, line breaks in the verse sections remain intact

Example:

Text data before processing.Text data after processing.
時梵童子以偈報曰
〇〇〇〇典尊汝所修爲欲何志求〇〇〇〇
〇〇〇〇今設此供養當爲汝受之
又告大典尊。汝若有所問自恣問之。當爲
時梵童子以偈報曰
典尊汝所修爲欲何志求
今設此供養當爲汝受之
又告大典尊。汝若有所問自恣問之。當爲
"〇" represents a full-width space.

Note:

Some verses are not supported. This feature will not recognize a line as a verse if there are not four consecutive full-width spaces at the beginning of the line.

The "Convert Characters" mode converts many old-style (and variant) characters to "new-style characters".

Example:

Text data before processing.Text data after processing.
依三。悉順行。略説如前。地大大神。除疑惑。依三。悉順行。略説如前。地大大神。除疑惑。
Red text represents the characters to be converted.

This plug-in is not responsible for any issues that may arise from using this plugin, nor for any failure to properly process strings.

You are more than welcome to post my website URL on other websites.
However, copying or reproducing any content of the website is strictly prohibited.
Top page: https://sosesha.com/
Plugin distribution page: https://sosesha.com/sat_plugin