Text Copy Assistance Plugin for SAT
Text Copy Assistance Plugin for SAT
The "SAT Taisho Shinshū Daizōkyō Text Database Copy & Paste Assistance Plugin" is a PC browser plugin that formats text copied from the SAT Taishō Shinshū Daizōkyō Text Database by removing unrelated strings and organizing the text.
*Text Formatting: Removes unrelated strings from the main content and formats elements such as verses.
*Automatic Annotation: Automatically inserts information such as scripture titles and page numbers.
*Old Character Conversion (Variant Character Identification): Converts old characters into modern equivalents, improving readability.
*User Dictionary: Allows specific characters to be replaced with alternatives.
*N-gram Analysis: Analyzes the most frequently used strings in scriptures.
This plugin reduces the burden of quoting scriptures in writing. Additionally, its analysis tools can be used to identify themes in scriptures or compare different text versions.
Image
How to Install
Access the Chrome Web Store and select Install.
How to Pause
◆ Setting
The plugin functionality can be accessed from the puzzle icon ( ) or by clicking the pinned TCAPSAT icon in the toolbar. The "Enabled/Disabled" button can be toggled to temporarily disable the functionality.
About Advanced Options
Users can choose from a variety of copy methods to suit their preferences.
The plugin functionality can be accessed from the puzzle icon ( ) or by clicking the pinned TCAPSAT icon in the toolbar.
◆ Remove Periods
Enabling the "Remove Periods" mode removes periods (。
) from the text.
◆ Remove Newlines
Enabling the "Remove Newlines" mode removes newlines from the text.
Note:
Please note that all newlines will be removed.
It is recommended to use this in combination with the "Sentence Breaks" mode.
◆ Insert Sentence Breaks
The "Sentence Breaks" mode inserts line breaks at sentence boundaries.
By using it in conjunction with the "Remove Newlines" mode, unnecessary line breaks are removed while adding breaks at sentence boundaries.
Note:
The Taishō Shinshū Daizōkyō (The Taisho Tripitaka) is generally composed of lines with about 16 to 17 characters. However, there are some exceptions. For example:
(1) The number of periods results in a line being fewer than 16 characters.
(2) Characters not present in Unicode are provided as images.
Therefore, this feature treats lines with 14 characters or fewer (excluding periods and images) as sentence breaks and adds line breaks.
(No line breaks are added for lines with 15 to 17 characters.)
Additionally, if the first character is a full-width space, a line break will be added to that line (from version 1.01.01 onward).
In most cases, this setting will work, but if there are exceptions, please adjust the line breaks manually.
◆ Remove Spaces on Verse
The "Remove Spaces on Verse" mode removes spaces before and after verses and shifts the text by one character for copying.
When used in conjunction with the "Remove Newlines" mode, line breaks in the verse sections remain intact
Example:
Text data before processing. | Text data after processing. |
---|---|
時梵童子以偈報曰〇〇〇〇 典尊汝所修〇 爲欲何志求〇〇〇〇 〇〇〇〇 今設此供養〇 當爲汝受之又告大典尊。汝若有所問自恣問之。當爲 | 時梵童子以偈報曰〇 典尊汝所修〇 爲欲何志求〇 今設此供養〇 當爲汝受之又告大典尊。汝若有所問自恣問之。當爲 |
Note:
Some verses are not supported. This feature will not recognize a line as a verse if there are not four consecutive full-width spaces at the beginning of the line.
◆ Convert Characters
The "Convert Characters" mode converts many old-style (and variant) characters to "new-style characters".
Example:
Text data before processing. | Text data after processing. |
---|---|
歸依三寶。悉隨順行。略説如前。地大大神。斷除疑惑。 | 帰依三宝。悉随順行。略説如前。地大大神。断除疑惑。 |
◆ Auto Annotation
◆ N-gram analysis
Disclaimer
This plug-in is not responsible for any issues that may arise from using this plugin, nor for any failure to properly process strings.
About Linking
You are more than welcome to post my website URL on other websites.
However, copying or reproducing any content of the website is strictly prohibited.
Top page: https://sosesha.com/
Plugin distribution page: https://sosesha.com/sat_plugin