Smart Content Analysis Check Method
Smart Content Analysis check method lets you monitor any part(s) of an
HTML document or presence\absence of specified keyword(s). So you can create
your own analysis conditions to analyse resource most precisely. Right Web
Monitor lets you create any number of conditions for each monitoring resource.
All the conditions you will create will be placed one-by-one and in case if at
least one condition is satisfied1 the resource
presumes Modified2.
The conditions can be of the following types:
- "Analyse entire page"
condition can be used for analysing full content
of page from the very beginning to the end.
- "Analyse specified fragment"
condition can be used for analysing
specified fragment of page. To determine its bounds you should
specify the initial and ending fragments.
- "Look up for specified keywords"
condition can be used if you want the program to
analyse specified keywords presence or absence.
1. Analysing entire page
Once resource is downloaded the program applies the
source processing flags
and compares the resulting data for previously saved version and currently
downloaded
version. In case if it finds any difference the condition presumes satisfied.
2. Analysing specified fragment
| Screenshot #1 |
 |
Once resource is downloaded the program selects specified fragment, applies
the source processing flags and compares the resulting data for previously saved
version and last saved version. In case if it finds any difference the condition
presumes satisfied.
If you want Internet Explorer core to extract fragments from HTML just mark
"Use IE core to extract fragments". This option must be enabled when Page
Content Recorder has been used to define initial and ending fragments. If this
option is enabled the extracting of the fragments will become slower.
Example #1:
If you need to check the latest news on the RightSoft site you should create
a new link
with http://www.right-soft.com/news/
URL, go to its properties, select "Analysis"
page, then select "Smart Content Analysis", select the default
condition (it will be created automatically) and press the "Edit"
button. The following dialog will appear (screenshot #1):
Press
"View source" button to browse the source of the document. Find the following
text:
<!-- LastNews -->
...
<b><a href="../webmon/">Right Web Monitor</a></b>
1.5 is now available.<br>
...
<!-- /LastNews -->
or any other text placed between the "<!-- LastNews -->"
and the "<!-- /LastNews -->"
fragments. This is the latest news text. Select the initial and ending fragments and
fill in the corresponding fields as shown on the screenshot #1. Press "OK"
buttons on this dialog and on "Link Properties" dialog. Now, when the
program detects that text between the initial and ending fragments changed the
condition presumes satisfied.
3. Looking up for specified keywords
Keywords analysing can be of two types:
- Analysing for presence - keyword have to start with the +(plus)
sign, for example +"keyword1". If the program detects that such keyword
is present in the page it presumes that this keyword satisfies the condition.
- Analysing for absence - keyword have to start with the -(minus)
sign, for example -"keyword2". If the program detects that such keyword
is absent in the page it presumes that this keyword satisfies the condition.
Once resource is downloaded the program applies the
source processing flags
and searches for specified keywords within the resulting data for currently
downloaded version. In case if at least one keyword satisfies the condition, the
condition presumes satisfied.
| Screenshot #2 |
 |
If you want the program to notify you
only if all the keywords satisfy the condition just mark on the corresponding
checkbox.
If you want the program to look up for keywords only within specified
region just mark the corresponding checkbox and enter variable(s) that
represents a region where keywords should be looked up.
Keywords analysing of the both types can be used in the same condition
simultaneously. Keywords must have the following syntax:
+"keyword1" -"keyword2"
-"a keyphrase with space" i.e. keywords can contain spaces, but have
to start with + or - (plus or minus) signs, be placed in quotation marks (") and
separated with spaces. Keywords must not contain quotation marks themselves.
Example #2:
If you want the program to notify you only when Right Web Monitor related
news are published just follow the steps below:
- Create a condition as described in Example #1
.
- Create a new condition and set its type to "Look up for specified
keywords".
- Enter +"Right Web Monitor" into the
"Keywords" edit field.
- Enable "Look for keywords only within" check box and enter
%SELECTEDFRAGMENTS% into the edit field
below. Content Analysis options windows should look like the one on the
screenshot #2. Press the "OK" button.
- Mark on the "Mark
as Modified only when all the conditions are satisfied" checkbox.
The program will act this way:
- Step #1: if the program detects that text between the "<!-- LastNews -->"
and the "<!-- /LastNews -->"
fragments (hereinafter TEXT) has been
changed the first condition presumes satisfied and the program proceeds to the
Step #2.
- Step #2: if the program finds "Right Web Monitor" phrase in the
TEXT the second condition presumes
satisfied. So that we have only two conditions and both of them are satisfied
the program presumes resource Modified2.
Source processing flags
Right Web Monitor applies source processing flags once resource is
downloaded and before analysing it. Source processing flags can be of the
following types:
- Page source / Visible text only - if Page source flag is selected,
the program analyses HTML source and visible text. If Visible text only flag
is
selected the program removes all the HTML tags and scripts before analysing.
- Ignoring tags - if this flag is selected the program removes the
specified HTML tags from the analysed data. This flag makes sense only if
Page source flag is selected.
- Trim multiple spaces and line breaks - if this flag is selected
the program removes all the multiple spaces between words (leaves only one
space between two words) and CF/LF (Caret Return / Line Feed) sequences.
Default condition
Once you've selected Smart Content Analysis as analysis method for a resource,
the program automatically creates a condition with "Entire page, analysing visible
text only" description. This condition is of "Analysing entire page" type
with
"Visible text only" flag selected. So the program will analyse visible text of
the whole page, i.e. if any differences between visual parts of previously saved
copy and currently downloaded copy are detected the resource presumes Modified2. It
works similar to Exact Content Analysis but ignores all the HTML tags, so it is
convenient to use it instead of Exact Content Analysis if monitoring resource is
a page with dynamical links, visitor counters, date/time stamps and so on.
1- However if
"Link Properties > Analysis
> Smart Content Analysis options > Mark as Modified only when all the conditions
are satisfied" option is enabled all the condition have to be satisfied
to mark link as Modified and fire notification alerts.
2 - The
fact that resource presumes Modified means that status of the link is changed,
link representation in Links View
becomes in bold typeface and notification alerts fire.
Examples:
Example -
Smart Content Analysis: analysing certain fragment of a web page,
Example -
Smart Content Analysis:
monitoring keywords within certain region .
See also:
Summary strings.
|