UHTML(1)                    General Commands Manual                   UHTML(1)



NAME
       uhtml - convert foreign character set HTML file to unicode

SYNOPSIS
       uhtml [ -p ] [ -c charset ] [ file ]

DESCRIPTION
       HTML comes in various character set encodings and has special forms to
       encode characters. To make HTML easier to process, uhtml normalizes it
       to a Unicode-only form.

       Uhtml detects the character set of the HTML input file and calls tcs(1)
       to convert it to utf, replacing HTML entity forms by their Unicode
       character representations, except for lt, gt, amp, quot and apos. The
       converted HTML is written to standard output. If no file is given,
       input is read from standard input. If the -p option is given, the
       detected character set is printed and the program exits without
       conversion. If character set detection fails, the default (utf) is
       assumed. This default can be changed with the -c option.
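
       The entity replacement described above can be illustrated with a short
       Python sketch. It is not the code in uhtml.c; the function name
       expand_entities, the use of Python's HTML 4 entity table, and the
       treatment of numeric references and of unknown names are all
       assumptions made for illustration. Only the rule that lt, gt, amp,
       quot and apos are left alone comes from this page.

```python
import re
from html.entities import name2codepoint

# Entities that stay encoded, as listed in the description above.
KEEP = {"lt", "gt", "amp", "quot", "apos"}

# Matches &name;, &#123; and &#x7B; forms (hypothetical grammar for this sketch).
ENTITY = re.compile(r"&(#[xX][0-9a-fA-F]+|#[0-9]+|[A-Za-z][A-Za-z0-9]*);")


def expand_entities(text):
    """Replace entity forms by their Unicode characters, except the KEEP set."""

    def repl(match):
        body = match.group(1)
        if body.startswith("#"):
            # Numeric reference: hexadecimal (&#x..;) or decimal (&#..;).
            if body[1] in "xX":
                code = int(body[2:], 16)
            else:
                code = int(body[1:])
            # Assumption: leave out-of-range values and surrogates untouched.
            if code > 0x10FFFF or 0xD800 <= code <= 0xDFFF:
                return match.group(0)
            # Assumption: numeric forms of the markup characters stay encoded.
            if chr(code) in "<>&\"'":
                return match.group(0)
            return chr(code)
        # Named reference: keep the five special names and any unknown name.
        if body in KEEP or body not in name2codepoint:
            return match.group(0)
        return chr(name2codepoint[body])

    return ENTITY.sub(repl, text)
```

       In this sketch the markup-significant entities survive, so the output
       remains valid HTML that downstream tools can parse without having to
       decode any other entity forms.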

SOURCE
       /sys/src/cmd/uhtml.c

SEE ALSO
       tcs(1)



                                                                      UHTML(1)