Anjusha Mune et al.
INTRODUCTION
Louping ill virus (LIV) is a tick-born member of the genus
Flavivirus in Flaviviridae family. It is a positive single
stranded, 40-50 nm RNA virus whose genome comprises
a single open reading frame (ORF) that is approximately
11 kb in length (Grard et al.,2007;Jeffries et al., 2014).
The ORF encodes a polyprotein that consists of three
structural and seven non-structural proteins. The virus
show high degree of genetic homology to tick-borne
encephalitis virus (TBEV) of the same family (McGuire
et al., 1998; Jiang et al., 1993). It is mainly transmitted
by ticks and the primary vector is Ixodes ricinus (Dobler
et al., 2010).LIV mainly causes febrile illness in sheep,
cattle, horse, pigs and some other animals that may
eventually result in fatal encephalitis.
Sheep are the most important reservoir host for LIV.
The disease is dominantly detected in animals from
upland areas of British Isles (Gao et al., 1997) though
the disease is also reported in Scotland, Ireland, and
northern England where the tick vector Ixodes ricinus
is found. Infection with LIV was rst reported in sheep
of Basque region of northern Spain in 1987 (Gonzalez
et al., 1987). Most of the cases of LI infection occur
in spring / early summer when ticks are common. In
endemic areas morbidity and mortality depends upon
animal’s immune status, concurrent infection and other
factors. All age group of animal get infected by it and
once encephalitis is developed the case fatality rate goes
up to 50%. The mortality rate is even higher in animals
that are less than two years old. Currently, there is no
speci c treatment for LIV with only supportive therapies
being helpful to some extent (Hyde et al., 2007 Mans-
eld et al., 2015 Butt et al., 2016).
The molecular sequence data started to be accu-
mulated nearly 20 years ago. It was observed that the
genetic code is redundant and most amino acids can be
translated by more than one codon (Wang et al., 2011).
This redundancy is a key factor regulating the ef ciency
and accuracy of protein production.Alternative codons
within the same group that encode the same amino acid
are often called ‘synonymous’ codons. These codons are
not randomly selected within and between genomes.
This is referred to as ‘codon usage bias’ (CUB). CUB are
widespread across the tree of life and are in uenced by
mutation pressure, natural or translational selection,
secondary protein structure, replication, selective tran-
scription, hydrophobicity and hydrophilicity of the pro-
tein, and the external environment (Xiang et al., 2015
Butt et al., 2016 Mune et al., 2017).
As viruses are intracellular pathogens they have to
co-evolve with host molecular mechanisms. The inter-
play between the codon usage of the virus and its host is
expected to affect the overall viral survival, tness, evasion
of the host immune system and evolution. The knowledge
of the codon usage of viruses can provide information
about their molecular evolution and extend our under-
standing of the regulation of viral gene expression. This
may also offer signi cant improvement in vaccine design
for which the ef cient expression of viral proteins may be
required to generate immunity (Tao et al., 2009 Velazquez
et al., 2016). To gain insight into the characteristics of the
viral genome and evolution, the codon usage patterns of
the three components of transmission cycle, namely - the
virus (LIV), vector (Ixodes ricinus), and hosts (Sheep (Ovis
aries), Pig (Sus scrofa) and cattle (Bos taurus)) were inves-
tigated in our study.
MATERIALS AND METHODS
SEQUENCE DATA
The complete genome sequences were downloaded from
the National Centre for Biotechnology (NCBI) data-
base (http: //www.ncbi.nlm. nih.gov) in FASTA format.
The detailed information (accession numbers, coun-
try, sequence length etc.) of the selected genomes were
listed [Table. S1]. Open reading frames (ORF) of all the
genomic sequences were identi ed by using NCBI ORF
nder (https://www.ncbi.nlm.nih.gov/orf nder/). The
host (Ovis aries, Sus scrofa and Bos taurus) and vec-
tor (Ixodes ricinus) codon usage were obtained from the
Codon Usage Data Base (CUD).
CODON USAGE ANALYSIS
The overall frequency of occurrence of the nucleotides
(A %, C %, U %, and G %) was calculated along with
the frequency of each nucleotide at the third site of the
synonymous codons (A
3
, C
3
, U
3
and G
3
).Also the overall
GC, AU and GC
3
content were calculated using MEGA7
software to investigate the compositional properties of
coding region of LIV. To investigate the codon usage
pattern, the RSCU (Relative synonymous codon usage)
values for synonymous codons were calculated accord-
ing to the published equation (Sharp et al., 1986). The
stop codons (UAA, UAG and UGA) and AUG for Met,
UCG for Try were not introduced into the RSCU anal-
ysis. Further, ENC (Effective number of codon) values
were calculated to measure the magnitude of codon
usage bias in the coding sequences of viral genome. The
ENC value ranges from 20 (when only one synonymous
codon is chosen by the corresponding amino acid) to 61
(when all synonymous codons are used equally). A low
ENC value indicates a strong codon usage bias (Wright
et al., 1990; Zhang et al., 2011 Butt et al., 2013).
The CAI (Codon adaptation index) was used to esti-
mate the adaptation of LIV to its host and vector codons.
BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS A COMPARATIVE ANALYSIS OF OVERALL CODON USAGE PATTERN OF LOUPING ILL VIRUS WITH 301