Amino acid dipepetide frequency for Ligustrum virus A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.043AlaAla: 5.043 ± 2.095
2.161AlaCys: 2.161 ± 1.086
3.242AlaAsp: 3.242 ± 0.684
3.963AlaGlu: 3.963 ± 1.117
3.242AlaPhe: 3.242 ± 0.875
5.043AlaGly: 5.043 ± 0.939
1.081AlaHis: 1.081 ± 0.86
4.683AlaIle: 4.683 ± 1.306
5.403AlaLys: 5.403 ± 1.743
5.764AlaLeu: 5.764 ± 1.703
2.161AlaMet: 2.161 ± 0.532
3.242AlaAsn: 3.242 ± 1.184
3.242AlaPro: 3.242 ± 1.935
1.441AlaGln: 1.441 ± 0.591
2.882AlaArg: 2.882 ± 0.93
5.764AlaSer: 5.764 ± 1.077
4.323AlaThr: 4.323 ± 0.687
3.602AlaVal: 3.602 ± 2.331
0.36AlaTrp: 0.36 ± 0.631
0.72AlaTyr: 0.72 ± 0.67
0.0AlaXaa: 0.0 ± 0.0
Cys
2.161CysAla: 2.161 ± 1.146
0.0CysCys: 0.0 ± 0.0
1.081CysAsp: 1.081 ± 1.212
1.441CysGlu: 1.441 ± 0.758
2.522CysPhe: 2.522 ± 1.327
1.801CysGly: 1.801 ± 0.87
0.36CysHis: 0.36 ± 0.951
1.441CysIle: 1.441 ± 0.758
0.72CysLys: 0.72 ± 0.379
3.242CysLeu: 3.242 ± 1.043
0.36CysMet: 0.36 ± 1.145
1.081CysAsn: 1.081 ± 0.603
1.081CysPro: 1.081 ± 0.603
0.36CysGln: 0.36 ± 0.19
1.441CysArg: 1.441 ± 0.551
1.441CysSer: 1.441 ± 1.161
1.801CysThr: 1.801 ± 0.948
3.602CysVal: 3.602 ± 3.117
0.0CysTrp: 0.0 ± 0.0
1.441CysTyr: 1.441 ± 0.937
0.0CysXaa: 0.0 ± 0.0
Asp
1.801AspAla: 1.801 ± 0.948
1.801AspCys: 1.801 ± 0.948
1.081AspAsp: 1.081 ± 0.513
3.963AspGlu: 3.963 ± 1.505
2.161AspPhe: 2.161 ± 0.774
5.043AspGly: 5.043 ± 2.079
1.081AspHis: 1.081 ± 0.603
2.882AspIle: 2.882 ± 2.331
2.161AspLys: 2.161 ± 0.53
4.683AspLeu: 4.683 ± 1.99
0.72AspMet: 0.72 ± 0.543
2.522AspAsn: 2.522 ± 1.566
2.882AspPro: 2.882 ± 2.233
1.081AspGln: 1.081 ± 0.569
1.441AspArg: 1.441 ± 0.758
2.882AspSer: 2.882 ± 1.007
2.161AspThr: 2.161 ± 0.53
3.963AspVal: 3.963 ± 0.976
1.441AspTrp: 1.441 ± 0.551
2.882AspTyr: 2.882 ± 1.088
0.0AspXaa: 0.0 ± 0.0
Glu
6.124GluAla: 6.124 ± 0.896
0.36GluCys: 0.36 ± 0.19
2.882GluAsp: 2.882 ± 1.183
5.043GluGlu: 5.043 ± 1.489
2.161GluPhe: 2.161 ± 1.086
3.242GluGly: 3.242 ± 1.804
1.441GluHis: 1.441 ± 0.758
6.484GluIle: 6.484 ± 1.368
2.522GluLys: 2.522 ± 0.86
5.403GluLeu: 5.403 ± 2.342
2.161GluMet: 2.161 ± 0.774
1.801GluAsn: 1.801 ± 1.193
2.522GluPro: 2.522 ± 0.925
4.683GluGln: 4.683 ± 1.695
2.522GluArg: 2.522 ± 1.047
3.963GluSer: 3.963 ± 0.965
1.081GluThr: 1.081 ± 1.191
8.646GluVal: 8.646 ± 1.673
0.36GluTrp: 0.36 ± 0.19
2.161GluTyr: 2.161 ± 0.53
0.0GluXaa: 0.0 ± 0.0
Phe
3.242PheAla: 3.242 ± 0.684
1.441PheCys: 1.441 ± 0.758
3.602PheAsp: 3.602 ± 1.289
5.403PheGlu: 5.403 ± 1.322
1.801PhePhe: 1.801 ± 0.645
4.323PheGly: 4.323 ± 1.28
0.72PheHis: 0.72 ± 0.379
4.323PheIle: 4.323 ± 1.28
2.882PheLys: 2.882 ± 1.188
6.844PheLeu: 6.844 ± 2.121
0.0PheMet: 0.0 ± 0.0
1.801PheAsn: 1.801 ± 1.365
0.72PhePro: 0.72 ± 1.343
2.161PheGln: 2.161 ± 0.733
2.161PheArg: 2.161 ± 0.952
4.323PheSer: 4.323 ± 1.287
4.323PheThr: 4.323 ± 1.681
3.242PheVal: 3.242 ± 2.157
0.36PheTrp: 0.36 ± 0.19
1.441PheTyr: 1.441 ± 0.676
0.0PheXaa: 0.0 ± 0.0
Gly
2.161GlyAla: 2.161 ± 1.278
1.801GlyCys: 1.801 ± 1.162
3.602GlyAsp: 3.602 ± 0.602
3.602GlyGlu: 3.602 ± 0.987
3.242GlyPhe: 3.242 ± 0.897
4.683GlyGly: 4.683 ± 2.966
0.72GlyHis: 0.72 ± 0.738
3.963GlyIle: 3.963 ± 0.976
6.484GlyLys: 6.484 ± 1.316
3.602GlyLeu: 3.602 ± 0.853
1.081GlyMet: 1.081 ± 0.569
2.161GlyAsn: 2.161 ± 0.774
1.081GlyPro: 1.081 ± 0.513
2.161GlyGln: 2.161 ± 1.206
2.522GlyArg: 2.522 ± 1.403
3.602GlySer: 3.602 ± 1.421
5.403GlyThr: 5.403 ± 1.553
3.963GlyVal: 3.963 ± 1.128
1.081GlyTrp: 1.081 ± 0.569
1.441GlyTyr: 1.441 ± 0.591
0.0GlyXaa: 0.0 ± 0.0
His
1.081HisAla: 1.081 ± 0.569
1.081HisCys: 1.081 ± 0.603
1.801HisAsp: 1.801 ± 0.948
1.081HisGlu: 1.081 ± 0.569
0.72HisPhe: 0.72 ± 0.67
0.36HisGly: 0.36 ± 0.835
0.36HisHis: 0.36 ± 0.19
0.72HisIle: 0.72 ± 0.379
1.801HisLys: 1.801 ± 0.873
3.242HisLeu: 3.242 ± 1.258
0.36HisMet: 0.36 ± 0.728
1.081HisAsn: 1.081 ± 0.513
0.36HisPro: 0.36 ± 0.19
0.36HisGln: 0.36 ± 0.835
1.801HisArg: 1.801 ± 2.156
3.963HisSer: 3.963 ± 0.976
0.0HisThr: 0.0 ± 0.0
0.36HisVal: 0.36 ± 0.19
0.36HisTrp: 0.36 ± 0.19
0.36HisTyr: 0.36 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
3.963IleAla: 3.963 ± 2.256
1.801IleCys: 1.801 ± 1.006
3.242IleAsp: 3.242 ± 1.166
4.683IleGlu: 4.683 ± 1.249
3.242IlePhe: 3.242 ± 0.818
3.963IleGly: 3.963 ± 2.327
1.441IleHis: 1.441 ± 0.979
4.323IleIle: 4.323 ± 2.551
2.882IleLys: 2.882 ± 1.188
3.602IleLeu: 3.602 ± 0.983
1.801IleMet: 1.801 ± 0.645
1.441IleAsn: 1.441 ± 0.758
2.522IlePro: 2.522 ± 1.494
2.882IleGln: 2.882 ± 1.435
2.161IleArg: 2.161 ± 2.838
3.602IleSer: 3.602 ± 1.433
2.161IleThr: 2.161 ± 2.381
5.043IleVal: 5.043 ± 1.89
0.72IleTrp: 0.72 ± 0.738
2.161IleTyr: 2.161 ± 1.273
0.0IleXaa: 0.0 ± 0.0
Lys
3.242LysAla: 3.242 ± 1.258
0.72LysCys: 0.72 ± 0.872
2.522LysAsp: 2.522 ± 0.917
5.764LysGlu: 5.764 ± 1.849
3.602LysPhe: 3.602 ± 1.895
1.801LysGly: 1.801 ± 0.948
0.72LysHis: 0.72 ± 0.379
3.602LysIle: 3.602 ± 0.983
7.205LysLys: 7.205 ± 2.466
7.925LysLeu: 7.925 ± 1.107
1.081LysMet: 1.081 ± 0.569
1.801LysAsn: 1.801 ± 0.873
3.602LysPro: 3.602 ± 2.835
2.522LysGln: 2.522 ± 1.393
3.242LysArg: 3.242 ± 1.331
5.043LysSer: 5.043 ± 2.079
3.602LysThr: 3.602 ± 1.745
2.882LysVal: 2.882 ± 1.188
0.36LysTrp: 0.36 ± 0.19
2.161LysTyr: 2.161 ± 1.206
0.0LysXaa: 0.0 ± 0.0
Leu
8.285LeuAla: 8.285 ± 2.299
2.882LeuCys: 2.882 ± 1.302
5.403LeuAsp: 5.403 ± 1.217
5.043LeuGlu: 5.043 ± 0.84
3.963LeuPhe: 3.963 ± 1.611
5.764LeuGly: 5.764 ± 1.482
3.242LeuHis: 3.242 ± 1.258
5.403LeuIle: 5.403 ± 3.674
7.565LeuLys: 7.565 ± 1.95
9.366LeuLeu: 9.366 ± 2.662
0.72LeuMet: 0.72 ± 0.379
3.963LeuAsn: 3.963 ± 2.091
5.403LeuPro: 5.403 ± 1.63
2.161LeuGln: 2.161 ± 1.278
4.323LeuArg: 4.323 ± 1.792
7.565LeuSer: 7.565 ± 1.786
6.124LeuThr: 6.124 ± 1.097
5.764LeuVal: 5.764 ± 1.929
1.801LeuTrp: 1.801 ± 0.645
3.242LeuTyr: 3.242 ± 1.258
0.0LeuXaa: 0.0 ± 0.0
Met
3.242MetAla: 3.242 ± 1.184
1.081MetCys: 1.081 ± 0.569
1.081MetAsp: 1.081 ± 0.603
1.081MetGlu: 1.081 ± 0.569
0.0MetPhe: 0.0 ± 0.0
1.801MetGly: 1.801 ± 1.039
0.36MetHis: 0.36 ± 0.631
1.441MetIle: 1.441 ± 1.161
1.081MetLys: 1.081 ± 0.569
1.801MetLeu: 1.801 ± 0.948
0.36MetMet: 0.36 ± 0.19
0.36MetAsn: 0.36 ± 0.951
1.081MetPro: 1.081 ± 0.682
0.36MetGln: 0.36 ± 0.19
1.441MetArg: 1.441 ± 0.551
0.36MetSer: 0.36 ± 0.631
1.081MetThr: 1.081 ± 0.513
0.36MetVal: 0.36 ± 0.19
0.0MetTrp: 0.0 ± 0.0
0.72MetTyr: 0.72 ± 0.379
0.0MetXaa: 0.0 ± 0.0
Asn
2.882AsnAla: 2.882 ± 0.895
2.161AsnCys: 2.161 ± 1.137
1.441AsnAsp: 1.441 ± 0.551
1.801AsnGlu: 1.801 ± 1.147
5.043AsnPhe: 5.043 ± 1.779
1.081AsnGly: 1.081 ± 0.569
0.36AsnHis: 0.36 ± 0.19
0.72AsnIle: 0.72 ± 0.738
3.242AsnLys: 3.242 ± 3.567
4.683AsnLeu: 4.683 ± 1.301
1.081AsnMet: 1.081 ± 0.958
2.522AsnAsn: 2.522 ± 3.678
1.801AsnPro: 1.801 ± 0.645
1.441AsnGln: 1.441 ± 0.591
3.242AsnArg: 3.242 ± 0.684
2.161AsnSer: 2.161 ± 3.503
1.801AsnThr: 1.801 ± 2.104
4.323AsnVal: 4.323 ± 1.194
0.36AsnTrp: 0.36 ± 0.19
2.161AsnTyr: 2.161 ± 0.774
0.0AsnXaa: 0.0 ± 0.0
Pro
3.242ProAla: 3.242 ± 1.338
0.72ProCys: 0.72 ± 0.379
2.882ProAsp: 2.882 ± 0.736
5.043ProGlu: 5.043 ± 1.246
1.441ProPhe: 1.441 ± 0.676
2.161ProGly: 2.161 ± 1.273
1.081ProHis: 1.081 ± 0.682
1.441ProIle: 1.441 ± 0.551
2.161ProLys: 2.161 ± 0.774
2.882ProLeu: 2.882 ± 1.711
0.36ProMet: 0.36 ± 0.19
2.522ProAsn: 2.522 ± 1.494
4.323ProPro: 4.323 ± 3.713
1.801ProGln: 1.801 ± 0.86
3.602ProArg: 3.602 ± 1.382
1.801ProSer: 1.801 ± 1.7
2.522ProThr: 2.522 ± 1.872
3.242ProVal: 3.242 ± 1.885
0.72ProTrp: 0.72 ± 0.738
1.081ProTyr: 1.081 ± 0.83
0.0ProXaa: 0.0 ± 0.0
Gln
2.161GlnAla: 2.161 ± 1.206
1.081GlnCys: 1.081 ± 0.603
2.161GlnAsp: 2.161 ± 0.775
3.242GlnGlu: 3.242 ± 1.258
2.161GlnPhe: 2.161 ± 0.774
2.161GlnGly: 2.161 ± 1.137
1.081GlnHis: 1.081 ± 0.569
0.72GlnIle: 0.72 ± 0.67
1.081GlnLys: 1.081 ± 0.603
5.043GlnLeu: 5.043 ± 1.135
0.36GlnMet: 0.36 ± 0.523
1.081GlnAsn: 1.081 ± 1.162
1.801GlnPro: 1.801 ± 1.161
1.081GlnGln: 1.081 ± 0.513
2.882GlnArg: 2.882 ± 1.697
4.683GlnSer: 4.683 ± 2.037
1.081GlnThr: 1.081 ± 0.603
2.161GlnVal: 2.161 ± 1.045
0.36GlnTrp: 0.36 ± 0.19
0.72GlnTyr: 0.72 ± 0.379
0.0GlnXaa: 0.0 ± 0.0
Arg
3.602ArgAla: 3.602 ± 1.559
1.081ArgCys: 1.081 ± 2.171
3.242ArgAsp: 3.242 ± 1.166
1.801ArgGlu: 1.801 ± 0.601
4.683ArgPhe: 4.683 ± 1.232
4.323ArgGly: 4.323 ± 1.489
1.441ArgHis: 1.441 ± 0.979
2.522ArgIle: 2.522 ± 1.794
2.522ArgLys: 2.522 ± 1.327
6.844ArgLeu: 6.844 ± 1.405
1.081ArgMet: 1.081 ± 0.513
2.522ArgAsn: 2.522 ± 0.521
1.441ArgPro: 1.441 ± 2.07
3.242ArgGln: 3.242 ± 1.184
3.242ArgArg: 3.242 ± 2.501
3.602ArgSer: 3.602 ± 0.602
1.081ArgThr: 1.081 ± 0.569
2.161ArgVal: 2.161 ± 0.82
1.081ArgTrp: 1.081 ± 1.212
3.242ArgTyr: 3.242 ± 0.897
0.0ArgXaa: 0.0 ± 0.0
Ser
4.683SerAla: 4.683 ± 1.301
2.161SerCys: 2.161 ± 1.194
3.242SerAsp: 3.242 ± 0.684
3.602SerGlu: 3.602 ± 1.484
3.602SerPhe: 3.602 ± 0.904
2.161SerGly: 2.161 ± 1.137
2.161SerHis: 2.161 ± 0.944
4.323SerIle: 4.323 ± 1.709
4.323SerLys: 4.323 ± 1.141
5.764SerLeu: 5.764 ± 1.295
0.72SerMet: 0.72 ± 0.379
4.683SerAsn: 4.683 ± 1.371
3.963SerPro: 3.963 ± 1.999
2.522SerGln: 2.522 ± 0.521
5.403SerArg: 5.403 ± 1.185
5.403SerSer: 5.403 ± 1.637
3.602SerThr: 3.602 ± 0.901
4.683SerVal: 4.683 ± 4.609
0.36SerTrp: 0.36 ± 0.19
4.323SerTyr: 4.323 ± 1.194
0.0SerXaa: 0.0 ± 0.0
Thr
2.522ThrAla: 2.522 ± 1.165
1.801ThrCys: 1.801 ± 2.449
1.441ThrAsp: 1.441 ± 0.551
3.242ThrGlu: 3.242 ± 1.087
5.764ThrPhe: 5.764 ± 0.989
3.242ThrGly: 3.242 ± 1.096
1.441ThrHis: 1.441 ± 0.758
2.161ThrIle: 2.161 ± 1.66
3.602ThrLys: 3.602 ± 0.853
3.242ThrLeu: 3.242 ± 0.818
1.441ThrMet: 1.441 ± 0.758
3.242ThrAsn: 3.242 ± 0.851
2.161ThrPro: 2.161 ± 0.774
1.801ThrGln: 1.801 ± 0.948
3.602ThrArg: 3.602 ± 1.793
4.323ThrSer: 4.323 ± 2.09
3.242ThrThr: 3.242 ± 1.222
2.161ThrVal: 2.161 ± 1.137
0.36ThrTrp: 0.36 ± 0.19
2.522ThrTyr: 2.522 ± 1.403
0.0ThrXaa: 0.0 ± 0.0
Val
5.043ValAla: 5.043 ± 1.556
1.801ValCys: 1.801 ± 1.557
2.522ValAsp: 2.522 ± 1.327
3.602ValGlu: 3.602 ± 2.131
3.242ValPhe: 3.242 ± 4.346
3.602ValGly: 3.602 ± 2.037
1.441ValHis: 1.441 ± 0.758
3.602ValIle: 3.602 ± 1.911
3.602ValLys: 3.602 ± 0.993
7.925ValLeu: 7.925 ± 1.179
1.801ValMet: 1.801 ± 0.948
2.522ValAsn: 2.522 ± 1.055
2.882ValPro: 2.882 ± 1.435
2.882ValGln: 2.882 ± 1.338
5.043ValArg: 5.043 ± 1.635
5.043ValSer: 5.043 ± 1.585
4.683ValThr: 4.683 ± 1.666
2.522ValVal: 2.522 ± 0.521
0.0ValTrp: 0.0 ± 0.0
2.161ValTyr: 2.161 ± 1.416
0.0ValXaa: 0.0 ± 0.0
Trp
1.081TrpAla: 1.081 ± 1.021
0.72TrpCys: 0.72 ± 0.379
0.36TrpAsp: 0.36 ± 0.19
0.0TrpGlu: 0.0 ± 0.0
0.72TrpPhe: 0.72 ± 0.379
0.36TrpGly: 0.36 ± 0.951
0.36TrpHis: 0.36 ± 0.19
0.36TrpIle: 0.36 ± 0.779
0.0TrpLys: 0.0 ± 0.0
1.441TrpLeu: 1.441 ± 0.758
0.36TrpMet: 0.36 ± 0.19
1.081TrpAsn: 1.081 ± 1.162
0.36TrpPro: 0.36 ± 0.19
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.72TrpSer: 0.72 ± 0.738
1.081TrpThr: 1.081 ± 0.569
1.081TrpVal: 1.081 ± 0.569
0.0TrpTrp: 0.0 ± 0.0
0.36TrpTyr: 0.36 ± 0.19
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.801TyrAla: 1.801 ± 1.365
1.081TyrCys: 1.081 ± 0.513
1.801TyrAsp: 1.801 ± 0.948
2.161TyrGlu: 2.161 ± 0.733
1.801TyrPhe: 1.801 ± 1.006
1.441TyrGly: 1.441 ± 0.551
0.36TyrHis: 0.36 ± 0.19
2.522TyrIle: 2.522 ± 1.327
2.161TyrLys: 2.161 ± 0.733
4.683TyrLeu: 4.683 ± 1.483
0.72TyrMet: 0.72 ± 0.543
2.882TyrAsn: 2.882 ± 1.222
1.801TyrPro: 1.801 ± 0.873
2.161TyrGln: 2.161 ± 1.693
1.801TyrArg: 1.801 ± 0.873
1.441TyrSer: 1.441 ± 0.758
2.161TyrThr: 2.161 ± 1.206
2.161TyrVal: 2.161 ± 1.026
0.36TyrTrp: 0.36 ± 0.19
0.36TyrTyr: 0.36 ± 0.19
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2777 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski