Amino acid dipepetide frequency for Beihai tombus-like virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.193AlaAla: 3.193 ± 0.748
1.277AlaCys: 1.277 ± 0.522
3.831AlaAsp: 3.831 ± 1.181
5.109AlaGlu: 5.109 ± 2.826
6.386AlaPhe: 6.386 ± 0.911
7.024AlaGly: 7.024 ± 1.105
2.554AlaHis: 2.554 ± 2.155
5.747AlaIle: 5.747 ± 3.09
7.024AlaLys: 7.024 ± 1.729
3.193AlaLeu: 3.193 ± 2.25
2.554AlaMet: 2.554 ± 0.882
2.554AlaAsn: 2.554 ± 1.266
1.916AlaPro: 1.916 ± 0.499
1.277AlaGln: 1.277 ± 0.522
7.663AlaArg: 7.663 ± 3.287
5.109AlaSer: 5.109 ± 2.449
3.193AlaThr: 3.193 ± 0.971
3.831AlaVal: 3.831 ± 1.979
0.0AlaTrp: 0.0 ± 0.0
3.193AlaTyr: 3.193 ± 0.702
0.0AlaXaa: 0.0 ± 0.0
Cys
1.277CysAla: 1.277 ± 1.41
0.0CysCys: 0.0 ± 0.0
1.277CysAsp: 1.277 ± 0.997
1.916CysGlu: 1.916 ± 0.906
1.277CysPhe: 1.277 ± 0.522
0.639CysGly: 0.639 ± 0.705
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.554CysLys: 2.554 ± 1.994
3.193CysLeu: 3.193 ± 0.895
0.0CysMet: 0.0 ± 0.0
1.277CysAsn: 1.277 ± 0.633
3.193CysPro: 3.193 ± 1.228
3.193CysGln: 3.193 ± 1.036
1.916CysArg: 1.916 ± 0.499
1.277CysSer: 1.277 ± 0.633
1.277CysThr: 1.277 ± 0.633
0.639CysVal: 0.639 ± 1.17
0.0CysTrp: 0.0 ± 0.0
1.277CysTyr: 1.277 ± 0.633
0.0CysXaa: 0.0 ± 0.0
Asp
1.277AspAla: 1.277 ± 0.633
1.277AspCys: 1.277 ± 0.914
3.193AspAsp: 3.193 ± 0.702
3.831AspGlu: 3.831 ± 1.225
1.916AspPhe: 1.916 ± 0.906
3.193AspGly: 3.193 ± 0.977
0.0AspHis: 0.0 ± 0.0
1.916AspIle: 1.916 ± 0.949
1.916AspLys: 1.916 ± 0.906
5.747AspLeu: 5.747 ± 2.235
0.639AspMet: 0.639 ± 0.316
3.193AspAsn: 3.193 ± 1.582
4.47AspPro: 4.47 ± 2.215
1.277AspGln: 1.277 ± 0.633
1.277AspArg: 1.277 ± 0.633
3.193AspSer: 3.193 ± 1.036
2.554AspThr: 2.554 ± 1.393
2.554AspVal: 2.554 ± 0.652
2.554AspTrp: 2.554 ± 1.266
1.916AspTyr: 1.916 ± 0.822
0.0AspXaa: 0.0 ± 0.0
Glu
2.554GluAla: 2.554 ± 0.926
0.0GluCys: 0.0 ± 0.0
0.0GluAsp: 0.0 ± 0.0
1.916GluGlu: 1.916 ± 0.949
3.193GluPhe: 3.193 ± 1.036
1.916GluGly: 1.916 ± 0.949
3.831GluHis: 3.831 ± 1.811
4.47GluIle: 4.47 ± 0.918
1.277GluLys: 1.277 ± 0.522
4.47GluLeu: 4.47 ± 1.723
1.277GluMet: 1.277 ± 0.784
1.277GluAsn: 1.277 ± 0.633
3.193GluPro: 3.193 ± 1.036
5.747GluGln: 5.747 ± 1.653
5.109GluArg: 5.109 ± 1.77
3.193GluSer: 3.193 ± 0.971
3.831GluThr: 3.831 ± 1.811
2.554GluVal: 2.554 ± 1.266
0.639GluTrp: 0.639 ± 0.316
1.277GluTyr: 1.277 ± 0.633
0.0GluXaa: 0.0 ± 0.0
Phe
1.916PheAla: 1.916 ± 0.499
0.639PheCys: 0.639 ± 0.316
1.277PheAsp: 1.277 ± 0.997
1.916PheGlu: 1.916 ± 1.164
0.639PhePhe: 0.639 ± 0.316
1.916PheGly: 1.916 ± 1.933
0.639PheHis: 0.639 ± 0.316
1.277PheIle: 1.277 ± 0.914
0.639PheLys: 0.639 ± 0.705
4.47PheLeu: 4.47 ± 1.552
0.639PheMet: 0.639 ± 0.312
0.639PheAsn: 0.639 ± 0.316
2.554PhePro: 2.554 ± 0.92
0.639PheGln: 0.639 ± 0.316
2.554PheArg: 2.554 ± 1.393
2.554PheSer: 2.554 ± 0.652
1.916PheThr: 1.916 ± 0.499
2.554PheVal: 2.554 ± 0.652
0.639PheTrp: 0.639 ± 0.316
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.109GlyAla: 5.109 ± 3.22
1.916GlyCys: 1.916 ± 0.949
4.47GlyAsp: 4.47 ± 2.215
0.0GlyGlu: 0.0 ± 0.0
1.916GlyPhe: 1.916 ± 1.124
1.916GlyGly: 1.916 ± 0.949
3.831GlyHis: 3.831 ± 1.899
1.916GlyIle: 1.916 ± 0.499
1.916GlyLys: 1.916 ± 0.949
8.301GlyLeu: 8.301 ± 2.639
0.0GlyMet: 0.0 ± 0.0
1.916GlyAsn: 1.916 ± 0.949
4.47GlyPro: 4.47 ± 1.995
1.916GlyGln: 1.916 ± 1.2
4.47GlyArg: 4.47 ± 2.215
3.193GlySer: 3.193 ± 1.709
2.554GlyThr: 2.554 ± 1.045
5.109GlyVal: 5.109 ± 1.77
3.193GlyTrp: 3.193 ± 2.899
3.193GlyTyr: 3.193 ± 0.977
0.0GlyXaa: 0.0 ± 0.0
His
1.277HisAla: 1.277 ± 0.914
1.277HisCys: 1.277 ± 0.997
1.277HisAsp: 1.277 ± 0.633
1.916HisGlu: 1.916 ± 0.906
0.0HisPhe: 0.0 ± 0.0
1.277HisGly: 1.277 ± 0.633
0.639HisHis: 0.639 ± 1.17
0.639HisIle: 0.639 ± 1.17
1.277HisLys: 1.277 ± 0.914
4.47HisLeu: 4.47 ± 4.23
2.554HisMet: 2.554 ± 1.647
0.639HisAsn: 0.639 ± 0.316
2.554HisPro: 2.554 ± 0.92
2.554HisGln: 2.554 ± 1.045
1.916HisArg: 1.916 ± 0.949
1.916HisSer: 1.916 ± 1.164
0.639HisThr: 0.639 ± 0.316
1.916HisVal: 1.916 ± 0.906
0.639HisTrp: 0.639 ± 0.316
1.277HisTyr: 1.277 ± 0.633
0.0HisXaa: 0.0 ± 0.0
Ile
3.193IleAla: 3.193 ± 1.369
0.0IleCys: 0.0 ± 0.0
1.916IleAsp: 1.916 ± 0.499
4.47IleGlu: 4.47 ± 1.427
0.639IlePhe: 0.639 ± 0.705
3.831IleGly: 3.831 ± 1.181
1.277IleHis: 1.277 ± 0.997
2.554IleIle: 2.554 ± 1.045
1.277IleLys: 1.277 ± 0.522
5.109IleLeu: 5.109 ± 2.407
1.277IleMet: 1.277 ± 1.41
1.916IleAsn: 1.916 ± 0.949
0.0IlePro: 0.0 ± 0.0
1.916IleGln: 1.916 ± 0.949
4.47IleArg: 4.47 ± 0.751
3.831IleSer: 3.831 ± 1.876
6.386IleThr: 6.386 ± 2.821
1.916IleVal: 1.916 ± 0.499
0.0IleTrp: 0.0 ± 0.0
0.639IleTyr: 0.639 ± 0.316
0.0IleXaa: 0.0 ± 0.0
Lys
1.277LysAla: 1.277 ± 0.997
0.639LysCys: 0.639 ± 0.316
1.916LysAsp: 1.916 ± 0.499
1.916LysGlu: 1.916 ± 1.2
1.277LysPhe: 1.277 ± 0.633
3.193LysGly: 3.193 ± 1.582
0.0LysHis: 0.0 ± 0.0
1.916LysIle: 1.916 ± 1.2
0.639LysLys: 0.639 ± 0.316
3.831LysLeu: 3.831 ± 1.125
1.277LysMet: 1.277 ± 0.914
0.639LysAsn: 0.639 ± 0.316
3.193LysPro: 3.193 ± 0.895
1.916LysGln: 1.916 ± 0.499
3.831LysArg: 3.831 ± 1.811
0.639LysSer: 0.639 ± 0.316
1.916LysThr: 1.916 ± 1.164
5.109LysVal: 5.109 ± 1.186
0.0LysTrp: 0.0 ± 0.0
1.916LysTyr: 1.916 ± 0.949
0.639LysXaa: 0.639 ± 0.316
Leu
10.856LeuAla: 10.856 ± 0.681
3.831LeuCys: 3.831 ± 1.125
3.193LeuAsp: 3.193 ± 1.036
4.47LeuGlu: 4.47 ± 0.751
1.277LeuPhe: 1.277 ± 0.997
4.47LeuGly: 4.47 ± 1.468
3.193LeuHis: 3.193 ± 2.05
2.554LeuIle: 2.554 ± 0.652
3.193LeuLys: 3.193 ± 1.711
12.133LeuLeu: 12.133 ± 5.394
3.831LeuMet: 3.831 ± 1.125
2.554LeuAsn: 2.554 ± 1.266
4.47LeuPro: 4.47 ± 1.995
4.47LeuGln: 4.47 ± 2.656
10.217LeuArg: 10.217 ± 2.182
10.217LeuSer: 10.217 ± 1.302
7.024LeuThr: 7.024 ± 1.759
3.831LeuVal: 3.831 ± 2.9
0.639LeuTrp: 0.639 ± 0.705
2.554LeuTyr: 2.554 ± 1.828
0.0LeuXaa: 0.0 ± 0.0
Met
2.554MetAla: 2.554 ± 1.045
0.0MetCys: 0.0 ± 0.0
2.554MetAsp: 2.554 ± 2.656
0.0MetGlu: 0.0 ± 0.0
0.639MetPhe: 0.639 ± 0.705
1.277MetGly: 1.277 ± 0.633
0.639MetHis: 0.639 ± 1.17
0.639MetIle: 0.639 ± 1.094
0.639MetLys: 0.639 ± 0.316
1.277MetLeu: 1.277 ± 1.433
1.277MetMet: 1.277 ± 1.41
1.916MetAsn: 1.916 ± 1.124
1.277MetPro: 1.277 ± 1.41
0.639MetGln: 0.639 ± 1.094
1.916MetArg: 1.916 ± 0.499
3.193MetSer: 3.193 ± 1.369
1.916MetThr: 1.916 ± 0.822
0.0MetVal: 0.0 ± 0.0
0.639MetTrp: 0.639 ± 0.316
1.277MetTyr: 1.277 ± 0.633
0.0MetXaa: 0.0 ± 0.0
Asn
6.386AsnAla: 6.386 ± 0.882
1.277AsnCys: 1.277 ± 0.633
3.831AsnAsp: 3.831 ± 1.225
1.916AsnGlu: 1.916 ± 0.949
0.639AsnPhe: 0.639 ± 1.094
1.277AsnGly: 1.277 ± 0.633
1.277AsnHis: 1.277 ± 0.633
0.639AsnIle: 0.639 ± 0.316
0.0AsnLys: 0.0 ± 0.0
3.193AsnLeu: 3.193 ± 0.971
0.639AsnMet: 0.639 ± 0.316
1.277AsnAsn: 1.277 ± 0.633
3.193AsnPro: 3.193 ± 1.582
2.554AsnGln: 2.554 ± 0.846
3.831AsnArg: 3.831 ± 1.128
2.554AsnSer: 2.554 ± 0.882
1.277AsnThr: 1.277 ± 0.522
1.277AsnVal: 1.277 ± 0.633
0.0AsnTrp: 0.0 ± 0.0
0.639AsnTyr: 0.639 ± 0.316
0.0AsnXaa: 0.0 ± 0.0
Pro
3.831ProAla: 3.831 ± 1.567
0.639ProCys: 0.639 ± 0.316
5.109ProAsp: 5.109 ± 0.931
3.193ProGlu: 3.193 ± 0.895
0.639ProPhe: 0.639 ± 0.316
1.277ProGly: 1.277 ± 1.396
3.193ProHis: 3.193 ± 1.878
3.831ProIle: 3.831 ± 1.181
3.831ProLys: 3.831 ± 0.997
6.386ProLeu: 6.386 ± 0.664
0.639ProMet: 0.639 ± 0.316
0.639ProAsn: 0.639 ± 0.316
7.663ProPro: 7.663 ± 2.348
0.0ProGln: 0.0 ± 0.0
3.193ProArg: 3.193 ± 1.582
7.663ProSer: 7.663 ± 1.299
4.47ProThr: 4.47 ± 1.552
7.663ProVal: 7.663 ± 2.348
0.639ProTrp: 0.639 ± 0.316
0.639ProTyr: 0.639 ± 0.316
0.0ProXaa: 0.0 ± 0.0
Gln
4.47GlnAla: 4.47 ± 1.427
3.193GlnCys: 3.193 ± 0.971
0.639GlnAsp: 0.639 ± 0.316
1.277GlnGlu: 1.277 ± 0.633
0.0GlnPhe: 0.0 ± 0.0
1.916GlnGly: 1.916 ± 0.949
0.639GlnHis: 0.639 ± 0.316
1.916GlnIle: 1.916 ± 1.922
0.639GlnLys: 0.639 ± 1.17
4.47GlnLeu: 4.47 ± 1.995
3.193GlnMet: 3.193 ± 2.019
2.554GlnAsn: 2.554 ± 0.846
1.916GlnPro: 1.916 ± 0.499
1.277GlnGln: 1.277 ± 0.522
6.386GlnArg: 6.386 ± 1.845
1.916GlnSer: 1.916 ± 0.822
1.277GlnThr: 1.277 ± 0.522
2.554GlnVal: 2.554 ± 0.652
1.277GlnTrp: 1.277 ± 1.41
0.639GlnTyr: 0.639 ± 0.316
0.0GlnXaa: 0.0 ± 0.0
Arg
5.109ArgAla: 5.109 ± 1.304
2.554ArgCys: 2.554 ± 2.274
1.916ArgAsp: 1.916 ± 0.949
7.024ArgGlu: 7.024 ± 1.604
2.554ArgPhe: 2.554 ± 0.846
7.663ArgGly: 7.663 ± 1.299
1.277ArgHis: 1.277 ± 0.633
2.554ArgIle: 2.554 ± 1.045
2.554ArgLys: 2.554 ± 0.92
7.663ArgLeu: 7.663 ± 2.348
1.277ArgMet: 1.277 ± 0.633
5.747ArgAsn: 5.747 ± 1.98
3.193ArgPro: 3.193 ± 0.895
5.109ArgGln: 5.109 ± 1.836
5.109ArgArg: 5.109 ± 1.72
5.109ArgSer: 5.109 ± 1.691
5.109ArgThr: 5.109 ± 1.72
6.386ArgVal: 6.386 ± 0.882
1.277ArgTrp: 1.277 ± 0.633
2.554ArgTyr: 2.554 ± 1.266
0.0ArgXaa: 0.0 ± 0.0
Ser
6.386SerAla: 6.386 ± 3.847
1.916SerCys: 1.916 ± 0.949
2.554SerAsp: 2.554 ± 1.266
4.47SerGlu: 4.47 ± 2.228
0.0SerPhe: 0.0 ± 0.0
7.663SerGly: 7.663 ± 2.186
2.554SerHis: 2.554 ± 2.507
5.747SerIle: 5.747 ± 1.934
3.193SerLys: 3.193 ± 0.748
5.109SerLeu: 5.109 ± 1.77
0.0SerMet: 0.0 ± 0.0
3.193SerAsn: 3.193 ± 1.374
6.386SerPro: 6.386 ± 1.44
4.47SerGln: 4.47 ± 2.615
3.831SerArg: 3.831 ± 0.639
3.193SerSer: 3.193 ± 2.259
1.277SerThr: 1.277 ± 0.914
6.386SerVal: 6.386 ± 0.882
0.639SerTrp: 0.639 ± 0.316
1.916SerTyr: 1.916 ± 0.822
0.0SerXaa: 0.0 ± 0.0
Thr
5.109ThrAla: 5.109 ± 0.804
3.193ThrCys: 3.193 ± 3.141
1.916ThrAsp: 1.916 ± 0.949
1.916ThrGlu: 1.916 ± 0.499
1.277ThrPhe: 1.277 ± 0.633
5.109ThrGly: 5.109 ± 1.453
3.193ThrHis: 3.193 ± 0.748
2.554ThrIle: 2.554 ± 1.045
1.277ThrLys: 1.277 ± 0.633
3.831ThrLeu: 3.831 ± 1.992
1.916ThrMet: 1.916 ± 2.086
1.916ThrAsn: 1.916 ± 2.086
4.47ThrPro: 4.47 ± 1.117
1.277ThrGln: 1.277 ± 0.633
4.47ThrArg: 4.47 ± 1.468
5.747ThrSer: 5.747 ± 1.385
3.193ThrThr: 3.193 ± 1.582
1.916ThrVal: 1.916 ± 1.2
1.916ThrTrp: 1.916 ± 0.949
1.916ThrTyr: 1.916 ± 1.164
0.0ThrXaa: 0.0 ± 0.0
Val
3.831ValAla: 3.831 ± 0.639
1.277ValCys: 1.277 ± 0.633
3.831ValAsp: 3.831 ± 1.174
1.277ValGlu: 1.277 ± 0.633
3.193ValPhe: 3.193 ± 1.374
1.916ValGly: 1.916 ± 0.499
0.639ValHis: 0.639 ± 1.094
3.193ValIle: 3.193 ± 0.971
1.277ValLys: 1.277 ± 0.633
5.109ValLeu: 5.109 ± 1.49
0.639ValMet: 0.639 ± 0.769
1.916ValAsn: 1.916 ± 0.499
5.747ValPro: 5.747 ± 1.332
1.277ValGln: 1.277 ± 0.522
7.663ValArg: 7.663 ± 3.008
3.831ValSer: 3.831 ± 0.997
4.47ValThr: 4.47 ± 2.634
6.386ValVal: 6.386 ± 1.404
1.916ValTrp: 1.916 ± 0.906
4.47ValTyr: 4.47 ± 0.751
0.0ValXaa: 0.0 ± 0.0
Trp
1.277TrpAla: 1.277 ± 0.633
1.277TrpCys: 1.277 ± 0.633
1.916TrpAsp: 1.916 ± 1.991
0.0TrpGlu: 0.0 ± 0.0
1.916TrpPhe: 1.916 ± 1.164
0.639TrpGly: 0.639 ± 0.316
0.639TrpHis: 0.639 ± 0.316
1.277TrpIle: 1.277 ± 0.522
1.277TrpLys: 1.277 ± 0.633
3.193TrpLeu: 3.193 ± 0.895
0.0TrpMet: 0.0 ± 0.0
0.639TrpAsn: 0.639 ± 0.316
0.639TrpPro: 0.639 ± 0.316
0.0TrpGln: 0.0 ± 0.0
0.639TrpArg: 0.639 ± 0.316
0.0TrpSer: 0.0 ± 0.0
0.639TrpThr: 0.639 ± 0.316
0.639TrpVal: 0.639 ± 1.094
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.747TyrAla: 5.747 ± 2.448
0.0TyrCys: 0.0 ± 0.0
1.277TyrAsp: 1.277 ± 0.633
3.193TyrGlu: 3.193 ± 1.036
0.639TyrPhe: 0.639 ± 0.316
2.554TyrGly: 2.554 ± 0.882
0.639TyrHis: 0.639 ± 0.316
1.277TyrIle: 1.277 ± 0.633
1.277TyrLys: 1.277 ± 0.633
3.831TyrLeu: 3.831 ± 1.899
0.0TyrMet: 0.0 ± 0.0
1.277TyrAsn: 1.277 ± 1.433
0.639TyrPro: 0.639 ± 0.316
0.639TyrGln: 0.639 ± 0.316
1.277TyrArg: 1.277 ± 0.633
2.554TyrSer: 2.554 ± 1.266
3.193TyrThr: 3.193 ± 0.977
1.277TyrVal: 1.277 ± 0.914
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.639XaaGly: 0.639 ± 0.316
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1567 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski