Amino acid dipepetide frequency for Kaisodi virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.291AlaAla: 3.291 ± 1.862
1.92AlaCys: 1.92 ± 0.346
3.017AlaAsp: 3.017 ± 0.79
1.92AlaGlu: 1.92 ± 0.992
1.92AlaPhe: 1.92 ± 0.561
3.291AlaGly: 3.291 ± 0.496
1.097AlaHis: 1.097 ± 1.142
4.663AlaIle: 4.663 ± 0.857
1.92AlaLys: 1.92 ± 1.184
5.76AlaLeu: 5.76 ± 0.933
1.097AlaMet: 1.097 ± 0.464
1.646AlaAsn: 1.646 ± 1.022
1.92AlaPro: 1.92 ± 1.331
0.823AlaGln: 0.823 ± 0.836
4.114AlaArg: 4.114 ± 0.72
4.388AlaSer: 4.388 ± 1.378
1.646AlaThr: 1.646 ± 0.451
3.017AlaVal: 3.017 ± 0.746
0.549AlaTrp: 0.549 ± 0.192
1.097AlaTyr: 1.097 ± 1.142
0.0AlaXaa: 0.0 ± 0.0
Cys
0.549CysAla: 0.549 ± 0.659
0.274CysCys: 0.274 ± 0.283
1.097CysAsp: 1.097 ± 0.383
1.371CysGlu: 1.371 ± 0.635
2.468CysPhe: 2.468 ± 1.739
1.646CysGly: 1.646 ± 0.397
0.274CysHis: 0.274 ± 0.283
0.823CysIle: 0.823 ± 0.507
2.743CysLys: 2.743 ± 1.768
3.017CysLeu: 3.017 ± 1.236
0.549CysMet: 0.549 ± 0.192
0.823CysAsn: 0.823 ± 0.452
1.371CysPro: 1.371 ± 1.01
1.097CysGln: 1.097 ± 0.636
1.92CysArg: 1.92 ± 0.346
3.566CysSer: 3.566 ± 2.601
1.097CysThr: 1.097 ± 0.636
1.097CysVal: 1.097 ± 0.349
0.549CysTrp: 0.549 ± 0.479
1.097CysTyr: 1.097 ± 1.224
0.0CysXaa: 0.0 ± 0.0
Asp
3.566AspAla: 3.566 ± 1.025
2.743AspCys: 2.743 ± 1.768
3.017AspAsp: 3.017 ± 1.489
3.566AspGlu: 3.566 ± 1.071
3.291AspPhe: 3.291 ± 0.519
1.371AspGly: 1.371 ± 0.383
0.549AspHis: 0.549 ± 0.192
1.371AspIle: 1.371 ± 0.685
2.194AspLys: 2.194 ± 0.68
6.034AspLeu: 6.034 ± 1.597
0.823AspMet: 0.823 ± 0.225
2.194AspAsn: 2.194 ± 0.482
3.017AspPro: 3.017 ± 0.746
3.017AspGln: 3.017 ± 0.933
1.92AspArg: 1.92 ± 0.561
3.84AspSer: 3.84 ± 0.795
3.291AspThr: 3.291 ± 0.496
4.388AspVal: 4.388 ± 1.137
1.646AspTrp: 1.646 ± 0.613
1.097AspTyr: 1.097 ± 0.396
0.0AspXaa: 0.0 ± 0.0
Glu
3.017GluAla: 3.017 ± 0.506
1.646GluCys: 1.646 ± 1.695
4.388GluAsp: 4.388 ± 0.787
4.663GluGlu: 4.663 ± 1.471
3.84GluPhe: 3.84 ± 1.219
2.468GluGly: 2.468 ± 0.484
0.549GluHis: 0.549 ± 0.338
3.291GluIle: 3.291 ± 1.597
3.566GluLys: 3.566 ± 1.045
4.663GluLeu: 4.663 ± 1.302
1.92GluMet: 1.92 ± 0.87
1.646GluAsn: 1.646 ± 0.451
0.823GluPro: 0.823 ± 0.225
1.371GluGln: 1.371 ± 0.933
6.308GluArg: 6.308 ± 0.808
5.211GluSer: 5.211 ± 0.453
5.485GluThr: 5.485 ± 1.566
3.566GluVal: 3.566 ± 0.238
1.097GluTrp: 1.097 ± 0.464
2.194GluTyr: 2.194 ± 1.351
0.0GluXaa: 0.0 ± 0.0
Phe
1.646PheAla: 1.646 ± 0.66
1.097PheCys: 1.097 ± 0.729
2.194PheAsp: 2.194 ± 0.68
2.468PheGlu: 2.468 ± 0.464
3.291PhePhe: 3.291 ± 0.901
2.743PheGly: 2.743 ± 0.733
1.646PheHis: 1.646 ± 0.66
1.646PheIle: 1.646 ± 0.66
4.663PheLys: 4.663 ± 1.025
8.228PheLeu: 8.228 ± 1.076
1.371PheMet: 1.371 ± 0.558
1.646PheAsn: 1.646 ± 0.451
1.92PhePro: 1.92 ± 0.823
1.371PheGln: 1.371 ± 0.845
2.743PheArg: 2.743 ± 0.924
5.211PheSer: 5.211 ± 1.331
3.017PheThr: 3.017 ± 0.909
1.92PheVal: 1.92 ± 0.374
0.274PheTrp: 0.274 ± 0.169
1.92PheTyr: 1.92 ± 0.562
0.0PheXaa: 0.0 ± 0.0
Gly
2.194GlyAla: 2.194 ± 2.585
1.097GlyCys: 1.097 ± 0.383
2.194GlyAsp: 2.194 ± 0.971
1.646GlyGlu: 1.646 ± 0.905
4.114GlyPhe: 4.114 ± 1.021
4.114GlyGly: 4.114 ± 1.024
0.823GlyHis: 0.823 ± 0.225
4.388GlyIle: 4.388 ± 0.75
3.566GlyLys: 3.566 ± 2.134
3.291GlyLeu: 3.291 ± 0.855
3.017GlyMet: 3.017 ± 1.457
2.468GlyAsn: 2.468 ± 0.484
2.194GlyPro: 2.194 ± 0.598
2.743GlyGln: 2.743 ± 0.395
2.743GlyArg: 2.743 ± 0.914
4.388GlySer: 4.388 ± 1.977
1.92GlyThr: 1.92 ± 0.823
4.937GlyVal: 4.937 ± 0.906
0.823GlyTrp: 0.823 ± 0.783
1.646GlyTyr: 1.646 ± 0.422
0.0GlyXaa: 0.0 ± 0.0
His
0.274HisAla: 0.274 ± 0.169
1.097HisCys: 1.097 ± 0.383
0.549HisAsp: 0.549 ± 0.479
1.097HisGlu: 1.097 ± 0.383
2.194HisPhe: 2.194 ± 0.767
1.371HisGly: 1.371 ± 0.685
2.468HisHis: 2.468 ± 0.676
1.371HisIle: 1.371 ± 0.5
1.646HisLys: 1.646 ± 0.795
1.646HisLeu: 1.646 ± 1.167
0.549HisMet: 0.549 ± 0.904
1.097HisAsn: 1.097 ± 0.383
1.097HisPro: 1.097 ± 0.349
1.371HisGln: 1.371 ± 0.5
1.92HisArg: 1.92 ± 1.183
3.291HisSer: 3.291 ± 1.047
1.097HisThr: 1.097 ± 0.464
1.646HisVal: 1.646 ± 0.66
0.823HisTrp: 0.823 ± 0.507
0.549HisTyr: 0.549 ± 0.338
0.0HisXaa: 0.0 ± 0.0
Ile
2.194IleAla: 2.194 ± 0.988
2.194IleCys: 2.194 ± 0.598
1.371IleAsp: 1.371 ± 0.5
1.646IleGlu: 1.646 ± 0.66
2.743IlePhe: 2.743 ± 0.959
3.84IleGly: 3.84 ± 1.041
1.92IleHis: 1.92 ± 0.821
4.388IleIle: 4.388 ± 1.012
3.291IleLys: 3.291 ± 0.776
8.502IleLeu: 8.502 ± 0.818
2.194IleMet: 2.194 ± 0.819
1.646IleAsn: 1.646 ± 1.088
3.017IlePro: 3.017 ± 0.819
4.388IleGln: 4.388 ± 1.774
2.743IleArg: 2.743 ± 0.859
6.308IleSer: 6.308 ± 1.509
3.291IleThr: 3.291 ± 0.181
3.017IleVal: 3.017 ± 1.467
0.549IleTrp: 0.549 ± 0.479
1.371IleTyr: 1.371 ± 0.933
0.0IleXaa: 0.0 ± 0.0
Lys
4.937LysAla: 4.937 ± 0.715
1.097LysCys: 1.097 ± 0.383
2.743LysAsp: 2.743 ± 1.786
4.388LysGlu: 4.388 ± 0.948
1.097LysPhe: 1.097 ± 0.396
2.743LysGly: 2.743 ± 0.569
1.371LysHis: 1.371 ± 0.5
5.211LysIle: 5.211 ± 2.663
4.937LysLys: 4.937 ± 0.395
5.76LysLeu: 5.76 ± 2.242
3.84LysMet: 3.84 ± 1.319
1.371LysAsn: 1.371 ± 0.365
1.097LysPro: 1.097 ± 0.349
2.468LysGln: 2.468 ± 0.747
2.468LysArg: 2.468 ± 0.851
4.663LysSer: 4.663 ± 0.851
4.663LysThr: 4.663 ± 0.494
3.291LysVal: 3.291 ± 1.092
1.097LysTrp: 1.097 ± 0.383
1.097LysTyr: 1.097 ± 0.349
0.0LysXaa: 0.0 ± 0.0
Leu
6.308LeuAla: 6.308 ± 1.601
1.097LeuCys: 1.097 ± 0.518
5.211LeuAsp: 5.211 ± 1.077
8.502LeuGlu: 8.502 ± 1.653
4.114LeuPhe: 4.114 ± 1.511
7.405LeuGly: 7.405 ± 0.547
3.017LeuHis: 3.017 ± 0.746
6.034LeuIle: 6.034 ± 1.306
6.583LeuLys: 6.583 ± 0.771
9.6LeuLeu: 9.6 ± 1.226
1.646LeuMet: 1.646 ± 0.732
3.017LeuAsn: 3.017 ± 0.476
3.566LeuPro: 3.566 ± 1.106
5.211LeuGln: 5.211 ± 1.304
6.857LeuArg: 6.857 ± 1.656
6.583LeuSer: 6.583 ± 1.166
7.954LeuThr: 7.954 ± 1.736
6.857LeuVal: 6.857 ± 1.88
0.274LeuTrp: 0.274 ± 0.169
1.92LeuTyr: 1.92 ± 1.184
0.0LeuXaa: 0.0 ± 0.0
Met
0.823MetAla: 0.823 ± 0.563
0.274MetCys: 0.274 ± 0.283
2.743MetAsp: 2.743 ± 1.37
1.097MetGlu: 1.097 ± 0.349
1.097MetPhe: 1.097 ± 0.676
0.549MetGly: 0.549 ± 0.338
0.549MetHis: 0.549 ± 0.571
1.646MetIle: 1.646 ± 0.613
1.371MetLys: 1.371 ± 1.093
3.291MetLeu: 3.291 ± 1.274
1.92MetMet: 1.92 ± 0.374
1.371MetAsn: 1.371 ± 0.741
0.549MetPro: 0.549 ± 0.338
1.371MetGln: 1.371 ± 0.383
2.468MetArg: 2.468 ± 0.464
3.291MetSer: 3.291 ± 0.8
1.646MetThr: 1.646 ± 0.409
1.097MetVal: 1.097 ± 0.383
0.549MetTrp: 0.549 ± 0.192
0.549MetTyr: 0.549 ± 0.338
0.0MetXaa: 0.0 ± 0.0
Asn
1.92AsnAla: 1.92 ± 0.508
1.097AsnCys: 1.097 ± 1.13
1.097AsnAsp: 1.097 ± 1.13
2.194AsnGlu: 2.194 ± 0.988
2.194AsnPhe: 2.194 ± 1.351
2.194AsnGly: 2.194 ± 0.557
0.549AsnHis: 0.549 ± 0.338
0.823AsnIle: 0.823 ± 0.225
1.646AsnLys: 1.646 ± 1.088
3.566AsnLeu: 3.566 ± 1.005
0.823AsnMet: 0.823 ± 1.186
0.549AsnAsn: 0.549 ± 0.571
3.566AsnPro: 3.566 ± 1.193
1.371AsnGln: 1.371 ± 0.5
1.646AsnArg: 1.646 ± 0.422
3.017AsnSer: 3.017 ± 1.158
1.92AsnThr: 1.92 ± 0.961
1.371AsnVal: 1.371 ± 1.903
0.823AsnTrp: 0.823 ± 0.452
0.274AsnTyr: 0.274 ± 0.283
0.0AsnXaa: 0.0 ± 0.0
Pro
2.194ProAla: 2.194 ± 0.478
0.549ProCys: 0.549 ± 0.612
1.097ProAsp: 1.097 ± 0.349
3.017ProGlu: 3.017 ± 0.756
3.84ProPhe: 3.84 ± 1.206
3.291ProGly: 3.291 ± 0.588
1.371ProHis: 1.371 ± 0.5
1.646ProIle: 1.646 ± 0.575
2.194ProLys: 2.194 ± 0.373
2.194ProLeu: 2.194 ± 0.557
0.823ProMet: 0.823 ± 0.452
1.371ProAsn: 1.371 ± 1.01
3.017ProPro: 3.017 ± 0.746
1.371ProGln: 1.371 ± 0.635
0.823ProArg: 0.823 ± 0.225
4.663ProSer: 4.663 ± 0.892
2.743ProThr: 2.743 ± 1.005
3.291ProVal: 3.291 ± 1.486
0.549ProTrp: 0.549 ± 0.192
0.549ProTyr: 0.549 ± 0.338
0.0ProXaa: 0.0 ± 0.0
Gln
2.468GlnAla: 2.468 ± 0.859
1.371GlnCys: 1.371 ± 1.253
2.194GlnAsp: 2.194 ± 0.598
4.114GlnGlu: 4.114 ± 1.105
2.468GlnPhe: 2.468 ± 0.55
2.194GlnGly: 2.194 ± 0.867
1.371GlnHis: 1.371 ± 0.5
2.468GlnIle: 2.468 ± 0.424
2.194GlnLys: 2.194 ± 0.463
2.194GlnLeu: 2.194 ± 0.478
1.371GlnMet: 1.371 ± 0.503
2.468GlnAsn: 2.468 ± 0.846
1.097GlnPro: 1.097 ± 0.636
0.823GlnGln: 0.823 ± 0.836
1.371GlnArg: 1.371 ± 0.523
3.017GlnSer: 3.017 ± 0.564
2.194GlnThr: 2.194 ± 0.988
1.646GlnVal: 1.646 ± 0.713
0.274GlnTrp: 0.274 ± 0.283
0.823GlnTyr: 0.823 ± 0.507
0.0GlnXaa: 0.0 ± 0.0
Arg
2.194ArgAla: 2.194 ± 1.59
2.194ArgCys: 2.194 ± 1.396
3.566ArgAsp: 3.566 ± 1.15
4.114ArgGlu: 4.114 ± 0.117
2.468ArgPhe: 2.468 ± 1.155
2.194ArgGly: 2.194 ± 0.398
1.92ArgHis: 1.92 ± 0.562
4.388ArgIle: 4.388 ± 1.459
3.84ArgLys: 3.84 ± 1.123
6.583ArgLeu: 6.583 ± 1.119
1.371ArgMet: 1.371 ± 0.503
1.371ArgAsn: 1.371 ± 0.523
3.017ArgPro: 3.017 ± 1.307
1.646ArgGln: 1.646 ± 0.409
3.291ArgArg: 3.291 ± 0.8
4.388ArgSer: 4.388 ± 1.95
2.743ArgThr: 2.743 ± 0.769
3.84ArgVal: 3.84 ± 1.643
1.371ArgTrp: 1.371 ± 0.612
1.646ArgTyr: 1.646 ± 0.66
0.0ArgXaa: 0.0 ± 0.0
Ser
3.291SerAla: 3.291 ± 0.395
1.646SerCys: 1.646 ± 1.695
7.954SerAsp: 7.954 ± 1.682
4.114SerGlu: 4.114 ± 0.727
3.291SerPhe: 3.291 ± 0.791
4.388SerGly: 4.388 ± 0.469
2.194SerHis: 2.194 ± 0.988
4.937SerIle: 4.937 ± 1.471
6.308SerLys: 6.308 ± 2.21
10.971SerLeu: 10.971 ± 1.848
1.097SerMet: 1.097 ± 0.349
2.194SerAsn: 2.194 ± 0.988
4.114SerPro: 4.114 ± 0.845
3.566SerGln: 3.566 ± 0.958
5.485SerArg: 5.485 ± 0.996
10.148SerSer: 10.148 ± 1.647
6.308SerThr: 6.308 ± 1.056
5.485SerVal: 5.485 ± 1.085
2.468SerTrp: 2.468 ± 0.859
0.823SerTyr: 0.823 ± 0.487
0.0SerXaa: 0.0 ± 0.0
Thr
2.743ThrAla: 2.743 ± 0.875
2.743ThrCys: 2.743 ± 0.733
2.468ThrAsp: 2.468 ± 1.01
5.485ThrGlu: 5.485 ± 1.08
2.468ThrPhe: 2.468 ± 0.676
4.937ThrGly: 4.937 ± 1.025
1.097ThrHis: 1.097 ± 0.396
5.485ThrIle: 5.485 ± 0.431
3.291ThrLys: 3.291 ± 0.791
6.034ThrLeu: 6.034 ± 1.371
0.549ThrMet: 0.549 ± 0.338
2.468ThrAsn: 2.468 ± 1.155
1.371ThrPro: 1.371 ± 0.503
0.823ThrGln: 0.823 ± 0.225
4.114ThrArg: 4.114 ± 1.024
5.485ThrSer: 5.485 ± 1.02
3.566ThrThr: 3.566 ± 0.721
3.566ThrVal: 3.566 ± 1.393
1.097ThrTrp: 1.097 ± 0.396
1.92ThrTyr: 1.92 ± 0.563
0.0ThrXaa: 0.0 ± 0.0
Val
3.566ValAla: 3.566 ± 1.475
2.194ValCys: 2.194 ± 1.224
3.84ValAsp: 3.84 ± 1.67
4.663ValGlu: 4.663 ± 1.218
2.743ValPhe: 2.743 ± 1.0
1.92ValGly: 1.92 ± 0.821
2.743ValHis: 2.743 ± 0.367
3.291ValIle: 3.291 ± 1.655
2.743ValLys: 2.743 ± 0.591
5.76ValLeu: 5.76 ± 0.566
1.097ValMet: 1.097 ± 0.349
1.92ValAsn: 1.92 ± 0.821
1.371ValPro: 1.371 ± 0.635
2.194ValGln: 2.194 ± 0.698
3.017ValArg: 3.017 ± 1.144
5.211ValSer: 5.211 ± 1.346
4.937ValThr: 4.937 ± 1.373
3.566ValVal: 3.566 ± 0.736
0.823ValTrp: 0.823 ± 0.452
3.291ValTyr: 3.291 ± 0.884
0.0ValXaa: 0.0 ± 0.0
Trp
0.549TrpAla: 0.549 ± 0.192
0.823TrpCys: 0.823 ± 0.507
0.823TrpAsp: 0.823 ± 1.01
0.274TrpGlu: 0.274 ± 0.169
0.549TrpPhe: 0.549 ± 0.192
1.097TrpGly: 1.097 ± 0.636
0.0TrpHis: 0.0 ± 0.0
1.097TrpIle: 1.097 ± 0.729
1.097TrpLys: 1.097 ± 0.604
1.92TrpLeu: 1.92 ± 0.821
1.371TrpMet: 1.371 ± 0.383
0.549TrpAsn: 0.549 ± 0.571
0.549TrpPro: 0.549 ± 1.086
0.274TrpGln: 0.274 ± 0.169
0.823TrpArg: 0.823 ± 0.487
1.646TrpSer: 1.646 ± 0.451
1.097TrpThr: 1.097 ± 0.349
1.371TrpVal: 1.371 ± 0.635
0.0TrpTrp: 0.0 ± 0.0
1.097TrpTyr: 1.097 ± 0.383
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.371TyrAla: 1.371 ± 0.523
0.274TyrCys: 0.274 ± 0.283
1.371TyrAsp: 1.371 ± 1.196
1.371TyrGlu: 1.371 ± 0.845
0.823TyrPhe: 0.823 ± 0.225
0.549TyrGly: 0.549 ± 0.338
1.92TyrHis: 1.92 ± 0.974
1.646TyrIle: 1.646 ± 0.451
0.549TyrLys: 0.549 ± 0.479
2.743TyrLeu: 2.743 ± 0.924
0.549TyrMet: 0.549 ± 0.571
0.823TyrAsn: 0.823 ± 0.225
1.92TyrPro: 1.92 ± 0.56
0.823TyrGln: 0.823 ± 0.544
1.371TyrArg: 1.371 ± 0.845
2.468TyrSer: 2.468 ± 0.424
1.097TyrThr: 1.097 ± 0.383
1.92TyrVal: 1.92 ± 0.508
1.371TyrTrp: 1.371 ± 0.5
0.549TyrTyr: 0.549 ± 0.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3647 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski