Amino acid dipeptide frequency for Kadiweu virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
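As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is a minimal sketch under stated assumptions, not the database's actual pipeline: the function name `dipeptide_permille`, the use of one-letter codes, and the pooling of counts across all proteins are choices made here for illustration only.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return overlapping dipeptide frequencies (per mille) pooled over all proteins.

    Assumption: sequences use one-letter amino acid codes, and a dipeptide
    never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the Kadiweu virus proteome).
toy = ["AAKL", "AKA"]
freqs = dipeptide_permille(toy)
print(freqs)
```

With the toy input there are five dipeptides in total (AA, AK, KL, AK, KA), so AK accounts for two out of five.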

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one should be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
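The arithmetic behind the expected value can be sketched as follows. The classification rule is an assumption made for illustration: the page does not state the exact threshold used for the red and blue marking, so the helper `classify` simply compares an observed value with the uniform expectation.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation (hypothetical rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


print(uniform_expectation_permille(20))  # 400 possible dipeptides
print(uniform_expectation_permille(21))  # 441 possible dipeptides, with Xaa
print(classify(7.783))  # LeuLeu value from the table
print(classify(0.126))  # CysCys value from the table
```

With 21 symbols the expectation drops slightly, to roughly 2.27‰ per dipeptide.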

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.134 ± 0.843
AlaCys: 1.13 ± 0.345
AlaAsp: 1.381 ± 0.496
AlaGlu: 3.64 ± 0.958
AlaPhe: 3.389 ± 1.392
AlaGly: 1.506 ± 1.117
AlaHis: 0.628 ± 0.171
AlaIle: 4.017 ± 0.499
AlaLys: 3.264 ± 0.518
AlaLeu: 5.649 ± 1.446
AlaMet: 0.377 ± 1.221
AlaAsn: 2.636 ± 0.881
AlaPro: 2.511 ± 2.804
AlaGln: 2.009 ± 0.874
AlaArg: 2.009 ± 0.373
AlaSer: 2.762 ± 2.736
AlaThr: 4.017 ± 0.444
AlaVal: 1.883 ± 1.029
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.385 ± 0.788
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.502 ± 0.148
CysCys: 0.126 ± 0.076
CysAsp: 0.628 ± 0.171
CysGlu: 2.26 ± 0.479
CysPhe: 0.879 ± 0.238
CysGly: 1.506 ± 0.427
CysHis: 0.628 ± 0.215
CysIle: 2.762 ± 0.769
CysLys: 2.385 ± 0.364
CysLeu: 1.255 ± 0.331
CysMet: 0.251 ± 0.091
CysAsn: 1.883 ± 0.645
CysPro: 0.753 ± 0.63
CysGln: 0.126 ± 0.076
CysArg: 0.251 ± 0.153
CysSer: 0.753 ± 0.625
CysThr: 1.757 ± 0.466
CysVal: 1.13 ± 0.345
CysTrp: 0.0 ± 0.0
CysTyr: 2.134 ± 0.419
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.134 ± 1.538
AspCys: 1.255 ± 0.331
AspAsp: 1.757 ± 0.452
AspGlu: 2.26 ± 0.385
AspPhe: 2.762 ± 0.87
AspGly: 0.879 ± 0.36
AspHis: 2.009 ± 0.358
AspIle: 3.64 ± 1.1
AspLys: 2.385 ± 0.822
AspLeu: 4.896 ± 0.235
AspMet: 1.757 ± 0.476
AspAsn: 3.389 ± 0.881
AspPro: 2.887 ± 0.43
AspGln: 2.009 ± 1.633
AspArg: 1.13 ± 0.561
AspSer: 4.143 ± 0.639
AspThr: 4.268 ± 0.522
AspVal: 2.762 ± 0.418
AspTrp: 0.502 ± 0.182
AspTyr: 3.892 ± 0.46
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.636 ± 0.46
GluCys: 1.632 ± 0.525
GluAsp: 1.757 ± 0.427
GluGlu: 3.138 ± 0.808
GluPhe: 3.389 ± 0.467
GluGly: 1.13 ± 0.345
GluHis: 2.009 ± 0.522
GluIle: 3.64 ± 0.948
GluLys: 3.64 ± 1.385
GluLeu: 6.653 ± 1.714
GluMet: 0.377 ± 0.229
GluAsn: 4.143 ± 0.741
GluPro: 1.757 ± 0.514
GluGln: 2.887 ± 0.757
GluArg: 2.134 ± 0.547
GluSer: 3.013 ± 0.611
GluThr: 3.766 ± 0.994
GluVal: 1.255 ± 0.331
GluTrp: 0.377 ± 0.096
GluTyr: 2.009 ± 0.592
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.632 ± 0.427
PheCys: 0.879 ± 0.257
PheAsp: 2.887 ± 0.817
PheGlu: 2.385 ± 0.911
PhePhe: 1.506 ± 0.427
PheGly: 3.013 ± 0.385
PheHis: 0.502 ± 0.148
PheIle: 2.26 ± 0.578
PheLys: 2.385 ± 0.656
PheLeu: 3.766 ± 0.388
PheMet: 1.883 ± 0.514
PheAsn: 2.762 ± 0.713
PhePro: 0.753 ± 0.193
PheGln: 2.009 ± 0.525
PheArg: 1.506 ± 0.482
PheSer: 2.009 ± 0.433
PheThr: 4.519 ± 0.745
PheVal: 3.138 ± 0.35
PheTrp: 0.0 ± 0.0
PheTyr: 2.009 ± 0.464
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.757 ± 0.452
GlyCys: 0.251 ± 0.153
GlyAsp: 1.381 ± 0.384
GlyGlu: 1.255 ± 0.343
GlyPhe: 1.381 ± 0.357
GlyGly: 1.004 ± 0.61
GlyHis: 1.883 ± 0.514
GlyIle: 2.009 ± 0.522
GlyLys: 3.264 ± 0.768
GlyLeu: 3.138 ± 0.809
GlyMet: 1.004 ± 0.263
GlyAsn: 1.255 ± 0.43
GlyPro: 1.13 ± 0.474
GlyGln: 0.502 ± 0.148
GlyArg: 0.753 ± 1.192
GlySer: 1.381 ± 1.126
GlyThr: 2.762 ± 0.769
GlyVal: 1.381 ± 1.712
GlyTrp: 0.377 ± 0.643
GlyTyr: 2.762 ± 1.052
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.757 ± 0.476
HisCys: 1.13 ± 0.55
HisAsp: 1.004 ± 0.435
HisGlu: 1.381 ± 0.384
HisPhe: 1.13 ± 0.361
HisGly: 0.753 ± 0.193
HisHis: 0.628 ± 0.171
HisIle: 2.762 ± 0.713
HisLys: 2.26 ± 0.722
HisLeu: 3.515 ± 0.903
HisMet: 1.255 ± 0.433
HisAsn: 2.385 ± 0.617
HisPro: 2.636 ± 0.695
HisGln: 1.506 ± 0.482
HisArg: 0.879 ± 0.36
HisSer: 1.004 ± 0.296
HisThr: 3.892 ± 0.408
HisVal: 2.134 ± 0.547
HisTrp: 0.251 ± 0.153
HisTyr: 2.762 ± 1.47
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.64 ± 0.49
IleCys: 1.13 ± 0.289
IleAsp: 4.519 ± 1.074
IleGlu: 4.017 ± 1.097
IlePhe: 1.381 ± 0.546
IleGly: 2.26 ± 0.301
IleHis: 2.385 ± 0.328
IleIle: 4.645 ± 1.194
IleLys: 5.398 ± 1.382
IleLeu: 6.402 ± 0.825
IleMet: 2.385 ± 0.684
IleAsn: 7.155 ± 1.833
IlePro: 4.268 ± 0.815
IleGln: 3.515 ± 0.952
IleArg: 4.017 ± 1.027
IleSer: 3.766 ± 0.544
IleThr: 7.03 ± 1.259
IleVal: 3.766 ± 0.964
IleTrp: 1.004 ± 0.263
IleTyr: 5.398 ± 1.407
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.519 ± 1.738
LysCys: 2.134 ± 0.534
LysAsp: 4.77 ± 0.68
LysGlu: 3.138 ± 0.285
LysPhe: 2.385 ± 0.339
LysGly: 1.13 ± 0.289
LysHis: 3.138 ± 0.809
LysIle: 8.662 ± 0.49
LysLys: 2.762 ± 0.331
LysLeu: 5.147 ± 0.852
LysMet: 1.13 ± 0.49
LysAsn: 4.143 ± 0.612
LysPro: 4.77 ± 2.531
LysGln: 1.255 ± 1.156
LysArg: 2.762 ± 0.257
LysSer: 4.017 ± 0.865
LysThr: 4.017 ± 0.512
LysVal: 2.385 ± 0.679
LysTrp: 0.377 ± 0.096
LysTyr: 3.892 ± 0.495
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.398 ± 0.936
LeuCys: 2.385 ± 0.402
LeuAsp: 6.528 ± 1.028
LeuGlu: 4.77 ± 0.606
LeuPhe: 4.268 ± 0.601
LeuGly: 1.757 ± 0.476
LeuHis: 2.385 ± 0.523
LeuIle: 6.528 ± 1.051
LeuLys: 6.653 ± 1.714
LeuLeu: 7.783 ± 1.901
LeuMet: 1.883 ± 0.514
LeuAsn: 5.775 ± 1.515
LeuPro: 4.143 ± 0.881
LeuGln: 4.645 ± 1.232
LeuArg: 3.766 ± 0.688
LeuSer: 6.402 ± 0.713
LeuThr: 7.281 ± 1.868
LeuVal: 5.775 ± 1.402
LeuTrp: 0.879 ± 0.238
LeuTyr: 5.021 ± 1.481
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.632 ± 0.427
MetCys: 0.502 ± 0.148
MetAsp: 1.632 ± 0.427
MetGlu: 1.381 ± 0.435
MetPhe: 1.381 ± 0.357
MetGly: 0.628 ± 1.227
MetHis: 0.502 ± 0.182
MetIle: 2.636 ± 0.771
MetLys: 0.879 ± 0.613
MetLeu: 2.134 ± 0.656
MetMet: 0.0 ± 0.0
MetAsn: 1.506 ± 0.427
MetPro: 1.004 ± 0.263
MetGln: 1.13 ± 0.289
MetArg: 1.13 ± 0.361
MetSer: 1.255 ± 0.474
MetThr: 1.255 ± 0.343
MetVal: 1.757 ± 0.638
MetTrp: 0.0 ± 0.0
MetTyr: 1.13 ± 0.55
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.636 ± 4.173
AsnCys: 1.004 ± 0.296
AsnAsp: 1.632 ± 0.433
AsnGlu: 2.636 ± 0.266
AsnPhe: 3.389 ± 0.9
AsnGly: 3.64 ± 0.806
AsnHis: 2.762 ± 0.257
AsnIle: 4.645 ± 1.396
AsnLys: 4.896 ± 1.419
AsnLeu: 8.034 ± 1.504
AsnMet: 2.009 ± 0.304
AsnAsn: 6.277 ± 1.561
AsnPro: 5.147 ± 1.332
AsnGln: 4.394 ± 3.38
AsnArg: 2.511 ± 0.685
AsnSer: 2.134 ± 0.544
AsnThr: 5.398 ± 0.584
AsnVal: 3.389 ± 0.975
AsnTrp: 0.502 ± 0.148
AsnTyr: 4.645 ± 0.557
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.009 ± 0.937
ProCys: 0.879 ± 0.257
ProAsp: 1.757 ± 0.476
ProGlu: 1.883 ± 0.645
ProPhe: 1.255 ± 0.331
ProGly: 1.506 ± 0.444
ProHis: 2.385 ± 1.072
ProIle: 4.143 ± 0.72
ProLys: 2.887 ± 2.945
ProLeu: 5.021 ± 1.495
ProMet: 1.004 ± 0.263
ProAsn: 3.138 ± 0.35
ProPro: 2.009 ± 0.592
ProGln: 3.013 ± 1.302
ProArg: 1.381 ± 1.035
ProSer: 2.762 ± 0.713
ProThr: 6.402 ± 0.738
ProVal: 2.762 ± 0.713
ProTrp: 0.251 ± 0.091
ProTyr: 1.255 ± 0.331
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.636 ± 0.74
GlnCys: 0.377 ± 0.096
GlnAsp: 2.887 ± 1.505
GlnGlu: 2.134 ± 0.547
GlnPhe: 1.757 ± 1.041
GlnGly: 0.628 ± 0.382
GlnHis: 2.887 ± 0.355
GlnIle: 3.138 ± 0.86
GlnLys: 3.389 ± 2.176
GlnLeu: 3.892 ± 1.231
GlnMet: 0.251 ± 0.153
GlnAsn: 5.147 ± 3.666
GlnPro: 1.381 ± 1.134
GlnGln: 2.762 ± 2.799
GlnArg: 2.385 ± 1.631
GlnSer: 2.26 ± 0.808
GlnThr: 1.883 ± 0.482
GlnVal: 1.004 ± 0.494
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.264 ± 0.835
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.13 ± 0.478
ArgCys: 1.255 ± 0.343
ArgAsp: 1.381 ± 0.526
ArgGlu: 2.636 ± 0.266
ArgPhe: 0.879 ± 0.238
ArgGly: 0.628 ± 0.171
ArgHis: 1.883 ± 0.532
ArgIle: 3.515 ± 0.905
ArgLys: 3.138 ± 0.759
ArgLeu: 3.64 ± 0.948
ArgMet: 0.377 ± 0.45
ArgAsn: 3.138 ± 1.449
ArgPro: 1.004 ± 0.666
ArgGln: 2.009 ± 0.992
ArgArg: 2.009 ± 0.304
ArgSer: 2.26 ± 0.834
ArgThr: 3.013 ± 0.281
ArgVal: 1.506 ± 0.405
ArgTrp: 0.126 ± 0.076
ArgTyr: 3.892 ± 0.51
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.013 ± 0.617
SerCys: 1.381 ± 0.496
SerAsp: 2.887 ± 0.539
SerGlu: 3.64 ± 0.454
SerPhe: 2.385 ± 0.617
SerGly: 1.381 ± 0.546
SerHis: 1.506 ± 0.573
SerIle: 3.264 ± 1.652
SerLys: 3.766 ± 0.579
SerLeu: 3.389 ± 0.404
SerMet: 2.134 ± 0.386
SerAsn: 3.64 ± 1.903
SerPro: 1.632 ± 0.646
SerGln: 1.506 ± 2.418
SerArg: 2.385 ± 0.265
SerSer: 2.134 ± 0.386
SerThr: 5.398 ± 1.422
SerVal: 2.887 ± 1.727
SerTrp: 0.377 ± 0.096
SerTyr: 4.017 ± 0.499
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.515 ± 0.397
ThrCys: 1.13 ± 0.361
ThrAsp: 4.017 ± 0.451
ThrGlu: 3.515 ± 0.905
ThrPhe: 2.762 ± 0.257
ThrGly: 4.017 ± 0.583
ThrHis: 2.134 ± 0.787
ThrIle: 5.775 ± 1.5
ThrLys: 3.766 ± 1.112
ThrLeu: 9.164 ± 1.202
ThrMet: 2.26 ± 0.691
ThrAsn: 5.398 ± 0.926
ThrPro: 5.272 ± 0.196
ThrGln: 4.143 ± 1.117
ThrArg: 3.892 ± 1.001
ThrSer: 5.398 ± 1.653
ThrThr: 8.411 ± 0.788
ThrVal: 5.272 ± 0.758
ThrTrp: 0.502 ± 0.69
ThrTyr: 4.143 ± 0.599
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.385 ± 0.617
ValCys: 1.632 ± 0.433
ValAsp: 2.636 ± 0.713
ValGlu: 1.883 ± 0.532
ValPhe: 3.013 ± 0.385
ValGly: 0.879 ± 1.242
ValHis: 2.134 ± 0.706
ValIle: 3.64 ± 0.484
ValLys: 4.77 ± 1.235
ValLeu: 4.143 ± 0.336
ValMet: 2.009 ± 0.602
ValAsn: 3.64 ± 0.958
ValPro: 2.511 ± 0.643
ValGln: 2.762 ± 2.076
ValArg: 1.757 ± 0.375
ValSer: 2.385 ± 0.567
ValThr: 3.766 ± 0.853
ValVal: 2.009 ± 0.4
ValTrp: 0.251 ± 0.091
ValTyr: 2.762 ± 0.769
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.251 ± 0.729
TrpCys: 0.0 ± 0.0
TrpAsp: 0.628 ± 0.215
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.502 ± 0.675
TrpGly: 0.126 ± 0.076
TrpHis: 0.126 ± 0.076
TrpIle: 0.377 ± 0.096
TrpLys: 0.251 ± 0.153
TrpLeu: 0.753 ± 0.193
TrpMet: 0.377 ± 0.096
TrpAsn: 0.126 ± 0.076
TrpPro: 0.0 ± 0.0
TrpGln: 0.126 ± 0.076
TrpArg: 0.377 ± 0.643
TrpSer: 0.251 ± 0.091
TrpThr: 0.502 ± 0.182
TrpVal: 0.377 ± 0.096
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.879 ± 0.238
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.009 ± 0.433
TyrCys: 1.757 ± 1.19
TyrAsp: 4.645 ± 0.724
TyrGlu: 3.389 ± 0.9
TyrPhe: 2.009 ± 0.522
TyrGly: 2.26 ± 0.622
TyrHis: 2.636 ± 0.675
TyrIle: 6.026 ± 0.758
TyrLys: 4.394 ± 0.528
TyrLeu: 5.649 ± 0.787
TyrMet: 0.628 ± 0.215
TyrAsn: 4.143 ± 1.07
TyrPro: 2.134 ± 0.569
TyrGln: 1.757 ± 0.452
TyrArg: 2.385 ± 0.788
TyrSer: 2.636 ± 1.03
TyrThr: 5.147 ± 0.455
TyrVal: 4.394 ± 1.199
TyrTrp: 0.251 ± 0.729
TyrTyr: 3.766 ± 0.388
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (7967 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
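A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. This is a minimal sketch under assumptions, not the database's own code: the function names, the one-letter sequence codes, the fixed seed, and the use of the population standard deviation as the reported error are all choices made here for illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the standard deviation of the replicate estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the Kadiweu virus proteome).
toy = ["AAKL", "AKA", "KLLA"]
print(bootstrap_error(toy, "AK"))
print(bootstrap_error(toy, "WW"))  # a dipeptide that never occurs
```

A dipeptide that occurs in none of the proteins has a frequency of zero in every replicate, which is consistent with the "0.0 ± 0.0" entries in the table. With only 4 proteins, as here, the replicates draw from very few distinct units, so the estimated errors can be large relative to the values.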

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski