Amino acid dipepetide frequency for Kumasi rhabdovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.314AlaAla: 2.314 ± 1.474
2.893AlaCys: 2.893 ± 0.999
2.314AlaAsp: 2.314 ± 0.8
1.157AlaGlu: 1.157 ± 0.795
1.157AlaPhe: 1.157 ± 0.688
2.314AlaGly: 2.314 ± 1.31
1.446AlaHis: 1.446 ± 0.542
2.893AlaIle: 2.893 ± 0.875
2.893AlaLys: 2.893 ± 2.407
4.339AlaLeu: 4.339 ± 0.792
0.868AlaMet: 0.868 ± 1.095
2.314AlaAsn: 2.314 ± 1.472
1.736AlaPro: 1.736 ± 2.076
2.025AlaGln: 2.025 ± 1.171
2.603AlaArg: 2.603 ± 0.914
3.182AlaSer: 3.182 ± 0.379
3.182AlaThr: 3.182 ± 0.459
3.182AlaVal: 3.182 ± 0.459
1.157AlaTrp: 1.157 ± 0.819
2.893AlaTyr: 2.893 ± 0.937
0.0AlaXaa: 0.0 ± 0.0
Cys
1.157CysAla: 1.157 ± 0.702
0.0CysCys: 0.0 ± 0.0
0.579CysAsp: 0.579 ± 0.383
1.446CysGlu: 1.446 ± 0.712
0.579CysPhe: 0.579 ± 0.388
1.446CysGly: 1.446 ± 0.747
0.579CysHis: 0.579 ± 0.562
1.157CysIle: 1.157 ± 0.5
1.446CysLys: 1.446 ± 0.712
2.603CysLeu: 2.603 ± 0.931
0.0CysMet: 0.0 ± 0.0
1.736CysAsn: 1.736 ± 0.454
1.157CysPro: 1.157 ± 0.702
0.579CysGln: 0.579 ± 1.187
1.157CysArg: 1.157 ± 0.441
2.025CysSer: 2.025 ± 1.174
2.025CysThr: 2.025 ± 1.152
2.314CysVal: 2.314 ± 0.881
0.579CysTrp: 0.579 ± 0.677
1.157CysTyr: 1.157 ± 0.441
0.0CysXaa: 0.0 ± 0.0
Asp
1.736AspAla: 1.736 ± 0.689
1.157AspCys: 1.157 ± 0.702
1.736AspAsp: 1.736 ± 1.302
3.76AspGlu: 3.76 ± 1.963
1.736AspPhe: 1.736 ± 0.666
2.603AspGly: 2.603 ± 0.727
1.446AspHis: 1.446 ± 1.236
3.182AspIle: 3.182 ± 1.057
4.628AspLys: 4.628 ± 0.656
5.207AspLeu: 5.207 ± 2.569
2.603AspMet: 2.603 ± 0.782
1.446AspAsn: 1.446 ± 0.747
3.182AspPro: 3.182 ± 1.712
2.025AspGln: 2.025 ± 1.07
2.603AspArg: 2.603 ± 0.867
2.603AspSer: 2.603 ± 0.431
3.471AspThr: 3.471 ± 0.802
2.603AspVal: 2.603 ± 1.069
1.736AspTrp: 1.736 ± 0.681
2.893AspTyr: 2.893 ± 1.326
0.0AspXaa: 0.0 ± 0.0
Glu
2.893GluAla: 2.893 ± 0.766
0.868GluCys: 0.868 ± 0.381
3.182GluAsp: 3.182 ± 0.885
5.207GluGlu: 5.207 ± 1.138
1.446GluPhe: 1.446 ± 1.05
4.05GluGly: 4.05 ± 0.956
1.157GluHis: 1.157 ± 0.635
4.918GluIle: 4.918 ± 0.919
2.314GluLys: 2.314 ± 1.122
4.628GluLeu: 4.628 ± 1.232
2.025GluMet: 2.025 ± 1.59
1.736GluAsn: 1.736 ± 0.762
2.314GluPro: 2.314 ± 0.632
1.157GluGln: 1.157 ± 0.795
2.025GluArg: 2.025 ± 1.375
4.628GluSer: 4.628 ± 1.383
3.76GluThr: 3.76 ± 1.134
6.075GluVal: 6.075 ± 1.477
0.868GluTrp: 0.868 ± 0.418
2.025GluTyr: 2.025 ± 0.721
0.0GluXaa: 0.0 ± 0.0
Phe
1.157PheAla: 1.157 ± 0.5
1.446PheCys: 1.446 ± 0.398
2.314PheAsp: 2.314 ± 0.632
1.446PheGlu: 1.446 ± 0.528
0.868PhePhe: 0.868 ± 0.477
1.736PheGly: 1.736 ± 0.454
1.736PheHis: 1.736 ± 1.259
1.157PheIle: 1.157 ± 0.441
4.339PheLys: 4.339 ± 1.458
4.339PheLeu: 4.339 ± 0.877
0.868PheMet: 0.868 ± 0.418
2.025PheAsn: 2.025 ± 0.556
2.314PhePro: 2.314 ± 1.279
1.157PheGln: 1.157 ± 0.441
2.603PheArg: 2.603 ± 0.727
4.918PheSer: 4.918 ± 0.888
1.446PheThr: 1.446 ± 0.662
2.025PheVal: 2.025 ± 0.721
1.157PheTrp: 1.157 ± 1.54
1.736PheTyr: 1.736 ± 0.467
0.0PheXaa: 0.0 ± 0.0
Gly
3.471GlyAla: 3.471 ± 2.078
0.289GlyCys: 0.289 ± 0.159
4.05GlyAsp: 4.05 ± 1.276
2.893GlyGlu: 2.893 ± 0.807
2.603GlyPhe: 2.603 ± 1.001
3.182GlyGly: 3.182 ± 1.397
1.157GlyHis: 1.157 ± 0.441
1.736GlyIle: 1.736 ± 0.717
3.76GlyLys: 3.76 ± 1.265
8.389GlyLeu: 8.389 ± 2.185
1.157GlyMet: 1.157 ± 0.41
2.314GlyAsn: 2.314 ± 0.675
2.893GlyPro: 2.893 ± 1.019
1.446GlyGln: 1.446 ± 1.206
4.05GlyArg: 4.05 ± 0.717
6.075GlySer: 6.075 ± 1.448
6.075GlyThr: 6.075 ± 2.482
4.05GlyVal: 4.05 ± 0.896
1.157GlyTrp: 1.157 ± 0.441
2.314GlyTyr: 2.314 ± 1.271
0.0GlyXaa: 0.0 ± 0.0
His
2.025HisAla: 2.025 ± 0.869
0.0HisCys: 0.0 ± 0.0
1.736HisAsp: 1.736 ± 1.166
1.446HisGlu: 1.446 ± 0.747
1.157HisPhe: 1.157 ± 0.583
0.289HisGly: 0.289 ± 0.159
1.446HisHis: 1.446 ± 0.907
2.603HisIle: 2.603 ± 0.809
3.76HisLys: 3.76 ± 0.614
2.603HisLeu: 2.603 ± 1.175
0.0HisMet: 0.0 ± 0.0
0.868HisAsn: 0.868 ± 0.477
3.182HisPro: 3.182 ± 1.245
0.579HisGln: 0.579 ± 0.383
1.157HisArg: 1.157 ± 0.635
0.868HisSer: 0.868 ± 0.651
0.868HisThr: 0.868 ± 0.461
2.314HisVal: 2.314 ± 0.506
1.736HisTrp: 1.736 ± 0.997
1.736HisTyr: 1.736 ± 0.666
0.0HisXaa: 0.0 ± 0.0
Ile
2.025IleAla: 2.025 ± 1.173
1.157IleCys: 1.157 ± 0.5
3.76IleAsp: 3.76 ± 0.628
3.471IleGlu: 3.471 ± 0.909
1.157IlePhe: 1.157 ± 0.702
4.339IleGly: 4.339 ± 1.316
1.157IleHis: 1.157 ± 0.765
5.496IleIle: 5.496 ± 2.622
4.339IleLys: 4.339 ± 0.836
5.496IleLeu: 5.496 ± 1.405
2.314IleMet: 2.314 ± 1.005
4.05IleAsn: 4.05 ± 0.824
3.182IlePro: 3.182 ± 1.145
1.446IleGln: 1.446 ± 0.791
3.182IleArg: 3.182 ± 0.379
5.496IleSer: 5.496 ± 1.582
2.025IleThr: 2.025 ± 0.628
2.603IleVal: 2.603 ± 1.001
1.446IleTrp: 1.446 ± 1.272
2.603IleTyr: 2.603 ± 1.094
0.0IleXaa: 0.0 ± 0.0
Lys
2.893LysAla: 2.893 ± 1.776
2.025LysCys: 2.025 ± 0.653
2.893LysAsp: 2.893 ± 0.795
4.339LysGlu: 4.339 ± 1.011
2.603LysPhe: 2.603 ± 1.342
4.918LysGly: 4.918 ± 0.997
1.736LysHis: 1.736 ± 0.454
4.05LysIle: 4.05 ± 1.105
4.05LysLys: 4.05 ± 1.341
6.364LysLeu: 6.364 ± 2.208
2.025LysMet: 2.025 ± 3.052
3.76LysAsn: 3.76 ± 1.948
1.736LysPro: 1.736 ± 1.585
0.579LysGln: 0.579 ± 0.84
5.207LysArg: 5.207 ± 1.325
3.182LysSer: 3.182 ± 0.776
4.339LysThr: 4.339 ± 1.256
4.339LysVal: 4.339 ± 1.564
1.446LysTrp: 1.446 ± 0.542
0.868LysTyr: 0.868 ± 0.477
0.0LysXaa: 0.0 ± 0.0
Leu
4.05LeuAla: 4.05 ± 0.817
2.025LeuCys: 2.025 ± 0.803
4.918LeuAsp: 4.918 ± 1.436
5.496LeuGlu: 5.496 ± 1.764
4.628LeuPhe: 4.628 ± 1.28
8.1LeuGly: 8.1 ± 1.379
3.182LeuHis: 3.182 ± 1.204
5.496LeuIle: 5.496 ± 1.681
4.628LeuLys: 4.628 ± 1.244
6.653LeuLeu: 6.653 ± 1.477
3.182LeuMet: 3.182 ± 1.382
5.785LeuAsn: 5.785 ± 2.653
4.339LeuPro: 4.339 ± 1.888
3.182LeuGln: 3.182 ± 0.781
6.942LeuArg: 6.942 ± 1.925
8.1LeuSer: 8.1 ± 2.628
7.232LeuThr: 7.232 ± 1.405
4.628LeuVal: 4.628 ± 1.703
1.446LeuTrp: 1.446 ± 0.736
3.471LeuTyr: 3.471 ± 1.251
0.0LeuXaa: 0.0 ± 0.0
Met
1.736MetAla: 1.736 ± 0.68
0.579MetCys: 0.579 ± 0.318
1.446MetAsp: 1.446 ± 0.766
1.446MetGlu: 1.446 ± 0.528
1.446MetPhe: 1.446 ± 0.662
2.314MetGly: 2.314 ± 1.027
0.579MetHis: 0.579 ± 0.318
2.603MetIle: 2.603 ± 0.41
1.446MetLys: 1.446 ± 1.459
0.868MetLeu: 0.868 ± 0.793
1.157MetMet: 1.157 ± 1.385
0.0MetAsn: 0.0 ± 0.0
0.289MetPro: 0.289 ± 0.159
0.289MetGln: 0.289 ± 0.159
2.025MetArg: 2.025 ± 2.72
2.893MetSer: 2.893 ± 0.875
2.314MetThr: 2.314 ± 2.284
0.579MetVal: 0.579 ± 0.318
0.0MetTrp: 0.0 ± 0.0
0.289MetTyr: 0.289 ± 0.159
0.0MetXaa: 0.0 ± 0.0
Asn
3.76AsnAla: 3.76 ± 0.999
1.446AsnCys: 1.446 ± 0.398
2.025AsnAsp: 2.025 ± 0.882
1.736AsnGlu: 1.736 ± 0.68
2.025AsnPhe: 2.025 ± 1.112
3.471AsnGly: 3.471 ± 0.702
2.603AsnHis: 2.603 ± 1.107
2.314AsnIle: 2.314 ± 0.881
2.314AsnLys: 2.314 ± 0.946
3.182AsnLeu: 3.182 ± 1.136
1.157AsnMet: 1.157 ± 0.635
2.603AsnAsn: 2.603 ± 0.975
2.603AsnPro: 2.603 ± 1.098
1.736AsnGln: 1.736 ± 1.409
0.868AsnArg: 0.868 ± 1.08
4.339AsnSer: 4.339 ± 0.92
2.893AsnThr: 2.893 ± 0.951
2.893AsnVal: 2.893 ± 1.478
0.868AsnTrp: 0.868 ± 1.19
1.446AsnTyr: 1.446 ± 0.398
0.0AsnXaa: 0.0 ± 0.0
Pro
0.868ProAla: 0.868 ± 0.815
0.579ProCys: 0.579 ± 0.677
3.471ProAsp: 3.471 ± 1.496
2.893ProGlu: 2.893 ± 1.602
1.446ProPhe: 1.446 ± 1.319
2.603ProGly: 2.603 ± 1.098
1.736ProHis: 1.736 ± 0.666
2.314ProIle: 2.314 ± 1.092
2.314ProLys: 2.314 ± 0.946
4.628ProLeu: 4.628 ± 1.125
0.289ProMet: 0.289 ± 0.445
2.314ProAsn: 2.314 ± 1.058
3.182ProPro: 3.182 ± 2.472
1.736ProGln: 1.736 ± 0.742
1.736ProArg: 1.736 ± 2.077
6.075ProSer: 6.075 ± 1.239
2.893ProThr: 2.893 ± 0.795
4.918ProVal: 4.918 ± 1.822
1.157ProTrp: 1.157 ± 0.776
1.446ProTyr: 1.446 ± 0.766
0.0ProXaa: 0.0 ± 0.0
Gln
2.314GlnAla: 2.314 ± 1.278
0.289GlnCys: 0.289 ± 0.445
1.736GlnAsp: 1.736 ± 0.79
2.025GlnGlu: 2.025 ± 1.079
1.446GlnPhe: 1.446 ± 0.662
2.893GlnGly: 2.893 ± 0.875
1.157GlnHis: 1.157 ± 1.07
1.736GlnIle: 1.736 ± 0.889
1.736GlnLys: 1.736 ± 0.454
2.603GlnLeu: 2.603 ± 0.992
0.868GlnMet: 0.868 ± 0.887
2.025GlnAsn: 2.025 ± 0.808
1.157GlnPro: 1.157 ± 0.5
0.579GlnGln: 0.579 ± 0.318
2.314GlnArg: 2.314 ± 2.018
2.025GlnSer: 2.025 ± 2.305
1.446GlnThr: 1.446 ± 0.398
1.736GlnVal: 1.736 ± 0.454
0.0GlnTrp: 0.0 ± 0.0
1.446GlnTyr: 1.446 ± 1.158
0.0GlnXaa: 0.0 ± 0.0
Arg
3.471ArgAla: 3.471 ± 1.006
2.603ArgCys: 2.603 ± 1.278
3.182ArgAsp: 3.182 ± 1.044
4.339ArgGlu: 4.339 ± 0.6
2.893ArgPhe: 2.893 ± 0.807
1.736ArgGly: 1.736 ± 0.717
1.157ArgHis: 1.157 ± 0.635
2.603ArgIle: 2.603 ± 0.802
2.603ArgLys: 2.603 ± 0.737
7.232ArgLeu: 7.232 ± 2.542
1.157ArgMet: 1.157 ± 0.4
2.314ArgAsn: 2.314 ± 1.433
2.314ArgPro: 2.314 ± 0.844
1.157ArgGln: 1.157 ± 0.664
3.182ArgArg: 3.182 ± 3.126
3.76ArgSer: 3.76 ± 2.885
3.76ArgThr: 3.76 ± 1.073
3.182ArgVal: 3.182 ± 2.545
0.868ArgTrp: 0.868 ± 0.381
2.025ArgTyr: 2.025 ± 1.862
0.0ArgXaa: 0.0 ± 0.0
Ser
4.05SerAla: 4.05 ± 1.43
1.446SerCys: 1.446 ± 0.542
6.075SerAsp: 6.075 ± 2.309
4.339SerGlu: 4.339 ± 1.513
5.496SerPhe: 5.496 ± 0.962
2.893SerGly: 2.893 ± 1.576
2.603SerHis: 2.603 ± 0.931
4.628SerIle: 4.628 ± 1.369
5.496SerLys: 5.496 ± 2.413
10.414SerLeu: 10.414 ± 3.202
0.868SerMet: 0.868 ± 1.766
2.314SerAsn: 2.314 ± 1.005
3.76SerPro: 3.76 ± 2.291
3.471SerGln: 3.471 ± 1.071
3.471SerArg: 3.471 ± 1.024
2.893SerSer: 2.893 ± 1.327
4.339SerThr: 4.339 ± 1.693
2.314SerVal: 2.314 ± 0.506
2.893SerTrp: 2.893 ± 1.497
2.025SerTyr: 2.025 ± 0.628
0.0SerXaa: 0.0 ± 0.0
Thr
2.603ThrAla: 2.603 ± 0.662
1.736ThrCys: 1.736 ± 1.479
2.603ThrAsp: 2.603 ± 1.102
4.339ThrGlu: 4.339 ± 0.994
2.314ThrPhe: 2.314 ± 0.946
5.496ThrGly: 5.496 ± 1.45
3.182ThrHis: 3.182 ± 0.904
2.893ThrIle: 2.893 ± 1.588
2.314ThrLys: 2.314 ± 0.8
5.496ThrLeu: 5.496 ± 1.378
1.157ThrMet: 1.157 ± 0.566
2.025ThrAsn: 2.025 ± 0.803
3.76ThrPro: 3.76 ± 1.336
3.471ThrGln: 3.471 ± 0.909
4.05ThrArg: 4.05 ± 3.507
3.471ThrSer: 3.471 ± 1.58
3.471ThrThr: 3.471 ± 1.189
4.339ThrVal: 4.339 ± 0.915
2.314ThrTrp: 2.314 ± 0.624
0.868ThrTyr: 0.868 ± 0.815
0.0ThrXaa: 0.0 ± 0.0
Val
2.603ValAla: 2.603 ± 2.039
1.736ValCys: 1.736 ± 0.467
2.314ValAsp: 2.314 ± 1.208
2.893ValGlu: 2.893 ± 1.527
3.182ValPhe: 3.182 ± 0.994
5.496ValGly: 5.496 ± 1.466
1.446ValHis: 1.446 ± 0.907
4.628ValIle: 4.628 ± 1.357
4.628ValLys: 4.628 ± 0.724
5.496ValLeu: 5.496 ± 1.342
1.157ValMet: 1.157 ± 0.664
3.76ValAsn: 3.76 ± 0.786
2.893ValPro: 2.893 ± 0.525
2.603ValGln: 2.603 ± 1.964
2.603ValArg: 2.603 ± 0.786
4.339ValSer: 4.339 ± 0.777
4.05ValThr: 4.05 ± 1.626
6.075ValVal: 6.075 ± 1.375
0.579ValTrp: 0.579 ± 1.187
1.446ValTyr: 1.446 ± 1.363
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.289TrpCys: 0.289 ± 0.737
1.736TrpAsp: 1.736 ± 0.454
1.157TrpGlu: 1.157 ± 0.583
1.446TrpPhe: 1.446 ± 0.727
2.025TrpGly: 2.025 ± 1.398
0.579TrpHis: 0.579 ± 0.89
2.603TrpIle: 2.603 ± 0.727
2.025TrpLys: 2.025 ± 0.869
1.157TrpLeu: 1.157 ± 1.51
0.579TrpMet: 0.579 ± 0.318
1.157TrpAsn: 1.157 ± 0.441
0.579TrpPro: 0.579 ± 0.318
0.579TrpGln: 0.579 ± 0.677
0.579TrpArg: 0.579 ± 0.894
1.446TrpSer: 1.446 ± 1.05
1.446TrpThr: 1.446 ± 1.19
1.736TrpVal: 1.736 ± 1.497
0.289TrpTrp: 0.289 ± 0.159
0.579TrpTyr: 0.579 ± 0.383
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.736TyrAla: 1.736 ± 0.742
1.157TyrCys: 1.157 ± 0.664
0.289TyrAsp: 0.289 ± 0.737
1.446TyrGlu: 1.446 ± 0.542
1.157TyrPhe: 1.157 ± 0.441
1.446TyrGly: 1.446 ± 0.613
0.289TyrHis: 0.289 ± 0.759
1.736TyrIle: 1.736 ± 0.467
2.314TyrLys: 2.314 ± 1.255
5.785TyrLeu: 5.785 ± 1.439
0.579TyrMet: 0.579 ± 0.562
1.736TyrAsn: 1.736 ± 0.836
2.025TyrPro: 2.025 ± 0.808
1.736TyrGln: 1.736 ± 0.518
3.182TyrArg: 3.182 ± 1.1
3.471TyrSer: 3.471 ± 0.896
0.868TyrThr: 0.868 ± 1.257
2.025TyrVal: 2.025 ± 0.687
0.289TyrTrp: 0.289 ± 0.159
0.868TyrTyr: 0.868 ± 0.381
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3458 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski