Amino acid dipeptide frequency for American bat vesiculovirus TFFN-2013

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
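A minimal sketch of how such a frequency could be computed is shown here. It assumes one-letter sequence codes (the table uses three-letter codes), overlapping windows that never span two proteins, and normalization by the total number of dipeptide windows. The function name and the toy sequences are hypothetical, and the database's own normalization may differ.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: the frequency is the count divided by the total number of
    overlapping dipeptide windows; the database may normalize differently.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein separately,
        # so no dipeptide is formed across a protein boundary.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dipeptide: 1000.0 * n / total for dipeptide, n in counts.items()}


# Hypothetical toy input, not taken from the proteome described here.
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_frequencies(toy)
for dipeptide in sorted(freqs):
    print(dipeptide, round(freqs[dipeptide], 3))
```

The returned values sum to 1000 ‰, which gives a quick sanity check on any input.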

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
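The expected value and the over/under comparison can be sketched as follows. The page does not state whether the coloring uses the 400 or the 441 baseline, or whether any margin is applied, so the function names, the default alphabet size of 20 and the strict comparison are all assumptions made for illustration.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (in per mille) of one dipeptide if all are equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


print(expected_uniform_per_mille(20))  # 400 possible dipeptides
print(expected_uniform_per_mille(21))  # 441 possible dipeptides, with Xaa
print(classify(9.807))  # LeuSer value from the table
print(classify(0.865))  # AlaCys value from the table
```

With 20 amino acids the baseline is 2.5 ‰; with Xaa included it drops to roughly 2.27 ‰, so a few values between the two baselines would change label depending on which one is used.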

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
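As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 2.884  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```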

For more information, see the sequence space article on Wikipedia.

Each block below is headed by the first residue; entries list the dipeptide (first residue, then second residue) with its frequency ± error in ‰.
Order of residues: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.884 ± 1.219
AlaCys: 0.865 ± 0.306
AlaAsp: 3.173 ± 0.791
AlaGlu: 2.307 ± 1.709
AlaPhe: 0.865 ± 0.477
AlaGly: 2.307 ± 0.95
AlaHis: 1.442 ± 0.296
AlaIle: 2.884 ± 1.026
AlaLys: 3.461 ± 0.588
AlaLeu: 6.634 ± 1.355
AlaMet: 0.865 ± 0.477
AlaAsn: 1.442 ± 0.796
AlaPro: 2.019 ± 1.024
AlaGln: 2.307 ± 0.929
AlaArg: 1.442 ± 0.772
AlaSer: 3.173 ± 1.362
AlaThr: 1.731 ± 0.652
AlaVal: 2.019 ± 0.665
AlaTrp: 1.154 ± 0.562
AlaTyr: 0.865 ± 0.299
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.865 ± 0.609
CysCys: 0.577 ± 0.318
CysAsp: 0.865 ± 0.662
CysGlu: 0.288 ± 0.159
CysPhe: 0.288 ± 0.159
CysGly: 0.865 ± 0.306
CysHis: 0.577 ± 0.303
CysIle: 0.865 ± 0.306
CysLys: 2.019 ± 0.665
CysLeu: 2.019 ± 0.897
CysMet: 0.288 ± 0.159
CysAsn: 1.442 ± 0.499
CysPro: 1.154 ± 0.562
CysGln: 1.154 ± 0.879
CysArg: 0.865 ± 0.306
CysSer: 2.019 ± 0.861
CysThr: 0.577 ± 0.281
CysVal: 1.154 ± 0.637
CysTrp: 0.865 ± 0.306
CysTyr: 0.288 ± 0.159
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.307 ± 0.723
AspCys: 1.154 ± 0.948
AspAsp: 2.884 ± 1.125
AspGlu: 2.596 ± 1.476
AspPhe: 2.307 ± 0.595
AspGly: 3.173 ± 1.227
AspHis: 1.442 ± 0.296
AspIle: 2.307 ± 0.766
AspLys: 3.461 ± 0.751
AspLeu: 8.365 ± 0.871
AspMet: 3.173 ± 1.85
AspAsn: 2.884 ± 1.456
AspPro: 5.192 ± 2.077
AspGln: 2.596 ± 0.404
AspArg: 1.731 ± 0.517
AspSer: 3.461 ± 0.588
AspThr: 2.596 ± 0.528
AspVal: 2.307 ± 0.533
AspTrp: 1.442 ± 0.65
AspTyr: 4.615 ± 1.488
AspXaa: 0.288 ± 0.159
Glu
GluAla: 1.442 ± 0.631
GluCys: 1.154 ± 0.382
GluAsp: 5.192 ± 1.859
GluGlu: 3.173 ± 1.889
GluPhe: 2.596 ± 1.43
GluGly: 2.884 ± 0.818
GluHis: 2.307 ± 1.118
GluIle: 4.903 ± 1.124
GluLys: 3.173 ± 0.346
GluLeu: 2.596 ± 1.081
GluMet: 0.865 ± 0.517
GluAsn: 4.038 ± 0.739
GluPro: 2.019 ± 0.349
GluGln: 0.288 ± 0.159
GluArg: 3.173 ± 0.792
GluSer: 6.057 ± 1.476
GluThr: 4.615 ± 1.117
GluVal: 4.327 ± 0.578
GluTrp: 0.288 ± 0.345
GluTyr: 3.173 ± 0.861
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.019 ± 0.49
PheCys: 0.865 ± 0.963
PheAsp: 1.442 ± 0.796
PheGlu: 2.884 ± 0.94
PhePhe: 0.865 ± 0.477
PheGly: 3.173 ± 0.92
PheHis: 1.154 ± 0.317
PheIle: 1.154 ± 0.606
PheLys: 3.461 ± 1.671
PheLeu: 5.192 ± 1.249
PheMet: 0.865 ± 0.477
PheAsn: 1.154 ± 0.452
PhePro: 2.307 ± 0.612
PheGln: 1.154 ± 0.637
PheArg: 2.884 ± 0.564
PheSer: 2.884 ± 0.823
PheThr: 2.019 ± 0.914
PheVal: 2.307 ± 0.595
PheTrp: 0.865 ± 0.552
PheTyr: 1.442 ± 0.922
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.731 ± 0.589
GlyCys: 0.577 ± 0.699
GlyAsp: 3.461 ± 0.779
GlyGlu: 3.173 ± 0.557
GlyPhe: 2.019 ± 0.674
GlyGly: 4.327 ± 0.409
GlyHis: 0.865 ± 0.477
GlyIle: 6.057 ± 1.584
GlyLys: 2.884 ± 1.044
GlyLeu: 5.769 ± 1.043
GlyMet: 1.154 ± 0.646
GlyAsn: 1.442 ± 0.753
GlyPro: 2.884 ± 0.818
GlyGln: 2.884 ± 0.494
GlyArg: 2.596 ± 0.625
GlySer: 4.903 ± 0.585
GlyThr: 2.596 ± 0.591
GlyVal: 4.327 ± 0.643
GlyTrp: 1.154 ± 0.317
GlyTyr: 2.019 ± 0.369
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.865 ± 0.306
HisCys: 0.288 ± 0.159
HisAsp: 1.442 ± 1.277
HisGlu: 2.019 ± 0.78
HisPhe: 1.442 ± 0.536
HisGly: 1.442 ± 0.796
HisHis: 1.442 ± 0.439
HisIle: 2.884 ± 1.235
HisLys: 2.884 ± 0.848
HisLeu: 2.307 ± 0.634
HisMet: 0.288 ± 0.159
HisAsn: 1.154 ± 0.637
HisPro: 1.442 ± 0.296
HisGln: 1.442 ± 0.414
HisArg: 1.731 ± 0.517
HisSer: 2.307 ± 0.887
HisThr: 0.577 ± 0.318
HisVal: 1.154 ± 0.317
HisTrp: 0.865 ± 0.306
HisTyr: 0.865 ± 0.517
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.461 ± 0.801
IleCys: 1.731 ± 0.908
IleAsp: 4.615 ± 1.331
IleGlu: 4.327 ± 1.001
IlePhe: 4.038 ± 0.748
IleGly: 3.461 ± 0.544
IleHis: 2.596 ± 0.896
IleIle: 3.461 ± 0.871
IleLys: 6.634 ± 0.955
IleLeu: 5.769 ± 1.046
IleMet: 0.577 ± 0.318
IleAsn: 2.307 ± 0.894
IlePro: 5.192 ± 1.395
IleGln: 2.596 ± 0.704
IleArg: 5.769 ± 1.379
IleSer: 4.903 ± 1.659
IleThr: 2.596 ± 2.674
IleVal: 2.884 ± 0.676
IleTrp: 1.442 ± 0.796
IleTyr: 3.173 ± 1.176
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.731 ± 0.809
LysCys: 1.154 ± 0.389
LysAsp: 4.327 ± 0.965
LysGlu: 4.038 ± 1.487
LysPhe: 2.596 ± 0.609
LysGly: 2.884 ± 0.564
LysHis: 1.154 ± 0.606
LysIle: 4.327 ± 2.114
LysLys: 3.75 ± 0.384
LysLeu: 3.75 ± 1.021
LysMet: 2.596 ± 1.568
LysAsn: 3.461 ± 1.006
LysPro: 1.154 ± 0.646
LysGln: 2.019 ± 1.259
LysArg: 4.327 ± 1.302
LysSer: 6.634 ± 1.445
LysThr: 5.48 ± 0.614
LysVal: 4.327 ± 0.843
LysTrp: 2.019 ± 0.49
LysTyr: 2.019 ± 0.506
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.327 ± 2.042
LeuCys: 2.307 ± 1.211
LeuAsp: 4.903 ± 1.166
LeuGlu: 6.057 ± 1.51
LeuPhe: 2.884 ± 0.823
LeuGly: 4.327 ± 1.163
LeuHis: 1.731 ± 0.955
LeuIle: 7.499 ± 1.514
LeuLys: 5.192 ± 0.537
LeuLeu: 7.788 ± 1.238
LeuMet: 2.019 ± 0.675
LeuAsn: 5.769 ± 1.311
LeuPro: 3.173 ± 0.557
LeuGln: 3.173 ± 1.418
LeuArg: 5.192 ± 0.931
LeuSer: 9.807 ± 1.59
LeuThr: 6.346 ± 1.227
LeuVal: 4.327 ± 1.675
LeuTrp: 1.442 ± 0.558
LeuTyr: 3.75 ± 0.697
LeuXaa: 0.288 ± 0.581
Met
MetAla: 1.731 ± 0.56
MetCys: 0.0 ± 0.0
MetAsp: 2.019 ± 1.526
MetGlu: 2.019 ± 1.06
MetPhe: 1.154 ± 0.862
MetGly: 1.442 ± 0.513
MetHis: 0.577 ± 0.318
MetIle: 1.731 ± 0.355
MetLys: 1.731 ± 0.843
MetLeu: 1.154 ± 0.866
MetMet: 1.154 ± 0.389
MetAsn: 1.442 ± 0.513
MetPro: 0.288 ± 0.345
MetGln: 0.288 ± 0.159
MetArg: 2.307 ± 0.894
MetSer: 2.019 ± 0.519
MetThr: 0.865 ± 0.623
MetVal: 2.019 ± 0.965
MetTrp: 0.288 ± 0.159
MetTyr: 0.288 ± 0.159
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.731 ± 0.652
AsnCys: 0.288 ± 0.375
AsnAsp: 0.865 ± 0.306
AsnGlu: 2.596 ± 1.039
AsnPhe: 2.019 ± 0.799
AsnGly: 3.173 ± 1.197
AsnHis: 2.019 ± 0.78
AsnIle: 3.75 ± 0.93
AsnLys: 2.307 ± 0.352
AsnLeu: 6.922 ± 1.406
AsnMet: 0.577 ± 0.318
AsnAsn: 2.596 ± 1.103
AsnPro: 2.307 ± 1.212
AsnGln: 2.019 ± 0.625
AsnArg: 2.596 ± 1.432
AsnSer: 4.903 ± 1.427
AsnThr: 3.173 ± 1.362
AsnVal: 1.154 ± 1.053
AsnTrp: 0.865 ± 0.306
AsnTyr: 1.731 ± 0.612
AsnXaa: 0.288 ± 0.375
Pro
ProAla: 4.038 ± 1.338
ProCys: 0.288 ± 0.375
ProAsp: 4.903 ± 0.56
ProGlu: 2.596 ± 1.918
ProPhe: 2.596 ± 0.907
ProGly: 3.173 ± 2.335
ProHis: 0.577 ± 0.318
ProIle: 2.019 ± 1.114
ProLys: 2.307 ± 0.533
ProLeu: 4.903 ± 1.659
ProMet: 0.0 ± 0.0
ProAsn: 1.442 ± 0.587
ProPro: 4.038 ± 1.326
ProGln: 0.577 ± 0.318
ProArg: 1.154 ± 0.637
ProSer: 4.038 ± 1.42
ProThr: 3.173 ± 0.47
ProVal: 4.327 ± 0.788
ProTrp: 0.577 ± 0.318
ProTyr: 2.019 ± 1.008
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.731 ± 0.434
GlnCys: 0.577 ± 0.281
GlnAsp: 2.307 ± 0.95
GlnGlu: 2.019 ± 0.577
GlnPhe: 0.577 ± 0.526
GlnGly: 2.307 ± 0.634
GlnHis: 0.288 ± 0.159
GlnIle: 3.75 ± 0.665
GlnLys: 3.173 ± 0.76
GlnLeu: 2.019 ± 0.755
GlnMet: 0.865 ± 0.299
GlnAsn: 2.019 ± 0.78
GlnPro: 0.288 ± 0.159
GlnGln: 1.154 ± 0.382
GlnArg: 0.865 ± 0.306
GlnSer: 2.884 ± 0.822
GlnThr: 1.442 ± 0.995
GlnVal: 2.307 ± 0.594
GlnTrp: 0.288 ± 0.159
GlnTyr: 1.442 ± 0.513
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.615 ± 1.546
ArgCys: 1.154 ± 0.434
ArgAsp: 2.307 ± 0.769
ArgGlu: 3.75 ± 0.991
ArgPhe: 1.731 ± 0.599
ArgGly: 3.461 ± 0.975
ArgHis: 2.307 ± 0.533
ArgIle: 2.884 ± 0.846
ArgLys: 2.884 ± 0.514
ArgLeu: 3.75 ± 1.371
ArgMet: 1.442 ± 0.414
ArgAsn: 2.307 ± 0.929
ArgPro: 2.596 ± 1.039
ArgGln: 1.442 ± 0.626
ArgArg: 1.442 ± 0.956
ArgSer: 2.596 ± 0.404
ArgThr: 3.173 ± 0.799
ArgVal: 6.057 ± 0.523
ArgTrp: 0.577 ± 0.318
ArgTyr: 0.865 ± 0.299
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.461 ± 0.801
SerCys: 2.884 ± 1.175
SerAsp: 7.499 ± 1.592
SerGlu: 5.48 ± 0.611
SerPhe: 4.615 ± 0.867
SerGly: 4.327 ± 1.134
SerHis: 2.884 ± 0.888
SerIle: 6.922 ± 1.807
SerLys: 4.038 ± 0.748
SerLeu: 9.518 ± 1.449
SerMet: 1.154 ± 0.555
SerAsn: 3.75 ± 0.96
SerPro: 4.327 ± 0.659
SerGln: 2.019 ± 0.709
SerArg: 4.038 ± 1.268
SerSer: 8.365 ± 1.523
SerThr: 3.75 ± 0.396
SerVal: 6.634 ± 1.492
SerTrp: 2.884 ± 0.974
SerTyr: 2.019 ± 0.49
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.019 ± 0.832
ThrCys: 1.154 ± 0.637
ThrAsp: 1.731 ± 1.718
ThrGlu: 2.596 ± 0.404
ThrPhe: 2.884 ± 1.239
ThrGly: 2.596 ± 0.581
ThrHis: 2.307 ± 0.595
ThrIle: 4.615 ± 0.686
ThrLys: 2.884 ± 1.312
ThrLeu: 4.038 ± 1.044
ThrMet: 2.596 ± 0.737
ThrAsn: 2.019 ± 0.519
ThrPro: 2.307 ± 0.887
ThrGln: 1.442 ± 0.796
ThrArg: 2.884 ± 0.494
ThrSer: 5.192 ± 0.417
ThrThr: 3.461 ± 0.814
ThrVal: 3.173 ± 1.017
ThrTrp: 0.865 ± 0.583
ThrTyr: 1.731 ± 1.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.154 ± 0.562
ValCys: 1.154 ± 0.389
ValAsp: 3.75 ± 1.419
ValGlu: 3.75 ± 0.708
ValPhe: 1.731 ± 0.955
ValGly: 2.596 ± 0.936
ValHis: 1.154 ± 0.555
ValIle: 5.48 ± 1.176
ValLys: 3.461 ± 1.385
ValLeu: 4.903 ± 1.596
ValMet: 1.731 ± 0.414
ValAsn: 2.884 ± 1.591
ValPro: 4.038 ± 0.721
ValGln: 2.019 ± 0.577
ValArg: 3.75 ± 1.224
ValSer: 8.365 ± 1.268
ValThr: 2.596 ± 1.103
ValVal: 3.461 ± 0.915
ValTrp: 1.154 ± 0.762
ValTyr: 1.731 ± 0.91
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.288 ± 0.159
TrpCys: 0.577 ± 0.318
TrpAsp: 0.865 ± 0.998
TrpGlu: 1.731 ± 0.691
TrpPhe: 0.865 ± 0.477
TrpGly: 3.173 ± 0.798
TrpHis: 0.865 ± 0.306
TrpIle: 1.731 ± 0.553
TrpLys: 1.154 ± 0.555
TrpLeu: 1.154 ± 1.025
TrpMet: 1.154 ± 0.69
TrpAsn: 0.865 ± 0.306
TrpPro: 0.288 ± 0.159
TrpGln: 0.288 ± 0.159
TrpArg: 0.0 ± 0.0
TrpSer: 2.019 ± 0.78
TrpThr: 0.577 ± 0.318
TrpVal: 1.442 ± 0.587
TrpTrp: 0.865 ± 0.405
TrpTyr: 0.288 ± 0.375
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.442 ± 0.513
TyrCys: 0.288 ± 0.375
TyrAsp: 1.731 ± 0.433
TyrGlu: 0.865 ± 0.583
TyrPhe: 2.019 ± 0.49
TyrGly: 1.731 ± 0.434
TyrHis: 1.154 ± 0.317
TyrIle: 2.307 ± 0.353
TyrLys: 3.173 ± 0.66
TyrLeu: 3.75 ± 0.396
TyrMet: 0.865 ± 0.963
TyrAsn: 3.461 ± 1.129
TyrPro: 1.731 ± 0.446
TyrGln: 1.442 ± 0.536
TyrArg: 2.596 ± 1.224
TyrSer: 4.038 ± 1.579
TyrThr: 0.577 ± 0.433
TyrVal: 1.154 ± 0.317
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.154 ± 1.033
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.577 ± 0.651
XaaVal: 0.0 ± 0.0
XaaTrp: 0.288 ± 0.159
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3468 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
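One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The page does not describe the exact procedure, so the use of the sample standard deviation, the normalization, the function names and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping windows."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome with one-letter codes, not the real sequences.
toy_proteome = ["MKLLAGS", "ALLKLL", "GSGSMK", "LLLL", "MKTAY"]
err = bootstrap_error(toy_proteome, "LL")
print(round(err, 3))
```

With only 5 proteins, as in this proteome, each resample can differ a lot from the original set, which is consistent with the relatively large errors seen for some dipeptides in the table.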

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski