Amino acid dipepetide frequency for Radi vesiculovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.957AlaAla: 1.957 ± 0.926
1.398AlaCys: 1.398 ± 0.606
4.195AlaAsp: 4.195 ± 0.811
3.076AlaGlu: 3.076 ± 0.833
1.678AlaPhe: 1.678 ± 0.803
3.076AlaGly: 3.076 ± 0.98
1.119AlaHis: 1.119 ± 0.458
2.517AlaIle: 2.517 ± 0.751
2.796AlaLys: 2.796 ± 1.611
5.034AlaLeu: 5.034 ± 0.682
1.678AlaMet: 1.678 ± 0.818
2.237AlaAsn: 2.237 ± 0.755
2.796AlaPro: 2.796 ± 1.683
1.398AlaGln: 1.398 ± 0.553
1.957AlaArg: 1.957 ± 0.277
6.152AlaSer: 6.152 ± 0.864
2.517AlaThr: 2.517 ± 1.038
2.796AlaVal: 2.796 ± 1.888
1.398AlaTrp: 1.398 ± 1.22
3.076AlaTyr: 3.076 ± 0.997
0.0AlaXaa: 0.0 ± 0.0
Cys
0.839CysAla: 0.839 ± 0.417
0.559CysCys: 0.559 ± 0.319
1.678CysAsp: 1.678 ± 0.518
0.559CysGlu: 0.559 ± 0.437
0.839CysPhe: 0.839 ± 0.393
1.398CysGly: 1.398 ± 0.656
0.839CysHis: 0.839 ± 0.417
0.559CysIle: 0.559 ± 0.319
0.839CysLys: 0.839 ± 0.417
1.119CysLeu: 1.119 ± 0.639
0.0CysMet: 0.0 ± 0.0
0.559CysAsn: 0.559 ± 0.336
1.119CysPro: 1.119 ± 0.632
1.119CysGln: 1.119 ± 0.458
1.119CysArg: 1.119 ± 0.41
0.559CysSer: 0.559 ± 0.319
0.559CysThr: 0.559 ± 0.336
0.839CysVal: 0.839 ± 0.339
0.559CysTrp: 0.559 ± 0.319
0.28CysTyr: 0.28 ± 0.16
0.0CysXaa: 0.0 ± 0.0
Asp
2.237AspAla: 2.237 ± 0.864
0.559AspCys: 0.559 ± 0.394
5.313AspAsp: 5.313 ± 0.871
3.076AspGlu: 3.076 ± 2.107
2.237AspPhe: 2.237 ± 0.816
1.957AspGly: 1.957 ± 0.598
0.839AspHis: 0.839 ± 0.339
4.754AspIle: 4.754 ± 1.076
2.796AspLys: 2.796 ± 0.919
8.669AspLeu: 8.669 ± 2.09
1.398AspMet: 1.398 ± 1.268
2.237AspAsn: 2.237 ± 1.011
4.195AspPro: 4.195 ± 0.799
2.517AspGln: 2.517 ± 1.253
3.356AspArg: 3.356 ± 0.994
4.754AspSer: 4.754 ± 1.412
1.957AspThr: 1.957 ± 0.576
2.517AspVal: 2.517 ± 1.319
1.678AspTrp: 1.678 ± 0.677
3.076AspTyr: 3.076 ± 0.757
0.0AspXaa: 0.0 ± 0.0
Glu
5.034GluAla: 5.034 ± 1.723
0.839GluCys: 0.839 ± 0.339
3.356GluAsp: 3.356 ± 2.03
4.754GluGlu: 4.754 ± 1.194
3.635GluPhe: 3.635 ± 1.415
3.635GluGly: 3.635 ± 1.167
1.119GluHis: 1.119 ± 0.41
3.356GluIle: 3.356 ± 1.236
2.796GluLys: 2.796 ± 1.14
5.313GluLeu: 5.313 ± 0.839
2.796GluMet: 2.796 ± 0.883
1.119GluAsn: 1.119 ± 0.458
1.119GluPro: 1.119 ± 0.453
2.237GluGln: 2.237 ± 0.957
3.356GluArg: 3.356 ± 0.89
3.915GluSer: 3.915 ± 2.123
4.195GluThr: 4.195 ± 0.648
3.356GluVal: 3.356 ± 1.415
0.839GluTrp: 0.839 ± 0.496
2.517GluTyr: 2.517 ± 0.974
0.0GluXaa: 0.0 ± 0.0
Phe
2.517PheAla: 2.517 ± 1.132
0.839PheCys: 0.839 ± 0.479
1.957PheAsp: 1.957 ± 0.813
2.517PheGlu: 2.517 ± 1.038
2.237PhePhe: 2.237 ± 1.184
4.195PheGly: 4.195 ± 0.842
1.678PheHis: 1.678 ± 1.145
1.119PheIle: 1.119 ± 0.639
4.754PheLys: 4.754 ± 0.683
3.635PheLeu: 3.635 ± 1.715
0.28PheMet: 0.28 ± 0.16
1.957PheAsn: 1.957 ± 0.908
3.635PhePro: 3.635 ± 1.777
0.559PheGln: 0.559 ± 0.319
3.915PheArg: 3.915 ± 0.757
2.796PheSer: 2.796 ± 0.962
0.559PheThr: 0.559 ± 0.394
2.796PheVal: 2.796 ± 0.748
0.559PheTrp: 0.559 ± 0.394
1.398PheTyr: 1.398 ± 0.945
0.0PheXaa: 0.0 ± 0.0
Gly
1.957GlyAla: 1.957 ± 0.886
0.28GlyCys: 0.28 ± 0.16
4.474GlyAsp: 4.474 ± 0.655
3.915GlyGlu: 3.915 ± 1.41
2.796GlyPhe: 2.796 ± 0.919
3.356GlyGly: 3.356 ± 1.559
1.119GlyHis: 1.119 ± 0.672
3.356GlyIle: 3.356 ± 0.589
2.796GlyLys: 2.796 ± 1.184
8.389GlyLeu: 8.389 ± 1.402
1.398GlyMet: 1.398 ± 0.403
2.237GlyAsn: 2.237 ± 0.589
2.237GlyPro: 2.237 ± 0.912
3.635GlyGln: 3.635 ± 1.254
5.313GlyArg: 5.313 ± 1.753
6.711GlySer: 6.711 ± 2.2
3.915GlyThr: 3.915 ± 1.333
4.195GlyVal: 4.195 ± 1.387
1.678GlyTrp: 1.678 ± 0.678
0.839GlyTyr: 0.839 ± 0.909
0.0GlyXaa: 0.0 ± 0.0
His
0.839HisAla: 0.839 ± 0.393
0.559HisCys: 0.559 ± 0.336
1.119HisAsp: 1.119 ± 0.423
0.839HisGlu: 0.839 ± 0.382
2.517HisPhe: 2.517 ± 0.877
1.957HisGly: 1.957 ± 0.424
1.398HisHis: 1.398 ± 1.148
1.957HisIle: 1.957 ± 0.833
1.398HisLys: 1.398 ± 0.553
1.678HisLeu: 1.678 ± 0.319
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
2.517HisPro: 2.517 ± 0.528
1.119HisGln: 1.119 ± 0.878
1.678HisArg: 1.678 ± 0.49
1.398HisSer: 1.398 ± 1.054
1.398HisThr: 1.398 ± 1.054
1.957HisVal: 1.957 ± 0.722
1.119HisTrp: 1.119 ± 0.453
0.28HisTyr: 0.28 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
2.796IleAla: 2.796 ± 1.013
1.957IleCys: 1.957 ± 0.87
4.474IleAsp: 4.474 ± 0.605
3.635IleGlu: 3.635 ± 0.62
2.237IlePhe: 2.237 ± 1.096
4.195IleGly: 4.195 ± 0.836
2.237IleHis: 2.237 ± 0.916
4.195IleIle: 4.195 ± 0.886
5.034IleLys: 5.034 ± 0.711
4.474IleLeu: 4.474 ± 0.933
2.237IleMet: 2.237 ± 0.78
3.915IleAsn: 3.915 ± 1.104
3.356IlePro: 3.356 ± 0.7
3.635IleGln: 3.635 ± 0.898
5.872IleArg: 5.872 ± 1.831
4.754IleSer: 4.754 ± 1.264
1.119IleThr: 1.119 ± 0.672
1.957IleVal: 1.957 ± 0.722
1.398IleTrp: 1.398 ± 0.656
1.119IleTyr: 1.119 ± 0.506
0.0IleXaa: 0.0 ± 0.0
Lys
2.517LysAla: 2.517 ± 1.139
1.398LysCys: 1.398 ± 0.799
2.517LysAsp: 2.517 ± 0.974
5.313LysGlu: 5.313 ± 1.696
1.678LysPhe: 1.678 ± 0.336
4.474LysGly: 4.474 ± 1.586
0.839LysHis: 0.839 ± 0.836
4.474LysIle: 4.474 ± 1.261
3.356LysLys: 3.356 ± 1.712
5.313LysLeu: 5.313 ± 1.331
1.119LysMet: 1.119 ± 0.458
1.398LysAsn: 1.398 ± 1.448
1.957LysPro: 1.957 ± 1.194
0.559LysGln: 0.559 ± 0.604
3.356LysArg: 3.356 ± 0.97
6.432LysSer: 6.432 ± 1.028
3.076LysThr: 3.076 ± 1.001
3.356LysVal: 3.356 ± 1.155
0.839LysTrp: 0.839 ± 0.339
1.678LysTyr: 1.678 ± 0.678
0.0LysXaa: 0.0 ± 0.0
Leu
4.474LeuAla: 4.474 ± 1.429
2.237LeuCys: 2.237 ± 1.343
4.754LeuAsp: 4.754 ± 0.685
7.271LeuGlu: 7.271 ± 2.215
2.796LeuPhe: 2.796 ± 0.708
5.034LeuGly: 5.034 ± 1.186
2.517LeuHis: 2.517 ± 0.653
8.669LeuIle: 8.669 ± 1.322
4.754LeuLys: 4.754 ± 0.959
9.508LeuLeu: 9.508 ± 1.326
2.796LeuMet: 2.796 ± 1.248
4.195LeuAsn: 4.195 ± 0.822
3.076LeuPro: 3.076 ± 0.824
1.957LeuGln: 1.957 ± 1.216
7.83LeuArg: 7.83 ± 1.534
12.864LeuSer: 12.864 ± 1.642
4.474LeuThr: 4.474 ± 0.915
4.474LeuVal: 4.474 ± 0.839
1.119LeuTrp: 1.119 ± 0.41
3.076LeuTyr: 3.076 ± 1.281
0.0LeuXaa: 0.0 ± 0.0
Met
1.957MetAla: 1.957 ± 0.692
0.28MetCys: 0.28 ± 0.16
1.119MetAsp: 1.119 ± 0.458
1.398MetGlu: 1.398 ± 0.856
1.678MetPhe: 1.678 ± 0.938
1.119MetGly: 1.119 ± 0.423
0.559MetHis: 0.559 ± 0.319
1.398MetIle: 1.398 ± 0.656
1.119MetLys: 1.119 ± 0.584
1.957MetLeu: 1.957 ± 0.833
1.398MetMet: 1.398 ± 0.789
0.839MetAsn: 0.839 ± 0.479
1.119MetPro: 1.119 ± 0.453
0.839MetGln: 0.839 ± 0.479
2.517MetArg: 2.517 ± 0.928
3.356MetSer: 3.356 ± 0.958
1.119MetThr: 1.119 ± 0.639
1.119MetVal: 1.119 ± 0.458
0.0MetTrp: 0.0 ± 0.0
0.559MetTyr: 0.559 ± 0.73
0.0MetXaa: 0.0 ± 0.0
Asn
3.076AsnAla: 3.076 ± 0.67
0.28AsnCys: 0.28 ± 0.16
2.517AsnAsp: 2.517 ± 0.998
1.398AsnGlu: 1.398 ± 0.656
1.678AsnPhe: 1.678 ± 0.954
1.678AsnGly: 1.678 ± 0.678
2.517AsnHis: 2.517 ± 1.095
2.796AsnIle: 2.796 ± 1.016
1.398AsnLys: 1.398 ± 0.844
6.152AsnLeu: 6.152 ± 2.005
0.839AsnMet: 0.839 ± 0.496
1.398AsnAsn: 1.398 ± 0.522
2.237AsnPro: 2.237 ± 0.63
2.517AsnGln: 2.517 ± 0.702
1.678AsnArg: 1.678 ± 0.336
3.076AsnSer: 3.076 ± 1.269
2.517AsnThr: 2.517 ± 1.017
1.398AsnVal: 1.398 ± 0.949
0.839AsnTrp: 0.839 ± 0.445
1.678AsnTyr: 1.678 ± 0.701
0.0AsnXaa: 0.0 ± 0.0
Pro
3.356ProAla: 3.356 ± 0.485
0.28ProCys: 0.28 ± 0.455
2.796ProAsp: 2.796 ± 0.573
1.678ProGlu: 1.678 ± 0.502
1.957ProPhe: 1.957 ± 0.594
2.517ProGly: 2.517 ± 1.488
1.119ProHis: 1.119 ± 0.41
2.517ProIle: 2.517 ± 0.992
1.678ProLys: 1.678 ± 0.786
5.034ProLeu: 5.034 ± 0.739
0.839ProMet: 0.839 ± 0.768
2.517ProAsn: 2.517 ± 1.623
2.237ProPro: 2.237 ± 1.125
0.559ProGln: 0.559 ± 0.336
1.678ProArg: 1.678 ± 0.444
5.313ProSer: 5.313 ± 1.961
3.076ProThr: 3.076 ± 0.453
3.915ProVal: 3.915 ± 0.712
0.839ProTrp: 0.839 ± 0.339
1.119ProTyr: 1.119 ± 1.35
0.0ProXaa: 0.0 ± 0.0
Gln
1.678GlnAla: 1.678 ± 0.767
0.839GlnCys: 0.839 ± 0.417
3.076GlnAsp: 3.076 ± 1.077
0.839GlnGlu: 0.839 ± 0.417
1.957GlnPhe: 1.957 ± 0.594
2.237GlnGly: 2.237 ± 0.452
0.839GlnHis: 0.839 ± 0.763
1.398GlnIle: 1.398 ± 0.522
1.957GlnLys: 1.957 ± 0.588
2.517GlnLeu: 2.517 ± 0.564
1.119GlnMet: 1.119 ± 0.453
1.119GlnAsn: 1.119 ± 0.506
1.119GlnPro: 1.119 ± 0.632
0.0GlnGln: 0.0 ± 0.0
2.237GlnArg: 2.237 ± 0.721
5.034GlnSer: 5.034 ± 1.715
2.237GlnThr: 2.237 ± 0.786
1.398GlnVal: 1.398 ± 0.274
1.119GlnTrp: 1.119 ± 0.423
0.559GlnTyr: 0.559 ± 0.437
0.0GlnXaa: 0.0 ± 0.0
Arg
3.356ArgAla: 3.356 ± 1.46
0.28ArgCys: 0.28 ± 0.16
2.796ArgAsp: 2.796 ± 0.588
4.195ArgGlu: 4.195 ± 2.069
4.195ArgPhe: 4.195 ± 1.347
4.754ArgGly: 4.754 ± 1.023
1.119ArgHis: 1.119 ± 0.453
4.474ArgIle: 4.474 ± 1.358
3.635ArgLys: 3.635 ± 1.059
5.034ArgLeu: 5.034 ± 0.979
1.119ArgMet: 1.119 ± 0.458
4.195ArgAsn: 4.195 ± 0.914
1.957ArgPro: 1.957 ± 0.531
1.957ArgGln: 1.957 ± 0.722
2.517ArgArg: 2.517 ± 0.741
5.872ArgSer: 5.872 ± 0.914
2.796ArgThr: 2.796 ± 0.351
5.872ArgVal: 5.872 ± 0.914
1.398ArgTrp: 1.398 ± 0.274
1.957ArgTyr: 1.957 ± 0.908
0.0ArgXaa: 0.0 ± 0.0
Ser
5.872SerAla: 5.872 ± 1.597
0.839SerCys: 0.839 ± 0.803
5.313SerAsp: 5.313 ± 0.864
5.034SerGlu: 5.034 ± 2.276
3.076SerPhe: 3.076 ± 0.663
6.711SerGly: 6.711 ± 1.032
1.119SerHis: 1.119 ± 0.639
6.432SerIle: 6.432 ± 0.887
4.195SerLys: 4.195 ± 0.966
8.949SerLeu: 8.949 ± 1.087
1.957SerMet: 1.957 ± 0.531
5.593SerAsn: 5.593 ± 0.691
3.356SerPro: 3.356 ± 0.902
3.635SerGln: 3.635 ± 0.824
5.872SerArg: 5.872 ± 1.348
7.55SerSer: 7.55 ± 1.43
6.711SerThr: 6.711 ± 0.697
5.313SerVal: 5.313 ± 1.242
2.237SerTrp: 2.237 ± 0.82
3.915SerTyr: 3.915 ± 0.503
0.0SerXaa: 0.0 ± 0.0
Thr
3.356ThrAla: 3.356 ± 0.944
0.839ThrCys: 0.839 ± 0.339
2.517ThrAsp: 2.517 ± 1.842
3.356ThrGlu: 3.356 ± 0.496
1.678ThrPhe: 1.678 ± 0.518
3.915ThrGly: 3.915 ± 1.336
1.678ThrHis: 1.678 ± 0.629
3.356ThrIle: 3.356 ± 0.627
3.076ThrLys: 3.076 ± 1.6
4.754ThrLeu: 4.754 ± 0.627
1.957ThrMet: 1.957 ± 1.118
1.398ThrAsn: 1.398 ± 0.522
1.957ThrPro: 1.957 ± 1.135
1.678ThrGln: 1.678 ± 0.336
1.957ThrArg: 1.957 ± 0.796
4.754ThrSer: 4.754 ± 1.264
3.356ThrThr: 3.356 ± 1.553
2.517ThrVal: 2.517 ± 0.372
1.398ThrTrp: 1.398 ± 0.504
1.398ThrTyr: 1.398 ± 0.448
0.0ThrXaa: 0.0 ± 0.0
Val
3.356ValAla: 3.356 ± 1.063
1.398ValCys: 1.398 ± 0.522
3.635ValAsp: 3.635 ± 0.856
3.915ValGlu: 3.915 ± 2.014
2.237ValPhe: 2.237 ± 0.988
3.635ValGly: 3.635 ± 1.208
1.678ValHis: 1.678 ± 1.071
3.356ValIle: 3.356 ± 1.308
3.076ValLys: 3.076 ± 1.612
4.474ValLeu: 4.474 ± 1.402
1.119ValMet: 1.119 ± 0.458
1.119ValAsn: 1.119 ± 0.315
2.237ValPro: 2.237 ± 0.556
1.957ValGln: 1.957 ± 0.567
4.195ValArg: 4.195 ± 0.49
3.915ValSer: 3.915 ± 1.192
3.076ValThr: 3.076 ± 1.065
2.237ValVal: 2.237 ± 1.255
1.119ValTrp: 1.119 ± 0.423
2.237ValTyr: 2.237 ± 0.944
0.0ValXaa: 0.0 ± 0.0
Trp
0.28TrpAla: 0.28 ± 0.16
0.0TrpCys: 0.0 ± 0.0
1.119TrpAsp: 1.119 ± 0.315
1.398TrpGlu: 1.398 ± 0.522
1.119TrpPhe: 1.119 ± 0.672
2.517TrpGly: 2.517 ± 0.702
0.28TrpHis: 0.28 ± 0.16
1.678TrpIle: 1.678 ± 0.518
1.678TrpLys: 1.678 ± 0.626
1.678TrpLeu: 1.678 ± 0.896
0.559TrpMet: 0.559 ± 0.336
1.398TrpAsn: 1.398 ± 0.656
0.28TrpPro: 0.28 ± 0.16
0.28TrpGln: 0.28 ± 0.485
1.119TrpArg: 1.119 ± 0.41
1.957TrpSer: 1.957 ± 0.813
1.119TrpThr: 1.119 ± 0.41
1.398TrpVal: 1.398 ± 0.403
0.28TrpTrp: 0.28 ± 0.455
0.28TrpTyr: 0.28 ± 0.402
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.398TyrAla: 1.398 ± 0.568
0.28TyrCys: 0.28 ± 0.402
1.678TyrAsp: 1.678 ± 0.922
1.119TyrGlu: 1.119 ± 0.639
1.678TyrPhe: 1.678 ± 0.595
2.237TyrGly: 2.237 ± 0.704
1.119TyrHis: 1.119 ± 0.453
2.237TyrIle: 2.237 ± 0.898
2.517TyrLys: 2.517 ± 0.395
4.195TyrLeu: 4.195 ± 1.061
0.559TyrMet: 0.559 ± 0.394
2.237TyrAsn: 2.237 ± 0.999
2.517TyrPro: 2.517 ± 0.652
1.119TyrGln: 1.119 ± 1.024
1.957TyrArg: 1.957 ± 0.47
2.517TyrSer: 2.517 ± 0.928
1.119TyrThr: 1.119 ± 1.208
0.559TyrVal: 0.559 ± 0.437
0.0TyrTrp: 0.0 ± 0.0
0.559TyrTyr: 0.559 ± 0.336
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3577 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski