Amino acid dipeptide frequency for Acute bee paralysis virus (strain Rothamsted) (ABPV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
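As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It assumes one-letter amino acid codes, counts overlapping adjacent pairs within each protein (never across protein boundaries), and maps any non-standard letter to X (Xaa). The function name `dipeptide_per_mille` and these counting choices are assumptions made for the example; the exact procedure used to build the table is not described on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return a dict mapping each of the 441 dipeptides to its frequency in per mille."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every letter outside the 20 standard ones is treated as Xaa.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs: a protein of length n yields n - 1 dipeptides.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    keys = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {k: 0.0 for k in keys}
    return {k: 1000.0 * counts[k] / total for k in keys}


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAC", "CA"])
    print(freqs["AA"], freqs["AC"], freqs["CA"])
```

With real data, `proteins` would be the list of protein sequences of the proteome; the toy input above contains three dipeptides in total (AA, AC, CA).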

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.
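The uniform expectation and the per mille unit can be tied together in a short Python sketch. The helper names `expected_per_mille`, `per_mille_to_fraction` and `classify` are hypothetical, and comparing against the 20-letter baseline (1/400) is an assumption; the page does not state which threshold drives the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille (1/400 -> 2.5 for 20 residues)."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed_per_mille, alphabet_size=20):
    """Label a value as over- or underrepresented relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    print(expected_per_mille())    # uniform expectation with 20 amino acids
    print(expected_per_mille(21))  # uniform expectation when Xaa is included
    print(classify(7.928))         # AspLeu value from the table
    print(classify(0.689))         # AlaAla value from the table
```

Under this assumed threshold, AspLeu (7.928‰) lies above the uniform expectation and AlaAla (0.689‰) below it.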

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue; second residue in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.689 ± 0.209
AlaCys: 0.0 ± 0.0
AlaAsp: 3.102 ± 2.196
AlaGlu: 2.758 ± 0.516
AlaPhe: 4.137 ± 1.181
AlaGly: 3.447 ± 1.046
AlaHis: 0.689 ± 0.36
AlaIle: 4.137 ± 0.603
AlaLys: 2.068 ± 0.59
AlaLeu: 3.102 ± 0.72
AlaMet: 1.379 ± 0.258
AlaAsn: 2.413 ± 0.973
AlaPro: 2.758 ± 1.271
AlaGln: 1.379 ± 0.258
AlaArg: 2.413 ± 0.516
AlaSer: 3.447 ± 1.532
AlaThr: 2.413 ± 0.516
AlaVal: 2.413 ± 0.382
AlaTrp: 0.689 ± 1.49
AlaTyr: 2.413 ± 0.382
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.379 ± 0.469
CysCys: 0.0 ± 0.0
CysAsp: 1.379 ± 0.258
CysGlu: 2.068 ± 0.59
CysPhe: 0.689 ± 0.36
CysGly: 2.068 ± 0.59
CysHis: 0.0 ± 0.0
CysIle: 1.034 ± 0.151
CysLys: 0.0 ± 0.0
CysLeu: 1.034 ± 0.54
CysMet: 0.345 ± 0.292
CysAsn: 0.345 ± 0.36
CysPro: 0.345 ± 1.05
CysGln: 0.689 ± 0.209
CysArg: 1.034 ± 0.54
CysSer: 1.724 ± 0.9
CysThr: 0.0 ± 0.0
CysVal: 0.345 ± 0.18
CysTrp: 0.345 ± 0.18
CysTyr: 1.379 ± 0.919
CysXaa: 0.345 ± 0.18
Asp
AspAla: 3.102 ± 0.453
AspCys: 0.689 ± 0.36
AspAsp: 4.137 ± 1.255
AspGlu: 5.171 ± 1.709
AspPhe: 4.137 ± 0.603
AspGly: 2.068 ± 0.302
AspHis: 0.345 ± 0.36
AspIle: 1.724 ± 0.9
AspLys: 5.171 ± 0.89
AspLeu: 7.928 ± 1.404
AspMet: 1.724 ± 1.47
AspAsn: 3.792 ± 1.886
AspPro: 1.724 ± 0.766
AspGln: 2.413 ± 1.308
AspArg: 1.379 ± 0.258
AspSer: 4.481 ± 0.927
AspThr: 3.447 ± 0.635
AspVal: 6.549 ± 1.147
AspTrp: 0.689 ± 0.209
AspTyr: 1.034 ± 0.561
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.826 ± 1.532
GluCys: 0.345 ± 0.18
GluAsp: 2.758 ± 0.516
GluGlu: 4.481 ± 1.356
GluPhe: 3.447 ± 1.8
GluGly: 2.068 ± 0.59
GluHis: 1.034 ± 0.54
GluIle: 3.792 ± 0.635
GluLys: 2.413 ± 0.766
GluLeu: 5.171 ± 1.709
GluMet: 3.447 ± 0.521
GluAsn: 5.171 ± 0.952
GluPro: 1.379 ± 0.258
GluGln: 3.102 ± 0.453
GluArg: 1.379 ± 0.418
GluSer: 3.102 ± 1.121
GluThr: 4.481 ± 1.135
GluVal: 5.86 ± 1.184
GluTrp: 0.689 ± 0.36
GluTyr: 3.102 ± 1.121
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.379 ± 0.72
PheCys: 0.345 ± 0.18
PheAsp: 2.758 ± 0.516
PheGlu: 1.724 ± 0.9
PhePhe: 1.724 ± 0.766
PheGly: 4.137 ± 0.774
PheHis: 1.379 ± 0.418
PheIle: 2.758 ± 0.943
PheLys: 1.379 ± 0.258
PheLeu: 3.102 ± 0.453
PheMet: 1.379 ± 1.519
PheAsn: 2.413 ± 0.766
PhePro: 1.379 ± 0.258
PheGln: 1.724 ± 0.766
PheArg: 2.413 ± 0.766
PheSer: 3.792 ± 1.192
PheThr: 2.758 ± 0.451
PheVal: 3.792 ± 0.635
PheTrp: 1.379 ± 0.72
PheTyr: 1.034 ± 1.461
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.379 ± 1.455
GlyCys: 0.345 ± 0.18
GlyAsp: 4.826 ± 1.532
GlyGlu: 3.447 ± 0.837
GlyPhe: 3.102 ± 1.121
GlyGly: 1.379 ± 1.44
GlyHis: 0.689 ± 0.36
GlyIle: 4.137 ± 1.738
GlyLys: 4.826 ± 1.532
GlyLeu: 2.068 ± 1.08
GlyMet: 1.724 ± 0.418
GlyAsn: 2.413 ± 0.516
GlyPro: 3.102 ± 0.72
GlyGln: 1.724 ± 0.418
GlyArg: 2.413 ± 0.973
GlySer: 2.758 ± 1.271
GlyThr: 3.447 ± 0.635
GlyVal: 2.068 ± 1.121
GlyTrp: 1.379 ± 0.919
GlyTyr: 2.413 ± 0.973
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.345 ± 0.18
HisCys: 0.345 ± 0.18
HisAsp: 0.345 ± 0.36
HisGlu: 1.034 ± 0.151
HisPhe: 1.379 ± 0.72
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.724 ± 0.317
HisLys: 1.379 ± 0.72
HisLeu: 1.379 ± 0.72
HisMet: 1.379 ± 0.72
HisAsn: 0.345 ± 0.36
HisPro: 0.345 ± 0.18
HisGln: 0.345 ± 0.18
HisArg: 0.345 ± 1.539
HisSer: 1.034 ± 0.151
HisThr: 0.689 ± 0.209
HisVal: 1.724 ± 0.418
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.792 ± 0.635
IleCys: 1.379 ± 0.72
IleAsp: 4.826 ± 0.887
IleGlu: 4.826 ± 1.089
IlePhe: 2.068 ± 0.302
IleGly: 3.447 ± 1.596
IleHis: 1.034 ± 0.54
IleIle: 4.481 ± 1.323
IleLys: 4.481 ± 1.323
IleLeu: 5.515 ± 1.213
IleMet: 2.413 ± 0.382
IleAsn: 5.515 ± 1.032
IlePro: 4.137 ± 2.125
IleGln: 1.724 ± 0.418
IleArg: 5.171 ± 2.423
IleSer: 5.515 ± 0.811
IleThr: 7.239 ± 1.01
IleVal: 5.86 ± 0.885
IleTrp: 0.689 ± 0.36
IleTyr: 1.034 ± 0.561
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.068 ± 0.758
LysCys: 1.034 ± 0.54
LysAsp: 3.447 ± 1.3
LysGlu: 3.447 ± 0.837
LysPhe: 3.792 ± 1.007
LysGly: 4.137 ± 0.603
LysHis: 1.724 ± 0.418
LysIle: 7.584 ± 1.598
LysLys: 4.826 ± 2.835
LysLeu: 5.515 ± 1.213
LysMet: 2.758 ± 0.417
LysAsn: 4.481 ± 0.665
LysPro: 3.102 ± 1.3
LysGln: 2.068 ± 1.312
LysArg: 3.447 ± 1.832
LysSer: 3.792 ± 1.038
LysThr: 6.549 ± 2.319
LysVal: 7.239 ± 1.843
LysTrp: 1.724 ± 0.317
LysTyr: 2.758 ± 0.516
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.826 ± 1.053
LeuCys: 2.413 ± 0.766
LeuAsp: 6.205 ± 1.009
LeuGlu: 3.792 ± 1.479
LeuPhe: 2.413 ± 1.308
LeuGly: 3.447 ± 1.163
LeuHis: 1.034 ± 1.461
LeuIle: 6.549 ± 1.946
LeuLys: 7.584 ± 4.164
LeuLeu: 5.171 ± 1.124
LeuMet: 1.379 ± 0.31
LeuAsn: 7.928 ± 1.76
LeuPro: 2.758 ± 1.261
LeuGln: 1.724 ± 1.339
LeuArg: 4.481 ± 1.14
LeuSer: 5.515 ± 1.236
LeuThr: 3.792 ± 1.038
LeuVal: 4.137 ± 1.181
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.413 ± 0.973
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.068 ± 1.121
MetCys: 1.724 ± 1.409
MetAsp: 2.758 ± 0.516
MetGlu: 2.413 ± 0.766
MetPhe: 0.689 ± 0.72
MetGly: 0.689 ± 0.36
MetHis: 0.345 ± 0.18
MetIle: 2.068 ± 2.923
MetLys: 2.758 ± 0.943
MetLeu: 1.034 ± 0.54
MetMet: 0.689 ± 1.49
MetAsn: 1.379 ± 0.258
MetPro: 1.379 ± 0.258
MetGln: 2.758 ± 0.516
MetArg: 1.724 ± 0.9
MetSer: 0.689 ± 0.209
MetThr: 2.413 ± 0.382
MetVal: 1.379 ± 0.258
MetTrp: 1.379 ± 1.39
MetTyr: 1.379 ± 0.258
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.413 ± 1.308
AsnCys: 1.379 ± 0.72
AsnAsp: 2.413 ± 0.973
AsnGlu: 2.413 ± 0.766
AsnPhe: 2.068 ± 0.59
AsnGly: 3.792 ± 1.007
AsnHis: 0.689 ± 0.36
AsnIle: 5.171 ± 1.255
AsnLys: 4.481 ± 1.507
AsnLeu: 3.447 ± 1.046
AsnMet: 2.413 ± 0.516
AsnAsn: 3.792 ± 1.886
AsnPro: 4.137 ± 1.285
AsnGln: 1.724 ± 0.418
AsnArg: 3.447 ± 0.521
AsnSer: 5.171 ± 1.181
AsnThr: 4.826 ± 3.474
AsnVal: 6.549 ± 1.042
AsnTrp: 1.724 ± 0.317
AsnTyr: 2.413 ± 0.766
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.413 ± 0.516
ProCys: 1.034 ± 0.561
ProAsp: 1.034 ± 0.151
ProGlu: 2.413 ± 0.382
ProPhe: 1.379 ± 1.801
ProGly: 2.413 ± 0.516
ProHis: 0.345 ± 0.18
ProIle: 4.481 ± 1.14
ProLys: 3.102 ± 0.72
ProLeu: 3.792 ± 1.038
ProMet: 0.689 ± 0.209
ProAsn: 3.102 ± 0.72
ProPro: 1.724 ± 3.007
ProGln: 1.034 ± 0.561
ProArg: 1.034 ± 0.561
ProSer: 2.758 ± 1.543
ProThr: 2.413 ± 1.388
ProVal: 4.481 ± 2.6
ProTrp: 1.379 ± 1.455
ProTyr: 2.758 ± 1.271
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.379 ± 0.418
GlnCys: 0.689 ± 0.36
GlnAsp: 2.413 ± 0.766
GlnGlu: 2.758 ± 0.516
GlnPhe: 0.689 ± 0.209
GlnGly: 2.413 ± 0.766
GlnHis: 1.034 ± 0.54
GlnIle: 3.792 ± 2.906
GlnLys: 1.724 ± 0.418
GlnLeu: 2.758 ± 1.271
GlnMet: 1.034 ± 1.461
GlnAsn: 1.724 ± 0.766
GlnPro: 2.068 ± 0.302
GlnGln: 1.724 ± 2.946
GlnArg: 1.379 ± 0.258
GlnSer: 0.689 ± 0.209
GlnThr: 2.068 ± 0.628
GlnVal: 1.724 ± 0.317
GlnTrp: 0.345 ± 0.18
GlnTyr: 1.034 ± 0.151
GlnXaa: 0.345 ± 0.18
Arg
ArgAla: 2.068 ± 0.628
ArgCys: 0.689 ± 0.209
ArgAsp: 3.102 ± 0.453
ArgGlu: 1.724 ± 0.9
ArgPhe: 1.034 ± 0.151
ArgGly: 1.724 ± 0.418
ArgHis: 0.689 ± 0.36
ArgIle: 2.758 ± 0.451
ArgLys: 5.171 ± 1.255
ArgLeu: 2.413 ± 1.24
ArgMet: 0.689 ± 0.36
ArgAsn: 4.137 ± 2.623
ArgPro: 2.068 ± 1.315
ArgGln: 0.689 ± 0.209
ArgArg: 2.068 ± 1.312
ArgSer: 2.758 ± 1.543
ArgThr: 4.137 ± 1.285
ArgVal: 3.102 ± 0.453
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.413 ± 0.516
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.137 ± 0.831
SerCys: 0.0 ± 0.0
SerAsp: 3.792 ± 1.007
SerGlu: 4.481 ± 1.135
SerPhe: 2.413 ± 1.388
SerGly: 4.137 ± 0.774
SerHis: 0.345 ± 0.36
SerIle: 4.826 ± 2.579
SerLys: 7.239 ± 1.843
SerLeu: 6.549 ± 2.319
SerMet: 2.758 ± 0.516
SerAsn: 4.481 ± 1.135
SerPro: 2.068 ± 1.517
SerGln: 2.413 ± 0.766
SerArg: 2.068 ± 1.312
SerSer: 4.137 ± 0.774
SerThr: 4.481 ± 1.597
SerVal: 3.792 ± 1.389
SerTrp: 1.034 ± 0.561
SerTyr: 1.724 ± 0.766
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.413 ± 1.479
ThrCys: 0.345 ± 0.36
ThrAsp: 2.758 ± 0.837
ThrGlu: 3.447 ± 2.039
ThrPhe: 3.102 ± 0.72
ThrGly: 3.792 ± 1.389
ThrHis: 1.379 ± 0.258
ThrIle: 4.826 ± 1.464
ThrLys: 4.826 ± 0.738
ThrLeu: 4.826 ± 2.835
ThrMet: 2.413 ± 0.382
ThrAsn: 5.171 ± 0.889
ThrPro: 3.102 ± 1.181
ThrGln: 2.758 ± 1.543
ThrArg: 2.068 ± 0.628
ThrSer: 5.515 ± 2.153
ThrThr: 4.481 ± 2.922
ThrVal: 3.447 ± 1.092
ThrTrp: 0.345 ± 0.18
ThrTyr: 4.137 ± 1.013
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.447 ± 2.039
ValCys: 2.758 ± 0.516
ValAsp: 5.515 ± 1.425
ValGlu: 6.549 ± 1.147
ValPhe: 3.792 ± 1.007
ValGly: 2.413 ± 0.516
ValHis: 0.689 ± 0.72
ValIle: 5.171 ± 0.89
ValLys: 6.205 ± 1.441
ValLeu: 7.239 ± 0.571
ValMet: 0.689 ± 0.36
ValAsn: 3.792 ± 0.635
ValPro: 3.792 ± 1.389
ValGln: 2.413 ± 0.973
ValArg: 1.724 ± 0.418
ValSer: 5.86 ± 1.597
ValThr: 3.102 ± 1.682
ValVal: 5.515 ± 1.236
ValTrp: 1.034 ± 0.151
ValTyr: 3.102 ± 1.121
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.345 ± 0.36
TrpCys: 0.689 ± 0.36
TrpAsp: 0.689 ± 0.209
TrpGlu: 0.689 ± 0.36
TrpPhe: 0.689 ± 0.36
TrpGly: 0.689 ± 0.72
TrpHis: 0.0 ± 0.0
TrpIle: 0.345 ± 0.18
TrpLys: 2.413 ± 1.308
TrpLeu: 2.068 ± 1.315
TrpMet: 1.034 ± 1.461
TrpAsn: 1.034 ± 0.561
TrpPro: 0.689 ± 0.36
TrpGln: 0.0 ± 0.0
TrpArg: 1.034 ± 0.151
TrpSer: 1.034 ± 0.151
TrpThr: 0.689 ± 0.209
TrpVal: 0.689 ± 0.209
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.345 ± 0.18
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.379 ± 0.258
TyrCys: 0.345 ± 0.36
TyrAsp: 3.102 ± 0.72
TyrGlu: 2.413 ± 1.26
TyrPhe: 0.0 ± 0.0
TyrGly: 1.379 ± 0.258
TyrHis: 0.689 ± 0.36
TyrIle: 3.447 ± 1.092
TyrLys: 3.447 ± 0.521
TyrLeu: 3.792 ± 0.592
TyrMet: 1.379 ± 0.418
TyrAsn: 1.034 ± 1.461
TyrPro: 1.379 ± 0.418
TyrGln: 1.379 ± 0.258
TyrArg: 2.413 ± 1.479
TyrSer: 3.102 ± 1.397
TyrThr: 1.724 ± 0.418
TyrVal: 4.137 ± 0.603
TyrTrp: 0.345 ± 0.36
TyrTyr: 2.413 ± 0.516
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.345 ± 0.18
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.345 ± 0.18
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2902 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
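A protein-level bootstrap of this kind can be sketched in Python as follows. The sketch resamples whole proteins with replacement 100 times, recomputes the per mille frequency of one dipeptide on each replicate, and reports the standard deviation across replicates. Taking the standard deviation as the error, and the helper names `per_mille` and `bootstrap_error`, are assumptions made for illustration; the page only states that bootstrapping (x100) at the protein level was used.

```python
import random
import statistics


def per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille, counted within proteins."""
    count = total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        count += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency estimate when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so that repeated runs give the same result
    estimates = []
    for _ in range(n_rounds):
        # Resample at the protein level: draw as many proteins as the proteome contains.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation over replicates.
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAA", "CCCC", "ACAC"]  # made-up sequences, not ABPV data
    print(bootstrap_error(toy_proteome, "AA"))
```

With only three proteins, as here, each replicate is dominated by which proteins happen to be drawn, which is consistent with several errors in the table being comparable to or larger than the values themselves.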

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski