Amino acid dipepetide frequency for Ageratum yellow vein virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.287AlaAla: 8.287 ± 3.449
1.842AlaCys: 1.842 ± 1.061
0.921AlaAsp: 0.921 ± 0.741
1.842AlaGlu: 1.842 ± 1.343
0.921AlaPhe: 0.921 ± 1.052
0.921AlaGly: 0.921 ± 0.671
0.921AlaHis: 0.921 ± 0.838
2.762AlaIle: 2.762 ± 1.314
3.683AlaLys: 3.683 ± 0.987
6.446AlaLeu: 6.446 ± 1.997
2.762AlaMet: 2.762 ± 1.101
1.842AlaAsn: 1.842 ± 0.983
2.762AlaPro: 2.762 ± 0.854
4.604AlaGln: 4.604 ± 2.645
3.683AlaArg: 3.683 ± 2.017
4.604AlaSer: 4.604 ± 1.934
2.762AlaThr: 2.762 ± 1.584
0.921AlaVal: 0.921 ± 0.838
0.921AlaTrp: 0.921 ± 0.671
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.921CysAla: 0.921 ± 1.052
1.842CysCys: 1.842 ± 1.676
0.921CysAsp: 0.921 ± 0.918
1.842CysGlu: 1.842 ± 1.061
0.921CysPhe: 0.921 ± 0.918
1.842CysGly: 1.842 ± 0.983
0.921CysHis: 0.921 ± 0.838
0.921CysIle: 0.921 ± 0.998
0.921CysLys: 0.921 ± 0.741
0.0CysLeu: 0.0 ± 0.0
1.842CysMet: 1.842 ± 1.222
1.842CysAsn: 1.842 ± 0.983
4.604CysPro: 4.604 ± 2.342
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.921CysSer: 0.921 ± 1.052
1.842CysThr: 1.842 ± 1.177
1.842CysVal: 1.842 ± 1.483
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.842AspAla: 1.842 ± 1.343
0.921AspCys: 0.921 ± 1.052
0.921AspAsp: 0.921 ± 0.671
2.762AspGlu: 2.762 ± 0.873
1.842AspPhe: 1.842 ± 1.061
2.762AspGly: 2.762 ± 2.014
0.921AspHis: 0.921 ± 0.918
3.683AspIle: 3.683 ± 1.235
0.921AspLys: 0.921 ± 0.671
8.287AspLeu: 8.287 ± 4.013
0.921AspMet: 0.921 ± 0.838
3.683AspAsn: 3.683 ± 2.354
1.842AspPro: 1.842 ± 0.961
2.762AspGln: 2.762 ± 1.157
2.762AspArg: 2.762 ± 1.272
1.842AspSer: 1.842 ± 1.177
2.762AspThr: 2.762 ± 1.383
4.604AspVal: 4.604 ± 1.202
1.842AspTrp: 1.842 ± 1.343
0.921AspTyr: 0.921 ± 0.671
0.0AspXaa: 0.0 ± 0.0
Glu
6.446GluAla: 6.446 ± 1.912
0.0GluCys: 0.0 ± 0.0
0.921GluAsp: 0.921 ± 0.998
7.366GluGlu: 7.366 ± 3.534
2.762GluPhe: 2.762 ± 1.43
2.762GluGly: 2.762 ± 0.873
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
3.683GluLys: 3.683 ± 2.685
4.604GluLeu: 4.604 ± 1.449
0.0GluMet: 0.0 ± 0.0
3.683GluAsn: 3.683 ± 1.235
1.842GluPro: 1.842 ± 1.028
2.762GluGln: 2.762 ± 1.585
0.921GluArg: 0.921 ± 0.918
2.762GluSer: 2.762 ± 1.184
0.921GluThr: 0.921 ± 0.838
1.842GluVal: 1.842 ± 1.036
2.762GluTrp: 2.762 ± 1.314
2.762GluTyr: 2.762 ± 1.357
0.0GluXaa: 0.0 ± 0.0
Phe
0.921PheAla: 0.921 ± 0.671
0.921PheCys: 0.921 ± 0.741
2.762PheAsp: 2.762 ± 1.15
0.0PheGlu: 0.0 ± 0.0
0.921PhePhe: 0.921 ± 0.741
1.842PheGly: 1.842 ± 1.177
2.762PheHis: 2.762 ± 1.43
0.921PheIle: 0.921 ± 0.671
2.762PheLys: 2.762 ± 1.357
6.446PheLeu: 6.446 ± 1.591
1.842PheMet: 1.842 ± 0.697
3.683PheAsn: 3.683 ± 2.709
0.921PhePro: 0.921 ± 0.838
5.525PheGln: 5.525 ± 2.275
4.604PheArg: 4.604 ± 1.94
2.762PheSer: 2.762 ± 2.106
0.0PheThr: 0.0 ± 0.0
1.842PheVal: 1.842 ± 1.343
0.0PheTrp: 0.0 ± 0.0
0.921PheTyr: 0.921 ± 0.741
0.0PheXaa: 0.0 ± 0.0
Gly
2.762GlyAla: 2.762 ± 2.014
1.842GlyCys: 1.842 ± 1.177
0.921GlyAsp: 0.921 ± 0.671
1.842GlyGlu: 1.842 ± 0.944
1.842GlyPhe: 1.842 ± 1.308
2.762GlyGly: 2.762 ± 1.15
1.842GlyHis: 1.842 ± 0.961
4.604GlyIle: 4.604 ± 2.143
5.525GlyLys: 5.525 ± 2.09
1.842GlyLeu: 1.842 ± 1.233
0.0GlyMet: 0.0 ± 0.0
1.842GlyAsn: 1.842 ± 1.177
2.762GlyPro: 2.762 ± 1.15
2.762GlyGln: 2.762 ± 1.272
0.921GlyArg: 0.921 ± 0.671
0.921GlySer: 0.921 ± 0.671
3.683GlyThr: 3.683 ± 1.17
3.683GlyVal: 3.683 ± 1.873
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.842HisAla: 1.842 ± 0.697
2.762HisCys: 2.762 ± 2.221
2.762HisAsp: 2.762 ± 1.258
0.921HisGlu: 0.921 ± 0.918
2.762HisPhe: 2.762 ± 1.43
1.842HisGly: 1.842 ± 1.308
1.842HisHis: 1.842 ± 1.395
0.921HisIle: 0.921 ± 0.918
0.0HisLys: 0.0 ± 0.0
1.842HisLeu: 1.842 ± 1.343
0.0HisMet: 0.0 ± 0.0
4.604HisAsn: 4.604 ± 1.378
2.762HisPro: 2.762 ± 1.43
3.683HisGln: 3.683 ± 1.521
2.762HisArg: 2.762 ± 2.106
1.842HisSer: 1.842 ± 0.961
2.762HisThr: 2.762 ± 1.773
2.762HisVal: 2.762 ± 1.258
0.0HisTrp: 0.0 ± 0.0
0.921HisTyr: 0.921 ± 0.671
0.0HisXaa: 0.0 ± 0.0
Ile
0.921IleAla: 0.921 ± 1.052
0.0IleCys: 0.0 ± 0.0
2.762IleAsp: 2.762 ± 1.314
1.842IleGlu: 1.842 ± 1.343
5.525IlePhe: 5.525 ± 2.275
0.921IleGly: 0.921 ± 0.838
0.921IleHis: 0.921 ± 0.918
3.683IleIle: 3.683 ± 2.613
8.287IleLys: 8.287 ± 2.074
0.0IleLeu: 0.0 ± 0.0
0.921IleMet: 0.921 ± 0.892
1.842IleAsn: 1.842 ± 1.061
1.842IlePro: 1.842 ± 1.343
5.525IleGln: 5.525 ± 1.668
6.446IleArg: 6.446 ± 1.961
4.604IleSer: 4.604 ± 1.88
3.683IleThr: 3.683 ± 1.383
2.762IleVal: 2.762 ± 1.04
1.842IleTrp: 1.842 ± 1.061
1.842IleTyr: 1.842 ± 1.177
0.0IleXaa: 0.0 ± 0.0
Lys
2.762LysAla: 2.762 ± 1.357
0.921LysCys: 0.921 ± 0.671
1.842LysAsp: 1.842 ± 0.961
6.446LysGlu: 6.446 ± 3.699
2.762LysPhe: 2.762 ± 1.584
2.762LysGly: 2.762 ± 1.15
2.762LysHis: 2.762 ± 1.357
3.683LysIle: 3.683 ± 2.234
1.842LysLys: 1.842 ± 1.028
1.842LysLeu: 1.842 ± 1.343
0.0LysMet: 0.0 ± 0.0
6.446LysAsn: 6.446 ± 1.997
2.762LysPro: 2.762 ± 1.04
0.0LysGln: 0.0 ± 0.0
3.683LysArg: 3.683 ± 1.914
5.525LysSer: 5.525 ± 1.842
1.842LysThr: 1.842 ± 0.697
5.525LysVal: 5.525 ± 2.014
0.0LysTrp: 0.0 ± 0.0
3.683LysTyr: 3.683 ± 0.941
0.0LysXaa: 0.0 ± 0.0
Leu
1.842LeuAla: 1.842 ± 1.308
3.683LeuCys: 3.683 ± 1.243
4.604LeuAsp: 4.604 ± 2.221
5.525LeuGlu: 5.525 ± 1.83
0.921LeuPhe: 0.921 ± 0.671
3.683LeuGly: 3.683 ± 1.587
2.762LeuHis: 2.762 ± 1.433
4.604LeuIle: 4.604 ± 1.904
3.683LeuLys: 3.683 ± 0.987
4.604LeuLeu: 4.604 ± 1.88
0.0LeuMet: 0.0 ± 0.0
5.525LeuAsn: 5.525 ± 1.482
1.842LeuPro: 1.842 ± 0.983
2.762LeuGln: 2.762 ± 1.519
4.604LeuArg: 4.604 ± 2.171
2.762LeuSer: 2.762 ± 1.433
5.525LeuThr: 5.525 ± 1.948
3.683LeuVal: 3.683 ± 2.251
0.921LeuTrp: 0.921 ± 0.918
3.683LeuTyr: 3.683 ± 1.235
0.0LeuXaa: 0.0 ± 0.0
Met
1.842MetAla: 1.842 ± 0.697
0.0MetCys: 0.0 ± 0.0
3.683MetAsp: 3.683 ± 1.664
0.0MetGlu: 0.0 ± 0.0
0.921MetPhe: 0.921 ± 0.741
1.842MetGly: 1.842 ± 1.036
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.842MetLys: 1.842 ± 1.061
0.921MetLeu: 0.921 ± 0.838
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.921MetPro: 0.921 ± 0.671
0.921MetGln: 0.921 ± 0.998
1.842MetArg: 1.842 ± 2.104
2.762MetSer: 2.762 ± 1.773
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.842MetTrp: 1.842 ± 0.961
2.762MetTyr: 2.762 ± 2.224
0.0MetXaa: 0.0 ± 0.0
Asn
2.762AsnAla: 2.762 ± 1.15
0.0AsnCys: 0.0 ± 0.0
3.683AsnAsp: 3.683 ± 0.987
1.842AsnGlu: 1.842 ± 1.028
1.842AsnPhe: 1.842 ± 0.697
0.921AsnGly: 0.921 ± 0.741
5.525AsnHis: 5.525 ± 2.646
2.762AsnIle: 2.762 ± 1.15
0.0AsnLys: 0.0 ± 0.0
4.604AsnLeu: 4.604 ± 2.641
1.842AsnMet: 1.842 ± 1.35
3.683AsnAsn: 3.683 ± 1.587
4.604AsnPro: 4.604 ± 1.044
2.762AsnGln: 2.762 ± 1.922
2.762AsnArg: 2.762 ± 1.773
8.287AsnSer: 8.287 ± 3.114
7.366AsnThr: 7.366 ± 1.948
3.683AsnVal: 3.683 ± 1.425
0.0AsnTrp: 0.0 ± 0.0
2.762AsnTyr: 2.762 ± 0.873
0.0AsnXaa: 0.0 ± 0.0
Pro
1.842ProAla: 1.842 ± 1.483
1.842ProCys: 1.842 ± 1.028
2.762ProAsp: 2.762 ± 1.339
2.762ProGlu: 2.762 ± 1.157
1.842ProPhe: 1.842 ± 0.944
0.921ProGly: 0.921 ± 0.671
3.683ProHis: 3.683 ± 1.476
3.683ProIle: 3.683 ± 1.965
5.525ProLys: 5.525 ± 3.04
2.762ProLeu: 2.762 ± 1.43
2.762ProMet: 2.762 ± 1.383
2.762ProAsn: 2.762 ± 1.433
2.762ProPro: 2.762 ± 1.203
1.842ProGln: 1.842 ± 1.222
5.525ProArg: 5.525 ± 2.179
3.683ProSer: 3.683 ± 0.953
4.604ProThr: 4.604 ± 1.939
4.604ProVal: 4.604 ± 2.53
0.921ProTrp: 0.921 ± 0.671
1.842ProTyr: 1.842 ± 1.061
0.0ProXaa: 0.0 ± 0.0
Gln
4.604GlnAla: 4.604 ± 1.578
3.683GlnCys: 3.683 ± 2.613
0.921GlnAsp: 0.921 ± 1.052
1.842GlnGlu: 1.842 ± 0.697
4.604GlnPhe: 4.604 ± 2.539
0.921GlnGly: 0.921 ± 0.671
1.842GlnHis: 1.842 ± 2.104
2.762GlnIle: 2.762 ± 1.357
2.762GlnLys: 2.762 ± 1.596
1.842GlnLeu: 1.842 ± 1.258
0.0GlnMet: 0.0 ± 0.0
4.604GlnAsn: 4.604 ± 1.578
2.762GlnPro: 2.762 ± 2.293
2.762GlnGln: 2.762 ± 1.673
1.842GlnArg: 1.842 ± 0.697
3.683GlnSer: 3.683 ± 0.987
4.604GlnThr: 4.604 ± 1.861
4.604GlnVal: 4.604 ± 1.149
0.0GlnTrp: 0.0 ± 0.0
0.921GlnTyr: 0.921 ± 0.741
0.0GlnXaa: 0.0 ± 0.0
Arg
2.762ArgAla: 2.762 ± 1.258
0.921ArgCys: 0.921 ± 0.838
4.604ArgAsp: 4.604 ± 1.786
3.683ArgGlu: 3.683 ± 2.057
1.842ArgPhe: 1.842 ± 1.061
2.762ArgGly: 2.762 ± 0.854
4.604ArgHis: 4.604 ± 1.578
4.604ArgIle: 4.604 ± 1.83
2.762ArgLys: 2.762 ± 1.584
4.604ArgLeu: 4.604 ± 1.88
1.842ArgMet: 1.842 ± 1.2
2.762ArgAsn: 2.762 ± 1.157
6.446ArgPro: 6.446 ± 1.414
0.0ArgGln: 0.0 ± 0.0
11.05ArgArg: 11.05 ± 5.34
4.604ArgSer: 4.604 ± 2.408
4.604ArgThr: 4.604 ± 1.425
4.604ArgVal: 4.604 ± 1.659
0.0ArgTrp: 0.0 ± 0.0
1.842ArgTyr: 1.842 ± 1.028
0.0ArgXaa: 0.0 ± 0.0
Ser
2.762SerAla: 2.762 ± 2.014
0.0SerCys: 0.0 ± 0.0
5.525SerAsp: 5.525 ± 1.842
2.762SerGlu: 2.762 ± 1.43
1.842SerPhe: 1.842 ± 1.343
3.683SerGly: 3.683 ± 1.423
2.762SerHis: 2.762 ± 1.399
3.683SerIle: 3.683 ± 1.383
4.604SerLys: 4.604 ± 2.097
1.842SerLeu: 1.842 ± 0.983
0.921SerMet: 0.921 ± 1.052
2.762SerAsn: 2.762 ± 1.15
6.446SerPro: 6.446 ± 2.872
1.842SerGln: 1.842 ± 0.983
6.446SerArg: 6.446 ± 2.191
10.129SerSer: 10.129 ± 4.395
8.287SerThr: 8.287 ± 4.185
3.683SerVal: 3.683 ± 1.726
0.0SerTrp: 0.0 ± 0.0
4.604SerTyr: 4.604 ± 1.074
0.0SerXaa: 0.0 ± 0.0
Thr
3.683ThrAla: 3.683 ± 1.235
0.921ThrCys: 0.921 ± 1.052
0.0ThrAsp: 0.0 ± 0.0
2.762ThrGlu: 2.762 ± 1.425
0.0ThrPhe: 0.0 ± 0.0
3.683ThrGly: 3.683 ± 1.091
4.604ThrHis: 4.604 ± 1.944
4.604ThrIle: 4.604 ± 1.641
1.842ThrLys: 1.842 ± 0.697
3.683ThrLeu: 3.683 ± 1.243
1.842ThrMet: 1.842 ± 1.036
6.446ThrAsn: 6.446 ± 1.954
5.525ThrPro: 5.525 ± 2.997
2.762ThrGln: 2.762 ± 1.184
2.762ThrArg: 2.762 ± 1.425
7.366ThrSer: 7.366 ± 3.766
1.842ThrThr: 1.842 ± 2.104
6.446ThrVal: 6.446 ± 2.707
2.762ThrTrp: 2.762 ± 1.841
1.842ThrTyr: 1.842 ± 0.961
0.0ThrXaa: 0.0 ± 0.0
Val
0.921ValAla: 0.921 ± 0.671
0.921ValCys: 0.921 ± 0.671
3.683ValAsp: 3.683 ± 1.39
1.842ValGlu: 1.842 ± 1.676
4.604ValPhe: 4.604 ± 0.86
2.762ValGly: 2.762 ± 1.662
0.921ValHis: 0.921 ± 0.838
6.446ValIle: 6.446 ± 1.572
5.525ValLys: 5.525 ± 1.37
5.525ValLeu: 5.525 ± 3.168
1.842ValMet: 1.842 ± 1.483
0.921ValAsn: 0.921 ± 0.998
4.604ValPro: 4.604 ± 1.16
6.446ValGln: 6.446 ± 2.715
2.762ValArg: 2.762 ± 1.662
2.762ValSer: 2.762 ± 1.15
4.604ValThr: 4.604 ± 1.869
0.921ValVal: 0.921 ± 0.741
0.0ValTrp: 0.0 ± 0.0
5.525ValTyr: 5.525 ± 1.993
0.0ValXaa: 0.0 ± 0.0
Trp
1.842TrpAla: 1.842 ± 1.343
0.0TrpCys: 0.0 ± 0.0
1.842TrpAsp: 1.842 ± 1.222
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.921TrpGly: 0.921 ± 0.671
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.921TrpMet: 0.921 ± 0.741
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.921TrpGln: 0.921 ± 0.671
2.762TrpArg: 2.762 ± 1.184
0.0TrpSer: 0.0 ± 0.0
2.762TrpThr: 2.762 ± 1.841
0.921TrpVal: 0.921 ± 0.671
0.0TrpTrp: 0.0 ± 0.0
0.921TrpTyr: 0.921 ± 0.671
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.842TyrAla: 1.842 ± 1.483
0.921TyrCys: 0.921 ± 0.838
2.762TyrAsp: 2.762 ± 2.224
0.921TyrGlu: 0.921 ± 0.741
3.683TyrPhe: 3.683 ± 1.678
1.842TyrGly: 1.842 ± 0.697
0.0TyrHis: 0.0 ± 0.0
1.842TyrIle: 1.842 ± 1.343
0.921TyrLys: 0.921 ± 0.671
5.525TyrLeu: 5.525 ± 1.697
0.921TyrMet: 0.921 ± 1.005
2.762TyrAsn: 2.762 ± 1.04
0.921TyrPro: 0.921 ± 0.671
0.921TyrGln: 0.921 ± 0.741
2.762TyrArg: 2.762 ± 2.224
2.762TyrSer: 2.762 ± 1.43
0.921TyrThr: 0.921 ± 0.918
4.604TyrVal: 4.604 ± 1.248
0.0TyrTrp: 0.0 ± 0.0
0.921TyrTyr: 0.921 ± 1.052
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1087 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski