Amino acid dipeptide frequency for Persea americana chrysovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
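As a minimal sketch of how such a frequency could be computed: overlapping pairs of adjacent residues are counted and divided by the total number of pairs. The exact procedure used by the database is not described on this page, so the function name `dipeptide_frequencies`, the pooling over proteins, and the toy sequences (one-letter codes) are illustrative assumptions.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for one-letter protein sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein
    (a pair never spans two proteins) and pooled over all proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: pairs are MK, KK, KA from the first sequence and AK, KK from the second.
freqs = dipeptide_frequencies(["MKKA", "AKK"])
print(freqs["KK"])  # KK accounts for 2 of the 5 pairs
```

This normalisation appears consistent with the table: 3 proteins with 2976 residues give 2973 adjacent pairs, and 1/2973 is about 0.336‰, the smallest non-zero value listed.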

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins randomly and with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
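The expected value and the over/under marking can be sketched as follows. The page does not state the exact rule behind the colouring, so a plain comparison against the uniform expectation of 1/400 is assumed here; `representation` is a hypothetical helper name, and the three example values are taken from the table.

```python
STANDARD_DIPEPTIDES = 20 * 20                      # ordered pairs of the 20 standard residues
EXPECTED_PER_MILLE = 1000.0 / STANDARD_DIPEPTIDES  # 2.5 per mille, i.e. 0.25%


def representation(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # would be marked red
    if freq_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"


examples = {"GluGlu": 12.101, "AlaAla": 3.361, "HisHis": 0.0}
labels = {pair: representation(value) for pair, value in examples.items()}
print(labels)
```

With 441 possibilities (Xaa included) the uniform expectation would instead be 1000/441, roughly 2.27‰.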

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
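For illustration, the unit conversion is a single multiplication. The helper names below are hypothetical, and the GluGlu value is copied from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


glu_glu = 12.101  # GluGlu entry, in per mille
fraction = per_mille_to_fraction(glu_glu)
percent = per_mille_to_percent(glu_glu)
print(fraction, percent)
```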

For more information, see the sequence space article on Wikipedia.

Residue order (row label: first residue; within each row: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.361 ± 0.681
AlaCys: 1.008 ± 0.065
AlaAsp: 2.689 ± 0.556
AlaGlu: 3.025 ± 1.022
AlaPhe: 2.689 ± 0.94
AlaGly: 4.706 ± 0.909
AlaHis: 0.672 ± 0.283
AlaIle: 3.025 ± 0.337
AlaLys: 5.042 ± 0.323
AlaLeu: 2.689 ± 0.466
AlaMet: 3.361 ± 1.03
AlaAsn: 2.017 ± 0.603
AlaPro: 1.008 ± 0.803
AlaGln: 1.345 ± 0.452
AlaArg: 4.706 ± 1.206
AlaSer: 2.353 ± 0.267
AlaThr: 2.353 ± 0.874
AlaVal: 2.689 ± 0.812
AlaTrp: 0.336 ± 0.262
AlaTyr: 2.353 ± 0.692
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.008 ± 0.498
CysCys: 0.672 ± 0.226
CysAsp: 2.353 ± 0.874
CysGlu: 4.034 ± 1.552
CysPhe: 0.336 ± 0.262
CysGly: 1.345 ± 0.233
CysHis: 0.0 ± 0.0
CysIle: 0.672 ± 0.535
CysLys: 0.672 ± 0.535
CysLeu: 1.008 ± 0.803
CysMet: 0.0 ± 0.0
CysAsn: 0.336 ± 0.262
CysPro: 0.336 ± 0.268
CysGln: 0.336 ± 0.262
CysArg: 1.008 ± 0.468
CysSer: 1.345 ± 0.624
CysThr: 1.008 ± 0.505
CysVal: 2.017 ± 0.603
CysTrp: 0.336 ± 0.289
CysTyr: 0.336 ± 0.268
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.361 ± 0.211
AspCys: 0.672 ± 0.226
AspAsp: 3.697 ± 0.054
AspGlu: 5.378 ± 1.388
AspPhe: 0.672 ± 0.535
AspGly: 5.042 ± 1.311
AspHis: 0.0 ± 0.0
AspIle: 4.706 ± 0.905
AspLys: 3.361 ± 0.767
AspLeu: 2.017 ± 0.37
AspMet: 2.353 ± 0.417
AspAsn: 2.353 ± 0.615
AspPro: 1.345 ± 0.233
AspGln: 0.336 ± 0.262
AspArg: 4.37 ± 0.545
AspSer: 4.706 ± 0.532
AspThr: 0.672 ± 0.226
AspVal: 6.05 ± 0.985
AspTrp: 1.681 ± 0.163
AspTyr: 1.345 ± 0.81
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.706 ± 0.535
GluCys: 1.681 ± 0.362
GluAsp: 5.378 ± 0.475
GluGlu: 12.101 ± 3.775
GluPhe: 3.025 ± 0.877
GluGly: 9.412 ± 0.383
GluHis: 0.672 ± 0.226
GluIle: 6.723 ± 1.454
GluLys: 9.748 ± 2.326
GluLeu: 5.714 ± 1.195
GluMet: 5.042 ± 0.323
GluAsn: 3.697 ± 0.84
GluPro: 1.008 ± 0.787
GluGln: 2.353 ± 0.62
GluArg: 11.429 ± 5.722
GluSer: 5.378 ± 0.475
GluThr: 5.714 ± 0.961
GluVal: 10.42 ± 2.362
GluTrp: 4.034 ± 0.723
GluTyr: 2.353 ± 0.213
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.672 ± 0.226
PheAsp: 2.353 ± 0.389
PheGlu: 3.025 ± 0.73
PhePhe: 0.336 ± 0.268
PheGly: 3.361 ± 0.681
PheHis: 0.336 ± 0.268
PheIle: 0.672 ± 0.283
PheLys: 1.681 ± 0.726
PheLeu: 2.353 ± 0.82
PheMet: 0.672 ± 0.312
PheAsn: 0.672 ± 0.535
PhePro: 1.008 ± 0.41
PheGln: 0.336 ± 0.262
PheArg: 1.345 ± 0.353
PheSer: 2.017 ± 0.466
PheThr: 1.345 ± 0.233
PheVal: 2.353 ± 0.997
PheTrp: 0.336 ± 0.268
PheTyr: 0.336 ± 0.289
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.37 ± 0.95
GlyCys: 1.008 ± 0.541
GlyAsp: 3.025 ± 0.194
GlyGlu: 7.731 ± 1.208
GlyPhe: 1.681 ± 0.498
GlyGly: 4.706 ± 0.909
GlyHis: 0.672 ± 0.283
GlyIle: 4.37 ± 1.387
GlyLys: 5.042 ± 1.123
GlyLeu: 8.067 ± 1.525
GlyMet: 2.689 ± 0.828
GlyAsn: 4.034 ± 1.648
GlyPro: 1.008 ± 0.498
GlyGln: 2.017 ± 0.936
GlyArg: 4.706 ± 1.198
GlySer: 7.059 ± 1.194
GlyThr: 3.025 ± 1.566
GlyVal: 7.395 ± 1.264
GlyTrp: 1.681 ± 0.498
GlyTyr: 2.353 ± 1.171
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.345 ± 0.233
HisCys: 0.672 ± 0.283
HisAsp: 0.336 ± 0.268
HisGlu: 1.345 ± 0.257
HisPhe: 0.336 ± 0.262
HisGly: 0.336 ± 0.268
HisHis: 0.0 ± 0.0
HisIle: 1.008 ± 0.541
HisLys: 1.345 ± 0.624
HisLeu: 0.336 ± 0.262
HisMet: 1.008 ± 0.42
HisAsn: 1.008 ± 0.803
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.672 ± 0.535
HisThr: 0.336 ± 0.268
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.336 ± 0.268
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.681 ± 0.642
IleCys: 2.017 ± 0.764
IleAsp: 4.034 ± 1.059
IleGlu: 5.378 ± 1.112
IlePhe: 1.681 ± 0.607
IleGly: 5.042 ± 1.621
IleHis: 0.336 ± 0.262
IleIle: 3.361 ± 1.457
IleLys: 5.714 ± 1.625
IleLeu: 5.042 ± 0.208
IleMet: 2.353 ± 0.521
IleAsn: 3.025 ± 0.337
IlePro: 3.361 ± 1.407
IleGln: 1.008 ± 0.468
IleArg: 3.025 ± 0.337
IleSer: 6.387 ± 0.304
IleThr: 1.008 ± 0.065
IleVal: 3.361 ± 0.764
IleTrp: 1.345 ± 0.65
IleTyr: 2.689 ± 0.357
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.361 ± 1.131
LysCys: 1.008 ± 0.468
LysAsp: 3.025 ± 0.194
LysGlu: 10.42 ± 3.961
LysPhe: 1.345 ± 0.353
LysGly: 3.697 ± 0.859
LysHis: 0.336 ± 0.262
LysIle: 5.714 ± 0.826
LysLys: 8.403 ± 0.573
LysLeu: 5.378 ± 0.919
LysMet: 3.697 ± 0.538
LysAsn: 2.017 ± 1.236
LysPro: 2.689 ± 0.357
LysGln: 1.345 ± 0.667
LysArg: 6.05 ± 1.229
LysSer: 4.37 ± 0.972
LysThr: 3.025 ± 1.445
LysVal: 5.378 ± 0.817
LysTrp: 2.017 ± 0.84
LysTyr: 2.353 ± 0.267
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.714 ± 0.898
LeuCys: 1.681 ± 0.642
LeuAsp: 3.697 ± 1.173
LeuGlu: 7.731 ± 1.138
LeuPhe: 2.689 ± 0.466
LeuGly: 9.412 ± 1.069
LeuHis: 1.008 ± 0.42
LeuIle: 3.697 ± 0.856
LeuLys: 3.361 ± 1.019
LeuLeu: 3.025 ± 0.473
LeuMet: 3.025 ± 0.305
LeuAsn: 2.017 ± 0.361
LeuPro: 2.689 ± 0.514
LeuGln: 0.336 ± 0.289
LeuArg: 5.042 ± 1.142
LeuSer: 3.697 ± 0.442
LeuThr: 2.353 ± 1.274
LeuVal: 5.714 ± 0.743
LeuTrp: 0.672 ± 0.226
LeuTyr: 3.361 ± 0.995
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.025 ± 0.614
MetCys: 1.345 ± 0.257
MetAsp: 0.672 ± 0.578
MetGlu: 4.37 ± 0.676
MetPhe: 0.0 ± 0.0
MetGly: 2.689 ± 0.735
MetHis: 0.672 ± 0.226
MetIle: 2.017 ± 0.525
MetLys: 4.034 ± 0.314
MetLeu: 3.361 ± 0.481
MetMet: 1.681 ± 0.362
MetAsn: 1.681 ± 0.315
MetPro: 1.681 ± 0.362
MetGln: 0.672 ± 0.226
MetArg: 3.697 ± 0.999
MetSer: 3.025 ± 0.613
MetThr: 0.672 ± 0.524
MetVal: 1.681 ± 0.619
MetTrp: 1.008 ± 0.42
MetTyr: 2.017 ± 0.764
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.017 ± 0.361
AsnCys: 0.336 ± 0.289
AsnAsp: 1.345 ± 0.667
AsnGlu: 4.034 ± 1.117
AsnPhe: 1.345 ± 0.233
AsnGly: 3.025 ± 0.933
AsnHis: 1.681 ± 0.642
AsnIle: 2.353 ± 1.054
AsnLys: 3.361 ± 0.481
AsnLeu: 4.706 ± 0.904
AsnMet: 1.008 ± 0.065
AsnAsn: 0.672 ± 0.524
AsnPro: 0.672 ± 0.524
AsnGln: 1.008 ± 0.42
AsnArg: 3.025 ± 0.614
AsnSer: 2.017 ± 0.129
AsnThr: 1.345 ± 0.233
AsnVal: 3.361 ± 0.326
AsnTrp: 2.353 ± 0.417
AsnTyr: 0.672 ± 0.226
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.017 ± 0.84
ProCys: 0.672 ± 0.283
ProAsp: 2.017 ± 0.129
ProGlu: 2.689 ± 0.466
ProPhe: 1.008 ± 0.468
ProGly: 1.008 ± 0.803
ProHis: 0.336 ± 0.262
ProIle: 2.017 ± 0.525
ProLys: 1.681 ± 0.515
ProLeu: 1.345 ± 0.65
ProMet: 1.681 ± 0.163
ProAsn: 1.345 ± 0.353
ProPro: 1.681 ± 0.642
ProGln: 1.345 ± 1.157
ProArg: 0.0 ± 0.0
ProSer: 2.689 ± 0.514
ProThr: 1.345 ± 0.624
ProVal: 2.689 ± 1.189
ProTrp: 0.336 ± 0.262
ProTyr: 0.336 ± 0.268
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.672 ± 0.524
GlnCys: 0.336 ± 0.262
GlnAsp: 0.672 ± 0.312
GlnGlu: 2.689 ± 0.94
GlnPhe: 0.336 ± 0.268
GlnGly: 0.672 ± 0.226
GlnHis: 0.336 ± 0.268
GlnIle: 1.008 ± 0.468
GlnLys: 2.017 ± 0.936
GlnLeu: 1.008 ± 0.42
GlnMet: 1.008 ± 0.541
GlnAsn: 1.345 ± 0.257
GlnPro: 0.672 ± 0.312
GlnGln: 0.0 ± 0.0
GlnArg: 1.345 ± 0.65
GlnSer: 2.017 ± 1.159
GlnThr: 0.336 ± 0.262
GlnVal: 1.345 ± 0.353
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.672 ± 0.535
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.361 ± 0.764
ArgCys: 1.681 ± 0.163
ArgAsp: 4.706 ± 1.372
ArgGlu: 12.437 ± 4.473
ArgPhe: 3.361 ± 1.238
ArgGly: 4.37 ± 1.002
ArgHis: 0.672 ± 0.226
ArgIle: 6.387 ± 0.867
ArgLys: 3.697 ± 0.999
ArgLeu: 6.723 ± 0.173
ArgMet: 0.672 ± 0.535
ArgAsn: 3.361 ± 0.292
ArgPro: 0.672 ± 0.226
ArgGln: 1.681 ± 0.902
ArgArg: 6.723 ± 1.083
ArgSer: 4.706 ± 0.43
ArgThr: 2.353 ± 0.831
ArgVal: 3.697 ± 1.605
ArgTrp: 2.017 ± 0.37
ArgTyr: 2.017 ± 0.361
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.353 ± 0.615
SerCys: 0.672 ± 0.524
SerAsp: 3.361 ± 0.725
SerGlu: 8.403 ± 0.573
SerPhe: 1.345 ± 0.452
SerGly: 7.395 ± 1.664
SerHis: 0.0 ± 0.0
SerIle: 5.714 ± 1.397
SerLys: 5.042 ± 0.666
SerLeu: 5.378 ± 1.5
SerMet: 4.034 ± 0.253
SerAsn: 4.37 ± 1.495
SerPro: 1.008 ± 0.468
SerGln: 0.336 ± 0.289
SerArg: 6.05 ± 1.14
SerSer: 6.387 ± 1.6
SerThr: 2.017 ± 0.361
SerVal: 6.387 ± 1.381
SerTrp: 1.008 ± 0.065
SerTyr: 1.681 ± 0.619
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.681 ± 0.362
ThrCys: 0.336 ± 0.262
ThrAsp: 0.336 ± 0.268
ThrGlu: 3.697 ± 0.404
ThrPhe: 0.672 ± 0.578
ThrGly: 1.345 ± 0.452
ThrHis: 0.336 ± 0.289
ThrIle: 1.008 ± 0.065
ThrLys: 3.025 ± 0.305
ThrLeu: 2.689 ± 0.893
ThrMet: 1.681 ± 0.642
ThrAsn: 1.008 ± 0.505
ThrPro: 2.353 ± 1.039
ThrGln: 1.008 ± 0.468
ThrArg: 3.697 ± 0.77
ThrSer: 3.025 ± 0.665
ThrThr: 1.681 ± 0.619
ThrVal: 2.689 ± 0.357
ThrTrp: 1.008 ± 0.468
ThrTyr: 3.361 ± 0.565
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.034 ± 0.933
ValCys: 1.008 ± 0.498
ValAsp: 6.387 ± 2.122
ValGlu: 7.731 ± 0.236
ValPhe: 1.345 ± 0.233
ValGly: 5.714 ± 1.869
ValHis: 1.681 ± 0.963
ValIle: 4.37 ± 0.518
ValLys: 6.05 ± 0.854
ValLeu: 5.714 ± 1.371
ValMet: 1.681 ± 0.498
ValAsn: 1.681 ± 0.362
ValPro: 3.361 ± 0.938
ValGln: 1.681 ± 0.515
ValArg: 6.387 ± 0.304
ValSer: 5.714 ± 2.225
ValThr: 3.361 ± 0.89
ValVal: 7.059 ± 1.394
ValTrp: 1.008 ± 0.065
ValTyr: 1.681 ± 1.053
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.008 ± 0.468
TrpCys: 0.672 ± 0.226
TrpAsp: 1.008 ± 0.41
TrpGlu: 1.345 ± 0.452
TrpPhe: 0.672 ± 0.578
TrpGly: 1.008 ± 0.468
TrpHis: 0.336 ± 0.262
TrpIle: 1.681 ± 0.498
TrpLys: 1.681 ± 0.315
TrpLeu: 2.689 ± 0.735
TrpMet: 1.008 ± 0.787
TrpAsn: 0.336 ± 0.262
TrpPro: 0.672 ± 0.226
TrpGln: 1.008 ± 0.41
TrpArg: 1.008 ± 0.42
TrpSer: 2.689 ± 0.514
TrpThr: 0.672 ± 0.226
TrpVal: 1.681 ± 0.619
TrpTrp: 0.336 ± 0.289
TrpTyr: 1.008 ± 0.42
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.353 ± 0.615
TyrCys: 1.008 ± 0.065
TyrAsp: 3.025 ± 1.196
TyrGlu: 3.025 ± 1.405
TyrPhe: 0.336 ± 0.289
TyrGly: 2.353 ± 0.692
TyrHis: 0.336 ± 0.268
TyrIle: 1.681 ± 0.926
TyrLys: 0.672 ± 0.226
TyrLeu: 1.681 ± 0.315
TyrMet: 1.008 ± 0.803
TyrAsn: 3.361 ± 0.292
TyrPro: 1.008 ± 0.505
TyrGln: 0.336 ± 0.268
TyrArg: 1.681 ± 0.163
TyrSer: 2.689 ± 0.417
TyrThr: 2.017 ± 0.466
TyrVal: 1.681 ± 0.315
TyrTrp: 1.008 ± 0.41
TyrTyr: 0.672 ± 0.226
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2976 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
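A protein-level bootstrap of this kind could look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported. Whether the ± values on this page are standard deviations is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_frequency(resample, pair))
    return statistics.stdev(estimates)


proteins = ["MKKAKK", "AAKA", "GKKG"]  # toy one-letter sequences
error = bootstrap_error(proteins, "KK")
print(pair_frequency(proteins, "KK"), "±", error)
```

With only 3 proteins, as here, the resamples contain many repeated proteins, so the error estimates can be large relative to the frequencies themselves.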

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski