Amino acid dipeptide frequency for Groundnut chlorotic fan-spot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
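The calculation described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used to produce the table: the function name `dipeptide_permille`, the use of overlapping windows, the rule that pairs never span two proteins, and the mapping of every non-standard letter to X (Xaa) are all assumptions, and the two input sequences are made up.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumptions (not confirmed by the source): overlapping windows,
    no pairs across protein boundaries, non-standard letters become X.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs within this protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not taken from the proteome.
freqs = dipeptide_permille(["MKLSLS", "ASLK"])
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %
```

With this toy input there are 8 dipeptides in total, two of which are LS, so `freqs["LS"]` is 250‰, far above the uniform expectation of 2.5‰ simply because the sample is tiny.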

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
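As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰ (0.25%). The helper names `permille_to_fraction` and `classify` are hypothetical, and the simple greater-than/less-than rule is an assumption; the page does not state the exact criterion behind its red and blue marking.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille = 0.25 % per standard dipeptide


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: LeuSer (13.999) and AlaGly (0.933).
leu_ser_fraction = permille_to_fraction(13.999)
labels = (classify(13.999), classify(0.933))
```

Here LeuSer at 13.999‰ corresponds to a fraction of about 0.014 and falls above the uniform expectation, while AlaGly at 0.933‰ falls below it.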

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± estimated error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.1 ± 1.357
AlaCys: 1.167 ± 0.266
AlaAsp: 2.8 ± 0.689
AlaGlu: 3.033 ± 1.223
AlaPhe: 1.867 ± 0.629
AlaGly: 0.933 ± 0.181
AlaHis: 0.0 ± 0.0
AlaIle: 3.033 ± 1.228
AlaLys: 2.333 ± 0.868
AlaLeu: 4.2 ± 1.925
AlaMet: 1.4 ± 0.328
AlaAsn: 1.867 ± 0.254
AlaPro: 0.933 ± 0.434
AlaGln: 0.467 ± 0.242
AlaArg: 1.867 ± 0.38
AlaSer: 3.5 ± 0.847
AlaThr: 1.633 ± 0.286
AlaVal: 1.867 ± 0.806
AlaTrp: 0.7 ± 0.452
AlaTyr: 1.633 ± 1.1
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.7 ± 0.42
CysCys: 0.933 ± 0.484
CysAsp: 1.167 ± 0.582
CysGlu: 1.4 ± 0.84
CysPhe: 1.633 ± 0.342
CysGly: 1.867 ± 1.724
CysHis: 0.467 ± 0.242
CysIle: 1.633 ± 0.334
CysLys: 1.633 ± 0.424
CysLeu: 2.1 ± 0.372
CysMet: 0.233 ± 0.261
CysAsn: 1.4 ± 0.328
CysPro: 1.4 ± 0.445
CysGln: 0.233 ± 0.121
CysArg: 1.167 ± 0.282
CysSer: 1.167 ± 0.282
CysThr: 0.933 ± 0.181
CysVal: 2.1 ± 1.26
CysTrp: 0.233 ± 0.261
CysTyr: 1.4 ± 0.495
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.933 ± 0.453
AspCys: 1.867 ± 1.724
AspAsp: 4.666 ± 0.804
AspGlu: 3.966 ± 1.326
AspPhe: 4.666 ± 0.804
AspGly: 3.266 ± 0.139
AspHis: 1.4 ± 0.84
AspIle: 4.666 ± 0.949
AspLys: 4.433 ± 1.021
AspLeu: 6.066 ± 0.992
AspMet: 2.333 ± 0.275
AspAsn: 4.9 ± 0.876
AspPro: 1.4 ± 0.543
AspGln: 1.867 ± 0.968
AspArg: 1.867 ± 0.38
AspSer: 3.033 ± 1.744
AspThr: 1.633 ± 0.632
AspVal: 4.666 ± 0.745
AspTrp: 0.467 ± 0.242
AspTyr: 3.733 ± 0.957
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.033 ± 0.784
GluCys: 1.4 ± 0.248
GluAsp: 4.9 ± 0.5
GluGlu: 3.966 ± 1.377
GluPhe: 2.566 ± 0.988
GluGly: 2.566 ± 0.497
GluHis: 0.7 ± 0.363
GluIle: 5.133 ± 1.651
GluLys: 5.833 ± 0.577
GluLeu: 5.366 ± 1.401
GluMet: 3.5 ± 0.705
GluAsn: 3.733 ± 0.725
GluPro: 1.4 ± 0.84
GluGln: 1.867 ± 0.859
GluArg: 2.1 ± 0.781
GluSer: 4.666 ± 2.033
GluThr: 4.2 ± 0.456
GluVal: 3.966 ± 1.009
GluTrp: 0.467 ± 0.242
GluTyr: 3.266 ± 0.668
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.633 ± 0.953
PheCys: 1.167 ± 0.873
PheAsp: 3.266 ± 0.606
PheGlu: 2.8 ± 0.789
PhePhe: 1.4 ± 0.248
PheGly: 1.4 ± 0.248
PheHis: 1.4 ± 0.495
PheIle: 3.033 ± 0.617
PheLys: 3.966 ± 0.956
PheLeu: 5.6 ± 0.476
PheMet: 2.333 ± 0.564
PheAsn: 2.8 ± 0.99
PhePro: 1.867 ± 0.363
PheGln: 2.333 ± 0.275
PheArg: 1.633 ± 0.847
PheSer: 4.433 ± 1.952
PheThr: 1.4 ± 0.248
PheVal: 2.1 ± 0.459
PheTrp: 0.0 ± 0.0
PheTyr: 2.566 ± 0.863
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.4 ± 0.328
GlyCys: 1.633 ± 0.286
GlyAsp: 2.1 ± 0.372
GlyGlu: 2.333 ± 0.275
GlyPhe: 2.8 ± 0.744
GlyGly: 0.933 ± 0.484
GlyHis: 1.167 ± 0.582
GlyIle: 3.5 ± 1.093
GlyLys: 4.666 ± 1.597
GlyLeu: 4.2 ± 0.456
GlyMet: 0.933 ± 0.484
GlyAsn: 2.333 ± 1.333
GlyPro: 1.4 ± 0.445
GlyGln: 0.7 ± 0.522
GlyArg: 1.4 ± 0.328
GlySer: 3.733 ± 2.078
GlyThr: 2.1 ± 0.738
GlyVal: 2.333 ± 0.617
GlyTrp: 0.233 ± 0.121
GlyTyr: 1.4 ± 0.495
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.633 ± 0.511
HisCys: 0.467 ± 0.523
HisAsp: 1.4 ± 0.495
HisGlu: 0.933 ± 0.33
HisPhe: 1.4 ± 0.495
HisGly: 0.467 ± 0.242
HisHis: 0.233 ± 0.121
HisIle: 0.7 ± 0.522
HisLys: 1.867 ± 1.002
HisLeu: 2.1 ± 0.459
HisMet: 0.233 ± 0.121
HisAsn: 1.867 ± 0.629
HisPro: 0.7 ± 0.124
HisGln: 0.7 ± 0.124
HisArg: 0.7 ± 0.124
HisSer: 1.867 ± 0.38
HisThr: 0.7 ± 0.124
HisVal: 1.167 ± 0.282
HisTrp: 0.233 ± 0.539
HisTyr: 0.933 ± 0.181
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.966 ± 0.236
IleCys: 1.167 ± 0.282
IleAsp: 4.2 ± 0.917
IleGlu: 4.666 ± 0.44
IlePhe: 2.8 ± 0.817
IleGly: 2.8 ± 0.744
IleHis: 0.7 ± 0.363
IleIle: 4.433 ± 0.665
IleLys: 8.633 ± 2.637
IleLeu: 6.3 ± 0.795
IleMet: 2.566 ± 0.463
IleAsn: 4.9 ± 0.834
IlePro: 2.333 ± 0.22
IleGln: 0.933 ± 0.181
IleArg: 2.1 ± 0.806
IleSer: 9.099 ± 0.839
IleThr: 4.666 ± 1.597
IleVal: 4.666 ± 1.386
IleTrp: 0.233 ± 0.539
IleTyr: 3.733 ± 0.643
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.2 ± 2.093
LysCys: 3.5 ± 1.143
LysAsp: 6.3 ± 1.104
LysGlu: 7.699 ± 1.302
LysPhe: 3.266 ± 0.139
LysGly: 3.966 ± 0.236
LysHis: 3.733 ± 0.957
LysIle: 6.066 ± 0.684
LysLys: 7.699 ± 1.929
LysLeu: 6.066 ± 1.104
LysMet: 3.266 ± 0.572
LysAsn: 4.433 ± 0.476
LysPro: 0.933 ± 0.33
LysGln: 2.8 ± 0.081
LysArg: 3.266 ± 0.978
LysSer: 6.066 ± 0.987
LysThr: 6.066 ± 0.252
LysVal: 3.966 ± 0.207
LysTrp: 0.933 ± 0.181
LysTyr: 3.5 ± 0.797
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.5 ± 0.797
LeuCys: 2.1 ± 0.781
LeuAsp: 4.433 ± 2.073
LeuGlu: 7.0 ± 1.794
LeuPhe: 3.966 ± 0.818
LeuGly: 4.2 ± 1.587
LeuHis: 1.4 ± 0.495
LeuIle: 6.066 ± 0.157
LeuLys: 7.933 ± 0.815
LeuLeu: 5.6 ± 0.948
LeuMet: 5.833 ± 0.389
LeuAsn: 6.3 ± 0.795
LeuPro: 3.033 ± 1.238
LeuGln: 2.566 ± 0.771
LeuArg: 3.266 ± 0.74
LeuSer: 13.999 ± 1.342
LeuThr: 3.966 ± 0.829
LeuVal: 3.266 ± 0.606
LeuTrp: 0.7 ± 0.42
LeuTyr: 5.133 ± 0.925
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.7 ± 0.363
MetCys: 0.7 ± 0.42
MetAsp: 1.633 ± 0.619
MetGlu: 1.167 ± 0.605
MetPhe: 1.633 ± 0.847
MetGly: 1.633 ± 0.511
MetHis: 0.467 ± 0.483
MetIle: 3.966 ± 0.207
MetLys: 3.5 ± 1.093
MetLeu: 4.2 ± 0.748
MetMet: 1.167 ± 0.282
MetAsn: 3.733 ± 0.761
MetPro: 1.633 ± 0.632
MetGln: 1.4 ± 0.543
MetArg: 1.4 ± 0.543
MetSer: 2.8 ± 0.689
MetThr: 2.333 ± 0.546
MetVal: 1.633 ± 0.334
MetTrp: 0.0 ± 0.0
MetTyr: 1.633 ± 0.424
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.633 ± 0.286
AsnCys: 1.867 ± 0.66
AsnAsp: 4.433 ± 1.021
AsnGlu: 3.033 ± 1.062
AsnPhe: 4.9 ± 1.07
AsnGly: 2.333 ± 1.863
AsnHis: 1.867 ± 0.254
AsnIle: 4.2 ± 0.744
AsnLys: 6.066 ± 0.749
AsnLeu: 6.3 ± 0.046
AsnMet: 2.1 ± 1.089
AsnAsn: 2.566 ± 0.786
AsnPro: 3.266 ± 0.606
AsnGln: 1.4 ± 0.726
AsnArg: 2.1 ± 0.793
AsnSer: 5.133 ± 0.326
AsnThr: 4.2 ± 0.708
AsnVal: 5.133 ± 0.993
AsnTrp: 0.933 ± 0.434
AsnTyr: 4.2 ± 1.831
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.467 ± 0.242
ProCys: 0.933 ± 0.484
ProAsp: 1.4 ± 0.445
ProGlu: 2.333 ± 0.868
ProPhe: 1.167 ± 0.556
ProGly: 1.867 ± 0.708
ProHis: 0.233 ± 0.121
ProIle: 1.867 ± 0.859
ProLys: 2.8 ± 0.496
ProLeu: 2.566 ± 0.863
ProMet: 0.467 ± 0.242
ProAsn: 2.333 ± 0.22
ProPro: 0.933 ± 0.33
ProGln: 0.7 ± 0.522
ProArg: 0.933 ± 0.484
ProSer: 3.966 ± 1.292
ProThr: 1.167 ± 0.971
ProVal: 1.867 ± 0.868
ProTrp: 0.0 ± 0.0
ProTyr: 0.933 ± 0.181
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.467 ± 0.165
GlnCys: 0.0 ± 0.0
GlnAsp: 1.4 ± 0.248
GlnGlu: 2.1 ± 0.455
GlnPhe: 0.233 ± 0.261
GlnGly: 1.633 ± 0.334
GlnHis: 0.7 ± 0.42
GlnIle: 3.266 ± 1.414
GlnLys: 1.4 ± 0.445
GlnLeu: 3.266 ± 0.884
GlnMet: 1.167 ± 0.282
GlnAsn: 1.633 ± 0.342
GlnPro: 0.467 ± 0.483
GlnGln: 0.0 ± 0.0
GlnArg: 0.7 ± 0.363
GlnSer: 2.566 ± 0.463
GlnThr: 1.167 ± 0.282
GlnVal: 0.7 ± 0.124
GlnTrp: 0.467 ± 0.165
GlnTyr: 0.933 ± 0.33
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.7 ± 0.363
ArgCys: 0.933 ± 0.33
ArgAsp: 1.867 ± 0.708
ArgGlu: 3.733 ± 1.209
ArgPhe: 1.633 ± 0.511
ArgGly: 0.7 ± 0.786
ArgHis: 0.467 ± 0.242
ArgIle: 3.033 ± 0.126
ArgLys: 2.1 ± 0.455
ArgLeu: 3.266 ± 0.668
ArgMet: 0.7 ± 0.452
ArgAsn: 3.733 ± 1.265
ArgPro: 0.7 ± 0.363
ArgGln: 0.7 ± 0.42
ArgArg: 1.4 ± 0.248
ArgSer: 2.8 ± 0.789
ArgThr: 4.433 ± 0.665
ArgVal: 1.4 ± 0.328
ArgTrp: 0.233 ± 0.121
ArgTyr: 1.167 ± 0.266
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.566 ± 0.5
SerCys: 2.1 ± 0.372
SerAsp: 6.3 ± 0.394
SerGlu: 6.3 ± 0.771
SerPhe: 4.2 ± 1.184
SerGly: 3.966 ± 1.292
SerHis: 1.867 ± 0.66
SerIle: 7.699 ± 1.424
SerLys: 8.633 ± 0.514
SerLeu: 10.266 ± 0.713
SerMet: 3.033 ± 0.869
SerAsn: 6.066 ± 0.74
SerPro: 2.1 ± 0.455
SerGln: 2.1 ± 0.372
SerArg: 3.5 ± 0.59
SerSer: 7.0 ± 0.665
SerThr: 5.366 ± 0.518
SerVal: 4.9 ± 0.402
SerTrp: 0.467 ± 0.165
SerTyr: 2.566 ± 0.163
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.8 ± 0.789
ThrCys: 0.933 ± 0.68
ThrAsp: 2.8 ± 1.302
ThrGlu: 2.8 ± 0.376
ThrPhe: 3.5 ± 1.515
ThrGly: 3.266 ± 1.629
ThrHis: 0.7 ± 0.124
ThrIle: 3.266 ± 0.945
ThrLys: 4.9 ± 0.5
ThrLeu: 6.066 ± 0.984
ThrMet: 1.867 ± 0.66
ThrAsn: 5.366 ± 0.672
ThrPro: 1.4 ± 0.726
ThrGln: 0.7 ± 0.124
ThrArg: 1.867 ± 0.254
ThrSer: 4.2 ± 0.409
ThrThr: 2.566 ± 0.75
ThrVal: 3.266 ± 0.139
ThrTrp: 0.7 ± 0.786
ThrTyr: 1.867 ± 0.66
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.266 ± 0.923
ValCys: 0.467 ± 0.165
ValAsp: 3.266 ± 0.139
ValGlu: 3.733 ± 0.761
ValPhe: 1.4 ± 0.903
ValGly: 1.4 ± 0.84
ValHis: 1.4 ± 0.495
ValIle: 4.2 ± 0.982
ValLys: 3.5 ± 0.533
ValLeu: 6.066 ± 0.749
ValMet: 2.1 ± 0.372
ValAsn: 3.966 ± 0.608
ValPro: 1.867 ± 1.382
ValGln: 1.4 ± 0.328
ValArg: 1.633 ± 0.511
ValSer: 5.6 ± 1.579
ValThr: 3.033 ± 1.371
ValVal: 2.1 ± 0.459
ValTrp: 0.467 ± 0.242
ValTyr: 2.8 ± 0.081
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.233 ± 0.261
TrpCys: 0.0 ± 0.0
TrpAsp: 0.7 ± 0.522
TrpGlu: 0.7 ± 0.42
TrpPhe: 0.233 ± 0.121
TrpGly: 0.233 ± 0.261
TrpHis: 0.233 ± 0.121
TrpIle: 0.7 ± 0.522
TrpLys: 0.7 ± 0.452
TrpLeu: 0.7 ± 0.124
TrpMet: 0.467 ± 0.242
TrpAsn: 0.233 ± 0.121
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.233 ± 0.539
TrpSer: 1.167 ± 0.365
TrpThr: 0.0 ± 0.0
TrpVal: 0.7 ± 0.124
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.467 ± 0.165
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.4 ± 0.395
TyrCys: 0.467 ± 0.242
TyrAsp: 2.8 ± 0.386
TyrGlu: 0.933 ± 0.181
TyrPhe: 1.867 ± 0.363
TyrGly: 1.867 ± 0.363
TyrHis: 1.167 ± 0.282
TyrIle: 4.433 ± 0.879
TyrLys: 5.133 ± 0.838
TyrLeu: 4.2 ± 0.815
TyrMet: 1.4 ± 0.903
TyrAsn: 3.5 ± 1.14
TyrPro: 1.167 ± 0.582
TyrGln: 1.4 ± 0.495
TyrArg: 2.566 ± 0.5
TyrSer: 4.2 ± 0.909
TyrThr: 3.266 ± 1.841
TyrVal: 1.867 ± 0.363
TyrTrp: 0.233 ± 0.261
TyrTyr: 1.867 ± 0.629
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4287 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
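One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is an assumption about the procedure, not a description of the database's actual code; the function names, the fixed seed, and the use of the population standard deviation are all illustrative choices, and the input sequences are made up.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide in per mille (overlapping pairs, per protein)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome of three short sequences.
toy_proteome = ["MKLSLS", "ASLKDD", "LSLSLS"]
ls_error = bootstrap_error(toy_proteome, "LS")
```

With only three proteins, as in this proteome, each resample can differ strongly from the original set, which is consistent with the relatively large errors in the table.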

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski