Amino acid dipepetide frequency for Cherry rusty mottle associated virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.36AlaAla: 3.36 ± 2.218
1.008AlaCys: 1.008 ± 0.649
4.032AlaAsp: 4.032 ± 1.456
2.688AlaGlu: 2.688 ± 0.637
2.352AlaPhe: 2.352 ± 1.045
5.04AlaGly: 5.04 ± 0.921
1.68AlaHis: 1.68 ± 1.819
3.36AlaIle: 3.36 ± 1.398
5.376AlaLys: 5.376 ± 1.613
5.712AlaLeu: 5.712 ± 0.866
0.336AlaMet: 0.336 ± 0.186
3.696AlaAsn: 3.696 ± 0.689
2.352AlaPro: 2.352 ± 0.662
1.344AlaGln: 1.344 ± 0.587
4.032AlaArg: 4.032 ± 1.221
6.048AlaSer: 6.048 ± 2.305
3.36AlaThr: 3.36 ± 1.032
3.36AlaVal: 3.36 ± 1.294
0.0AlaTrp: 0.0 ± 0.0
1.008AlaTyr: 1.008 ± 0.557
0.0AlaXaa: 0.0 ± 0.0
Cys
0.672CysAla: 0.672 ± 0.371
0.336CysCys: 0.336 ± 1.015
0.336CysAsp: 0.336 ± 0.186
1.008CysGlu: 1.008 ± 1.044
3.024CysPhe: 3.024 ± 0.964
1.008CysGly: 1.008 ± 0.806
0.672CysHis: 0.672 ± 0.371
2.352CysIle: 2.352 ± 1.045
0.336CysLys: 0.336 ± 0.666
2.688CysLeu: 2.688 ± 1.685
0.336CysMet: 0.336 ± 1.015
0.336CysAsn: 0.336 ± 0.666
1.344CysPro: 1.344 ± 0.743
0.0CysGln: 0.0 ± 0.0
0.336CysArg: 0.336 ± 0.186
3.024CysSer: 3.024 ± 1.545
1.68CysThr: 1.68 ± 0.721
1.68CysVal: 1.68 ± 0.848
0.0CysTrp: 0.0 ± 0.0
1.344CysTyr: 1.344 ± 0.558
0.0CysXaa: 0.0 ± 0.0
Asp
2.352AspAla: 2.352 ± 0.61
0.672AspCys: 0.672 ± 0.584
2.688AspAsp: 2.688 ± 1.116
4.032AspGlu: 4.032 ± 1.109
4.704AspPhe: 4.704 ± 1.333
2.688AspGly: 2.688 ± 1.117
1.344AspHis: 1.344 ± 0.743
2.688AspIle: 2.688 ± 1.116
3.024AspLys: 3.024 ± 1.19
5.04AspLeu: 5.04 ± 1.363
0.336AspMet: 0.336 ± 0.186
2.352AspAsn: 2.352 ± 0.885
3.36AspPro: 3.36 ± 1.608
1.344AspGln: 1.344 ± 0.743
2.688AspArg: 2.688 ± 1.04
5.712AspSer: 5.712 ± 1.315
0.672AspThr: 0.672 ± 0.584
1.008AspVal: 1.008 ± 0.557
2.016AspTrp: 2.016 ± 0.769
1.68AspTyr: 1.68 ± 0.848
0.0AspXaa: 0.0 ± 0.0
Glu
4.704GluAla: 4.704 ± 1.547
0.672GluCys: 0.672 ± 0.371
2.016GluAsp: 2.016 ± 0.79
6.72GluGlu: 6.72 ± 1.883
3.696GluPhe: 3.696 ± 1.454
5.04GluGly: 5.04 ± 2.014
1.68GluHis: 1.68 ± 0.611
5.712GluIle: 5.712 ± 3.021
5.376GluLys: 5.376 ± 1.695
4.704GluLeu: 4.704 ± 1.535
1.008GluMet: 1.008 ± 1.158
1.008GluAsn: 1.008 ± 0.694
2.688GluPro: 2.688 ± 1.04
2.352GluGln: 2.352 ± 1.196
3.024GluArg: 3.024 ± 0.991
4.704GluSer: 4.704 ± 1.333
0.672GluThr: 0.672 ± 0.654
4.704GluVal: 4.704 ± 1.333
0.0GluTrp: 0.0 ± 0.0
1.68GluTyr: 1.68 ± 0.611
0.0GluXaa: 0.0 ± 0.0
Phe
2.016PheAla: 2.016 ± 0.769
2.352PheCys: 2.352 ± 1.345
5.376PheAsp: 5.376 ± 1.615
6.384PheGlu: 6.384 ± 2.605
2.016PhePhe: 2.016 ± 0.764
3.36PheGly: 3.36 ± 1.341
3.696PheHis: 3.696 ± 1.012
5.04PheIle: 5.04 ± 1.264
5.04PheLys: 5.04 ± 1.379
5.712PheLeu: 5.712 ± 1.906
1.344PheMet: 1.344 ± 0.652
3.024PheAsn: 3.024 ± 0.805
1.68PhePro: 1.68 ± 0.93
1.008PheGln: 1.008 ± 0.603
2.016PheArg: 2.016 ± 0.79
6.72PheSer: 6.72 ± 3.21
3.36PheThr: 3.36 ± 1.389
3.36PheVal: 3.36 ± 0.817
0.336PheTrp: 0.336 ± 0.186
0.672PheTyr: 0.672 ± 0.371
0.0PheXaa: 0.0 ± 0.0
Gly
3.024GlyAla: 3.024 ± 1.061
2.352GlyCys: 2.352 ± 0.861
4.032GlyAsp: 4.032 ± 1.93
2.016GlyGlu: 2.016 ± 0.852
4.032GlyPhe: 4.032 ± 1.785
3.024GlyGly: 3.024 ± 1.122
0.672GlyHis: 0.672 ± 0.584
1.344GlyIle: 1.344 ± 0.558
5.376GlyLys: 5.376 ± 0.997
4.368GlyLeu: 4.368 ± 1.035
1.008GlyMet: 1.008 ± 0.533
1.344GlyAsn: 1.344 ± 0.842
2.016GlyPro: 2.016 ± 1.149
3.024GlyGln: 3.024 ± 1.031
4.032GlyArg: 4.032 ± 1.765
4.704GlySer: 4.704 ± 1.226
3.36GlyThr: 3.36 ± 1.847
5.376GlyVal: 5.376 ± 1.623
0.672GlyTrp: 0.672 ± 0.371
2.016GlyTyr: 2.016 ± 0.79
0.0GlyXaa: 0.0 ± 0.0
His
1.008HisAla: 1.008 ± 0.556
1.008HisCys: 1.008 ± 0.557
1.008HisAsp: 1.008 ± 0.557
2.016HisGlu: 2.016 ± 1.114
2.016HisPhe: 2.016 ± 0.835
1.344HisGly: 1.344 ± 1.714
2.688HisHis: 2.688 ± 2.814
0.336HisIle: 0.336 ± 0.186
1.68HisLys: 1.68 ± 0.929
2.688HisLeu: 2.688 ± 1.977
0.336HisMet: 0.336 ± 0.186
2.016HisAsn: 2.016 ± 0.769
1.008HisPro: 1.008 ± 0.556
0.336HisGln: 0.336 ± 0.721
0.672HisArg: 0.672 ± 0.371
4.368HisSer: 4.368 ± 2.272
1.68HisThr: 1.68 ± 0.721
0.672HisVal: 0.672 ± 0.584
0.0HisTrp: 0.0 ± 0.0
0.672HisTyr: 0.672 ± 0.371
0.0HisXaa: 0.0 ± 0.0
Ile
3.024IleAla: 3.024 ± 1.061
2.352IleCys: 2.352 ± 1.769
1.68IleAsp: 1.68 ± 0.643
5.04IleGlu: 5.04 ± 1.667
4.368IlePhe: 4.368 ± 1.254
3.36IleGly: 3.36 ± 1.185
1.344IleHis: 1.344 ± 1.365
3.696IleIle: 3.696 ± 2.186
3.696IleLys: 3.696 ± 1.095
5.376IleLeu: 5.376 ± 1.146
0.672IleMet: 0.672 ± 0.371
5.04IleAsn: 5.04 ± 1.758
2.688IlePro: 2.688 ± 2.438
1.68IleGln: 1.68 ± 0.643
3.696IleArg: 3.696 ± 2.071
6.384IleSer: 6.384 ± 2.232
2.352IleThr: 2.352 ± 1.43
5.376IleVal: 5.376 ± 2.827
0.336IleTrp: 0.336 ± 1.015
1.344IleTyr: 1.344 ± 0.752
0.0IleXaa: 0.0 ± 0.0
Lys
6.048LysAla: 6.048 ± 1.949
2.352LysCys: 2.352 ± 0.821
4.368LysAsp: 4.368 ± 0.881
4.032LysGlu: 4.032 ± 1.109
4.368LysPhe: 4.368 ± 1.413
5.04LysGly: 5.04 ± 2.301
0.672LysHis: 0.672 ± 0.371
4.368LysIle: 4.368 ± 1.727
6.72LysLys: 6.72 ± 2.109
6.72LysLeu: 6.72 ± 2.537
1.008LysMet: 1.008 ± 0.557
1.68LysAsn: 1.68 ± 1.074
2.352LysPro: 2.352 ± 2.216
1.344LysGln: 1.344 ± 0.607
4.368LysArg: 4.368 ± 1.303
3.696LysSer: 3.696 ± 1.348
4.032LysThr: 4.032 ± 1.528
3.696LysVal: 3.696 ± 1.069
0.0LysTrp: 0.0 ± 0.0
3.696LysTyr: 3.696 ± 1.12
0.0LysXaa: 0.0 ± 0.0
Leu
6.72LeuAla: 6.72 ± 1.653
2.016LeuCys: 2.016 ± 0.916
4.368LeuAsp: 4.368 ± 2.162
6.048LeuGlu: 6.048 ± 1.887
3.36LeuPhe: 3.36 ± 1.588
5.376LeuGly: 5.376 ± 1.127
1.68LeuHis: 1.68 ± 0.929
9.073LeuIle: 9.073 ± 3.647
7.392LeuLys: 7.392 ± 1.358
11.761LeuLeu: 11.761 ± 9.522
2.352LeuMet: 2.352 ± 0.907
3.696LeuAsn: 3.696 ± 0.849
5.376LeuPro: 5.376 ± 1.815
3.696LeuGln: 3.696 ± 2.125
6.384LeuArg: 6.384 ± 2.202
9.409LeuSer: 9.409 ± 4.046
7.392LeuThr: 7.392 ± 6.141
7.728LeuVal: 7.728 ± 2.995
0.0LeuTrp: 0.0 ± 0.0
2.688LeuTyr: 2.688 ± 2.088
0.0LeuXaa: 0.0 ± 0.0
Met
1.68MetAla: 1.68 ± 0.929
0.336MetCys: 0.336 ± 0.186
0.336MetAsp: 0.336 ± 0.186
1.68MetGlu: 1.68 ± 1.006
1.008MetPhe: 1.008 ± 0.557
1.008MetGly: 1.008 ± 0.557
0.0MetHis: 0.0 ± 0.0
0.336MetIle: 0.336 ± 0.186
1.344MetLys: 1.344 ± 0.743
2.352MetLeu: 2.352 ± 1.069
1.344MetMet: 1.344 ± 1.307
0.672MetAsn: 0.672 ± 0.371
0.336MetPro: 0.336 ± 0.186
0.336MetGln: 0.336 ± 0.186
1.008MetArg: 1.008 ± 0.557
1.68MetSer: 1.68 ± 0.996
1.344MetThr: 1.344 ± 1.123
0.672MetVal: 0.672 ± 0.925
0.336MetTrp: 0.336 ± 0.186
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.36AsnAla: 3.36 ± 1.857
0.672AsnCys: 0.672 ± 0.562
1.68AsnAsp: 1.68 ± 0.643
1.008AsnGlu: 1.008 ± 0.557
4.368AsnPhe: 4.368 ± 1.289
1.344AsnGly: 1.344 ± 0.752
2.016AsnHis: 2.016 ± 1.365
1.68AsnIle: 1.68 ± 1.204
2.688AsnLys: 2.688 ± 1.486
6.72AsnLeu: 6.72 ± 1.898
1.68AsnMet: 1.68 ± 0.666
0.672AsnAsn: 0.672 ± 0.371
2.688AsnPro: 2.688 ± 1.055
1.344AsnGln: 1.344 ± 0.842
2.016AsnArg: 2.016 ± 1.388
3.36AsnSer: 3.36 ± 1.085
1.344AsnThr: 1.344 ± 0.842
1.68AsnVal: 1.68 ± 0.671
0.672AsnTrp: 0.672 ± 0.562
1.344AsnTyr: 1.344 ± 0.752
0.0AsnXaa: 0.0 ± 0.0
Pro
2.016ProAla: 2.016 ± 1.056
0.672ProCys: 0.672 ± 0.584
3.696ProAsp: 3.696 ± 0.689
1.008ProGlu: 1.008 ± 0.557
2.016ProPhe: 2.016 ± 0.97
2.352ProGly: 2.352 ± 1.319
0.672ProHis: 0.672 ± 1.19
3.024ProIle: 3.024 ± 2.074
3.36ProLys: 3.36 ± 1.414
3.024ProLeu: 3.024 ± 2.312
1.008ProMet: 1.008 ± 0.556
0.672ProAsn: 0.672 ± 0.371
1.68ProPro: 1.68 ± 1.011
1.008ProGln: 1.008 ± 0.865
2.352ProArg: 2.352 ± 1.11
4.704ProSer: 4.704 ± 1.428
2.688ProThr: 2.688 ± 1.823
1.68ProVal: 1.68 ± 0.707
0.672ProTrp: 0.672 ± 0.371
1.008ProTyr: 1.008 ± 0.556
0.0ProXaa: 0.0 ± 0.0
Gln
0.672GlnAla: 0.672 ± 0.371
0.336GlnCys: 0.336 ± 0.186
1.344GlnAsp: 1.344 ± 0.558
2.352GlnGlu: 2.352 ± 2.241
2.016GlnPhe: 2.016 ± 1.114
0.672GlnGly: 0.672 ± 0.654
0.672GlnHis: 0.672 ± 0.371
0.672GlnIle: 0.672 ± 0.925
2.352GlnLys: 2.352 ± 0.894
3.36GlnLeu: 3.36 ± 1.518
0.672GlnMet: 0.672 ± 0.812
1.344GlnAsn: 1.344 ± 0.587
2.016GlnPro: 2.016 ± 1.115
0.336GlnGln: 0.336 ± 0.649
3.024GlnArg: 3.024 ± 1.148
3.696GlnSer: 3.696 ± 1.492
0.672GlnThr: 0.672 ± 0.371
1.008GlnVal: 1.008 ± 0.528
0.0GlnTrp: 0.0 ± 0.0
0.336GlnTyr: 0.336 ± 0.186
0.0GlnXaa: 0.0 ± 0.0
Arg
3.36ArgAla: 3.36 ± 1.162
0.672ArgCys: 0.672 ± 0.371
2.688ArgAsp: 2.688 ± 1.428
2.688ArgGlu: 2.688 ± 1.086
4.368ArgPhe: 4.368 ± 1.105
2.016ArgGly: 2.016 ± 1.149
1.008ArgHis: 1.008 ± 0.557
3.36ArgIle: 3.36 ± 0.971
3.36ArgLys: 3.36 ± 0.589
5.376ArgLeu: 5.376 ± 1.66
0.672ArgMet: 0.672 ± 0.371
4.032ArgAsn: 4.032 ± 1.702
0.672ArgPro: 0.672 ± 0.654
1.68ArgGln: 1.68 ± 1.244
3.36ArgArg: 3.36 ± 2.969
4.032ArgSer: 4.032 ± 1.901
3.36ArgThr: 3.36 ± 2.542
2.688ArgVal: 2.688 ± 0.934
0.336ArgTrp: 0.336 ± 0.186
2.016ArgTyr: 2.016 ± 0.764
0.0ArgXaa: 0.0 ± 0.0
Ser
7.392SerAla: 7.392 ± 2.407
1.344SerCys: 1.344 ± 0.743
5.04SerAsp: 5.04 ± 2.645
5.376SerGlu: 5.376 ± 2.482
5.712SerPhe: 5.712 ± 1.195
5.04SerGly: 5.04 ± 1.607
2.688SerHis: 2.688 ± 1.089
7.056SerIle: 7.056 ± 1.779
6.048SerLys: 6.048 ± 1.672
9.409SerLeu: 9.409 ± 3.074
1.344SerMet: 1.344 ± 0.743
5.04SerAsn: 5.04 ± 1.675
2.688SerPro: 2.688 ± 0.842
1.68SerGln: 1.68 ± 0.666
3.024SerArg: 3.024 ± 0.901
9.073SerSer: 9.073 ± 3.3
4.032SerThr: 4.032 ± 0.928
6.048SerVal: 6.048 ± 3.38
0.672SerTrp: 0.672 ± 0.654
4.032SerTyr: 4.032 ± 1.068
0.0SerXaa: 0.0 ± 0.0
Thr
2.688ThrAla: 2.688 ± 1.834
0.336ThrCys: 0.336 ± 0.186
1.68ThrAsp: 1.68 ± 0.76
3.36ThrGlu: 3.36 ± 1.221
5.712ThrPhe: 5.712 ± 1.433
4.368ThrGly: 4.368 ± 2.175
1.008ThrHis: 1.008 ± 0.649
3.36ThrIle: 3.36 ± 1.763
2.352ThrLys: 2.352 ± 0.803
8.065ThrLeu: 8.065 ± 3.559
0.0ThrMet: 0.0 ± 0.0
1.344ThrAsn: 1.344 ± 0.587
1.68ThrPro: 1.68 ± 1.496
1.344ThrGln: 1.344 ± 0.939
1.008ThrArg: 1.008 ± 1.133
4.368ThrSer: 4.368 ± 1.59
1.344ThrThr: 1.344 ± 1.446
1.344ThrVal: 1.344 ± 1.535
0.0ThrTrp: 0.0 ± 0.0
1.008ThrTyr: 1.008 ± 1.133
0.0ThrXaa: 0.0 ± 0.0
Val
3.024ValAla: 3.024 ± 0.929
2.352ValCys: 2.352 ± 2.152
2.352ValAsp: 2.352 ± 1.11
2.352ValGlu: 2.352 ± 1.493
4.704ValPhe: 4.704 ± 0.947
4.032ValGly: 4.032 ± 1.319
2.016ValHis: 2.016 ± 0.835
4.032ValIle: 4.032 ± 1.66
4.032ValLys: 4.032 ± 1.153
7.056ValLeu: 7.056 ± 4.376
1.008ValMet: 1.008 ± 0.557
2.352ValAsn: 2.352 ± 0.979
1.008ValPro: 1.008 ± 0.556
2.688ValGln: 2.688 ± 0.69
3.36ValArg: 3.36 ± 1.099
3.696ValSer: 3.696 ± 1.125
2.688ValThr: 2.688 ± 1.695
5.04ValVal: 5.04 ± 2.321
1.008ValTrp: 1.008 ± 1.199
1.344ValTyr: 1.344 ± 0.842
0.0ValXaa: 0.0 ± 0.0
Trp
0.336TrpAla: 0.336 ± 0.186
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.336TrpHis: 0.336 ± 0.186
0.336TrpIle: 0.336 ± 1.015
0.0TrpLys: 0.0 ± 0.0
1.344TrpLeu: 1.344 ± 0.842
0.0TrpMet: 0.0 ± 0.0
0.336TrpAsn: 0.336 ± 0.649
0.336TrpPro: 0.336 ± 0.186
0.336TrpGln: 0.336 ± 0.649
0.672TrpArg: 0.672 ± 0.371
1.344TrpSer: 1.344 ± 0.752
0.336TrpThr: 0.336 ± 0.748
1.68TrpVal: 1.68 ± 0.643
0.0TrpTrp: 0.0 ± 0.0
0.336TrpTyr: 0.336 ± 0.649
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.688TyrAla: 2.688 ± 0.714
0.672TyrCys: 0.672 ± 1.297
1.68TyrAsp: 1.68 ± 0.929
2.352TyrGlu: 2.352 ± 0.61
0.672TyrPhe: 0.672 ± 0.371
1.68TyrGly: 1.68 ± 0.671
1.008TyrHis: 1.008 ± 0.806
1.68TyrIle: 1.68 ± 0.929
1.008TyrLys: 1.008 ± 0.557
5.04TyrLeu: 5.04 ± 1.64
1.008TyrMet: 1.008 ± 0.795
1.68TyrAsn: 1.68 ± 1.033
1.344TyrPro: 1.344 ± 0.558
1.008TyrGln: 1.008 ± 0.556
0.672TyrArg: 0.672 ± 1.021
2.016TyrSer: 2.016 ± 1.005
0.0TyrThr: 0.0 ± 0.0
1.344TyrVal: 1.344 ± 0.855
0.672TyrTrp: 0.672 ± 0.925
0.336TyrTyr: 0.336 ± 0.186
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2977 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski