Amino acid dipepetide frequency for Rudbeckia flower distortion virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.922AlaAla: 2.922 ± 1.151
0.835AlaCys: 0.835 ± 0.342
2.087AlaAsp: 2.087 ± 0.754
5.008AlaGlu: 5.008 ± 1.987
2.922AlaPhe: 2.922 ± 0.885
0.835AlaGly: 0.835 ± 0.606
0.0AlaHis: 0.0 ± 0.0
3.756AlaIle: 3.756 ± 1.812
4.174AlaLys: 4.174 ± 1.528
3.339AlaLeu: 3.339 ± 1.39
1.252AlaMet: 1.252 ± 0.693
2.087AlaAsn: 2.087 ± 0.531
0.417AlaPro: 0.417 ± 0.303
0.835AlaGln: 0.835 ± 0.71
0.835AlaArg: 0.835 ± 0.507
4.174AlaSer: 4.174 ± 1.824
2.087AlaThr: 2.087 ± 1.093
2.922AlaVal: 2.922 ± 1.035
0.417AlaTrp: 0.417 ± 0.355
2.087AlaTyr: 2.087 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.417CysAla: 0.417 ± 0.303
0.417CysCys: 0.417 ± 0.355
0.0CysAsp: 0.0 ± 0.0
1.669CysGlu: 1.669 ± 1.168
0.417CysPhe: 0.417 ± 0.303
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.669CysIle: 1.669 ± 0.604
1.252CysLys: 1.252 ± 0.628
2.087CysLeu: 2.087 ± 0.763
0.0CysMet: 0.0 ± 0.0
0.417CysAsn: 0.417 ± 0.545
1.252CysPro: 1.252 ± 0.628
0.417CysGln: 0.417 ± 0.486
0.835CysArg: 0.835 ± 0.342
2.087CysSer: 2.087 ± 1.079
1.252CysThr: 1.252 ± 0.563
0.0CysVal: 0.0 ± 0.0
0.417CysTrp: 0.417 ± 0.355
1.252CysTyr: 1.252 ± 0.586
0.0CysXaa: 0.0 ± 0.0
Asp
2.087AspAla: 2.087 ± 0.66
1.252AspCys: 1.252 ± 0.392
5.008AspAsp: 5.008 ± 0.859
7.93AspGlu: 7.93 ± 2.124
1.669AspPhe: 1.669 ± 0.501
2.087AspGly: 2.087 ± 0.934
1.669AspHis: 1.669 ± 0.881
2.922AspIle: 2.922 ± 1.124
4.591AspLys: 4.591 ± 1.553
3.756AspLeu: 3.756 ± 1.095
0.835AspMet: 0.835 ± 0.728
5.843AspAsn: 5.843 ± 0.919
1.669AspPro: 1.669 ± 1.03
1.252AspGln: 1.252 ± 0.527
1.669AspArg: 1.669 ± 0.623
5.426AspSer: 5.426 ± 1.597
3.756AspThr: 3.756 ± 1.128
2.504AspVal: 2.504 ± 1.371
0.835AspTrp: 0.835 ± 0.439
2.922AspTyr: 2.922 ± 0.669
0.0AspXaa: 0.0 ± 0.0
Glu
0.835GluAla: 0.835 ± 0.439
1.669GluCys: 1.669 ± 1.636
7.93GluAsp: 7.93 ± 1.717
5.426GluGlu: 5.426 ± 2.92
4.591GluPhe: 4.591 ± 0.966
2.504GluGly: 2.504 ± 1.348
2.922GluHis: 2.922 ± 0.702
7.93GluIle: 7.93 ± 1.848
5.843GluLys: 5.843 ± 1.228
7.095GluLeu: 7.095 ± 2.335
0.835GluMet: 0.835 ± 0.589
5.426GluAsn: 5.426 ± 1.281
0.835GluPro: 0.835 ± 0.71
3.756GluGln: 3.756 ± 1.751
3.339GluArg: 3.339 ± 1.107
3.339GluSer: 3.339 ± 1.399
4.591GluThr: 4.591 ± 1.168
2.922GluVal: 2.922 ± 0.565
1.252GluTrp: 1.252 ± 0.91
1.669GluTyr: 1.669 ± 0.684
0.0GluXaa: 0.0 ± 0.0
Phe
0.417PheAla: 0.417 ± 0.303
0.835PheCys: 0.835 ± 0.404
1.669PheAsp: 1.669 ± 1.104
3.339PheGlu: 3.339 ± 1.712
0.417PhePhe: 0.417 ± 0.355
1.669PheGly: 1.669 ± 0.702
1.669PheHis: 1.669 ± 0.509
2.504PheIle: 2.504 ± 0.986
5.008PheLys: 5.008 ± 1.187
5.843PheLeu: 5.843 ± 2.019
0.835PheMet: 0.835 ± 0.501
3.339PheAsn: 3.339 ± 0.854
3.339PhePro: 3.339 ± 1.253
2.087PheGln: 2.087 ± 1.305
2.504PheArg: 2.504 ± 1.042
3.756PheSer: 3.756 ± 0.861
2.504PheThr: 2.504 ± 1.172
0.835PheVal: 0.835 ± 0.441
0.417PheTrp: 0.417 ± 0.303
1.252PheTyr: 1.252 ± 0.51
0.0PheXaa: 0.0 ± 0.0
Gly
1.252GlyAla: 1.252 ± 0.392
0.417GlyCys: 0.417 ± 0.355
0.835GlyAsp: 0.835 ± 0.598
1.669GlyGlu: 1.669 ± 0.483
2.922GlyPhe: 2.922 ± 1.339
0.417GlyGly: 0.417 ± 0.355
1.252GlyHis: 1.252 ± 1.065
2.922GlyIle: 2.922 ± 0.752
3.756GlyLys: 3.756 ± 1.09
4.174GlyLeu: 4.174 ± 0.85
0.417GlyMet: 0.417 ± 0.303
2.922GlyAsn: 2.922 ± 1.071
0.417GlyPro: 0.417 ± 0.37
2.087GlyGln: 2.087 ± 0.854
1.669GlyArg: 1.669 ± 0.483
2.087GlySer: 2.087 ± 1.017
2.087GlyThr: 2.087 ± 0.802
1.252GlyVal: 1.252 ± 0.58
0.417GlyTrp: 0.417 ± 0.37
2.087GlyTyr: 2.087 ± 0.705
0.0GlyXaa: 0.0 ± 0.0
His
1.252HisAla: 1.252 ± 0.611
0.835HisCys: 0.835 ± 0.553
1.669HisAsp: 1.669 ± 1.306
0.835HisGlu: 0.835 ± 0.584
1.252HisPhe: 1.252 ± 0.58
0.0HisGly: 0.0 ± 0.0
0.835HisHis: 0.835 ± 0.441
1.669HisIle: 1.669 ± 0.627
0.0HisLys: 0.0 ± 0.0
2.087HisLeu: 2.087 ± 0.873
0.417HisMet: 0.417 ± 0.37
2.504HisAsn: 2.504 ± 1.095
1.252HisPro: 1.252 ± 0.831
0.835HisGln: 0.835 ± 0.589
0.835HisArg: 0.835 ± 0.71
2.922HisSer: 2.922 ± 1.227
1.669HisThr: 1.669 ± 0.499
0.0HisVal: 0.0 ± 0.0
0.417HisTrp: 0.417 ± 0.303
2.087HisTyr: 2.087 ± 0.765
0.0HisXaa: 0.0 ± 0.0
Ile
1.669IleAla: 1.669 ± 0.604
0.835IleCys: 0.835 ± 0.71
6.26IleAsp: 6.26 ± 2.576
4.591IleGlu: 4.591 ± 0.637
2.504IlePhe: 2.504 ± 0.748
2.087IleGly: 2.087 ± 1.743
0.0IleHis: 0.0 ± 0.0
6.678IleIle: 6.678 ± 2.646
9.182IleLys: 9.182 ± 1.796
8.765IleLeu: 8.765 ± 1.14
0.835IleMet: 0.835 ± 0.573
5.426IleAsn: 5.426 ± 1.381
5.843IlePro: 5.843 ± 1.171
5.426IleGln: 5.426 ± 1.563
4.174IleArg: 4.174 ± 1.966
5.008IleSer: 5.008 ± 0.903
5.426IleThr: 5.426 ± 1.688
2.504IleVal: 2.504 ± 0.469
0.417IleTrp: 0.417 ± 0.37
2.087IleTyr: 2.087 ± 0.555
0.0IleXaa: 0.0 ± 0.0
Lys
3.756LysAla: 3.756 ± 0.626
1.252LysCys: 1.252 ± 0.58
2.087LysAsp: 2.087 ± 0.643
6.26LysGlu: 6.26 ± 0.832
4.591LysPhe: 4.591 ± 1.118
2.504LysGly: 2.504 ± 0.561
2.504LysHis: 2.504 ± 1.054
7.095LysIle: 7.095 ± 1.881
7.095LysLys: 7.095 ± 1.922
10.434LysLeu: 10.434 ± 1.183
0.835LysMet: 0.835 ± 1.021
6.678LysAsn: 6.678 ± 1.981
5.008LysPro: 5.008 ± 1.528
4.591LysGln: 4.591 ± 1.506
6.678LysArg: 6.678 ± 0.333
4.174LysSer: 4.174 ± 1.77
7.93LysThr: 7.93 ± 0.926
1.669LysVal: 1.669 ± 0.865
0.417LysTrp: 0.417 ± 0.37
2.087LysTyr: 2.087 ± 0.586
0.0LysXaa: 0.0 ± 0.0
Leu
5.426LeuAla: 5.426 ± 2.152
2.087LeuCys: 2.087 ± 1.059
5.008LeuAsp: 5.008 ± 0.862
7.513LeuGlu: 7.513 ± 1.988
5.008LeuPhe: 5.008 ± 1.423
2.922LeuGly: 2.922 ± 0.61
2.087LeuHis: 2.087 ± 0.717
4.591LeuIle: 4.591 ± 1.336
7.93LeuLys: 7.93 ± 2.023
7.095LeuLeu: 7.095 ± 1.926
2.087LeuMet: 2.087 ± 0.902
7.095LeuAsn: 7.095 ± 1.366
5.426LeuPro: 5.426 ± 1.59
4.174LeuGln: 4.174 ± 2.102
4.591LeuArg: 4.591 ± 0.861
5.426LeuSer: 5.426 ± 1.103
7.513LeuThr: 7.513 ± 1.39
5.843LeuVal: 5.843 ± 0.792
0.835LeuTrp: 0.835 ± 0.605
3.339LeuTyr: 3.339 ± 1.102
0.0LeuXaa: 0.0 ± 0.0
Met
1.252MetAla: 1.252 ± 0.713
0.0MetCys: 0.0 ± 0.0
1.669MetAsp: 1.669 ± 0.779
0.835MetGlu: 0.835 ± 0.642
0.0MetPhe: 0.0 ± 0.0
0.417MetGly: 0.417 ± 0.303
0.835MetHis: 0.835 ± 0.646
2.087MetIle: 2.087 ± 1.025
0.835MetLys: 0.835 ± 0.739
0.417MetLeu: 0.417 ± 0.37
0.417MetMet: 0.417 ± 0.37
0.417MetAsn: 0.417 ± 0.303
0.417MetPro: 0.417 ± 0.37
0.0MetGln: 0.0 ± 0.0
0.417MetArg: 0.417 ± 0.486
0.835MetSer: 0.835 ± 0.589
2.087MetThr: 2.087 ± 0.903
0.417MetVal: 0.417 ± 0.545
0.417MetTrp: 0.417 ± 0.355
0.417MetTyr: 0.417 ± 0.37
0.0MetXaa: 0.0 ± 0.0
Asn
3.339AsnAla: 3.339 ± 0.96
1.252AsnCys: 1.252 ± 0.747
2.504AsnAsp: 2.504 ± 0.684
6.26AsnGlu: 6.26 ± 1.084
2.922AsnPhe: 2.922 ± 0.808
2.504AsnGly: 2.504 ± 1.124
1.669AsnHis: 1.669 ± 0.627
6.26AsnIle: 6.26 ± 1.868
7.095AsnLys: 7.095 ± 0.7
7.095AsnLeu: 7.095 ± 1.664
0.835AsnMet: 0.835 ± 0.505
4.174AsnAsn: 4.174 ± 1.944
2.922AsnPro: 2.922 ± 0.992
2.087AsnGln: 2.087 ± 1.245
2.087AsnArg: 2.087 ± 1.143
5.843AsnSer: 5.843 ± 1.036
2.504AsnThr: 2.504 ± 0.806
3.756AsnVal: 3.756 ± 0.615
0.835AsnTrp: 0.835 ± 1.091
5.426AsnTyr: 5.426 ± 1.473
0.0AsnXaa: 0.0 ± 0.0
Pro
2.922ProAla: 2.922 ± 0.742
0.417ProCys: 0.417 ± 0.303
2.087ProAsp: 2.087 ± 0.919
2.504ProGlu: 2.504 ± 1.713
2.087ProPhe: 2.087 ± 0.754
1.669ProGly: 1.669 ± 0.684
1.669ProHis: 1.669 ± 0.959
3.756ProIle: 3.756 ± 1.523
2.504ProLys: 2.504 ± 0.725
5.426ProLeu: 5.426 ± 0.837
1.252ProMet: 1.252 ± 0.578
3.756ProAsn: 3.756 ± 1.039
1.252ProPro: 1.252 ± 0.558
0.417ProGln: 0.417 ± 0.37
2.087ProArg: 2.087 ± 1.113
2.087ProSer: 2.087 ± 0.426
3.339ProThr: 3.339 ± 0.856
1.669ProVal: 1.669 ± 0.759
0.0ProTrp: 0.0 ± 0.0
0.835ProTyr: 0.835 ± 0.404
0.0ProXaa: 0.0 ± 0.0
Gln
3.339GlnAla: 3.339 ± 0.697
0.417GlnCys: 0.417 ± 0.503
2.087GlnAsp: 2.087 ± 1.161
3.756GlnGlu: 3.756 ± 0.903
0.835GlnPhe: 0.835 ± 0.7
2.504GlnGly: 2.504 ± 0.553
0.417GlnHis: 0.417 ± 0.545
2.922GlnIle: 2.922 ± 0.669
2.504GlnLys: 2.504 ± 0.561
4.174GlnLeu: 4.174 ± 1.328
0.0GlnMet: 0.0 ± 0.0
2.087GlnAsn: 2.087 ± 1.024
2.087GlnPro: 2.087 ± 1.094
2.504GlnGln: 2.504 ± 0.868
2.504GlnArg: 2.504 ± 1.386
1.252GlnSer: 1.252 ± 0.569
2.922GlnThr: 2.922 ± 1.295
3.339GlnVal: 3.339 ± 1.484
0.417GlnTrp: 0.417 ± 0.303
1.669GlnTyr: 1.669 ± 1.068
0.0GlnXaa: 0.0 ± 0.0
Arg
2.504ArgAla: 2.504 ± 1.004
0.417ArgCys: 0.417 ± 0.355
0.835ArgAsp: 0.835 ± 0.618
2.087ArgGlu: 2.087 ± 0.91
2.504ArgPhe: 2.504 ± 0.965
2.922ArgGly: 2.922 ± 1.221
0.417ArgHis: 0.417 ± 0.303
3.339ArgIle: 3.339 ± 1.086
6.678ArgLys: 6.678 ± 0.812
4.591ArgLeu: 4.591 ± 1.31
0.417ArgMet: 0.417 ± 0.37
4.174ArgAsn: 4.174 ± 1.288
1.252ArgPro: 1.252 ± 0.611
0.835ArgGln: 0.835 ± 0.602
2.922ArgArg: 2.922 ± 1.678
2.922ArgSer: 2.922 ± 1.097
4.174ArgThr: 4.174 ± 1.092
0.417ArgVal: 0.417 ± 0.545
0.417ArgTrp: 0.417 ± 0.303
2.504ArgTyr: 2.504 ± 0.769
0.0ArgXaa: 0.0 ± 0.0
Ser
0.835SerAla: 0.835 ± 0.584
1.252SerCys: 1.252 ± 0.754
9.599SerAsp: 9.599 ± 1.682
5.008SerGlu: 5.008 ± 1.706
2.922SerPhe: 2.922 ± 0.808
3.756SerGly: 3.756 ± 1.118
2.504SerHis: 2.504 ± 0.616
6.678SerIle: 6.678 ± 2.448
8.765SerLys: 8.765 ± 1.239
3.339SerLeu: 3.339 ± 0.958
0.835SerMet: 0.835 ± 0.606
4.174SerAsn: 4.174 ± 1.049
2.922SerPro: 2.922 ± 1.412
0.835SerGln: 0.835 ± 0.598
3.339SerArg: 3.339 ± 1.185
7.095SerSer: 7.095 ± 2.431
4.174SerThr: 4.174 ± 1.399
1.252SerVal: 1.252 ± 0.842
0.417SerTrp: 0.417 ± 0.355
2.504SerTyr: 2.504 ± 0.988
0.0SerXaa: 0.0 ± 0.0
Thr
4.174ThrAla: 4.174 ± 0.851
0.417ThrCys: 0.417 ± 0.444
4.591ThrAsp: 4.591 ± 1.559
3.756ThrGlu: 3.756 ± 0.827
2.087ThrPhe: 2.087 ± 0.532
3.339ThrGly: 3.339 ± 1.161
1.252ThrHis: 1.252 ± 0.569
7.095ThrIle: 7.095 ± 1.979
2.922ThrLys: 2.922 ± 1.288
4.591ThrLeu: 4.591 ± 1.206
0.417ThrMet: 0.417 ± 0.545
5.426ThrAsn: 5.426 ± 1.035
1.252ThrPro: 1.252 ± 0.831
3.339ThrGln: 3.339 ± 1.538
2.922ThrArg: 2.922 ± 0.947
5.426ThrSer: 5.426 ± 1.775
2.087ThrThr: 2.087 ± 0.926
5.008ThrVal: 5.008 ± 1.241
0.0ThrTrp: 0.0 ± 0.0
2.504ThrTyr: 2.504 ± 0.868
0.0ThrXaa: 0.0 ± 0.0
Val
0.835ValAla: 0.835 ± 0.404
0.0ValCys: 0.0 ± 0.0
2.504ValAsp: 2.504 ± 0.614
2.504ValGlu: 2.504 ± 1.124
0.835ValPhe: 0.835 ± 0.584
1.252ValGly: 1.252 ± 0.392
0.417ValHis: 0.417 ± 0.355
2.922ValIle: 2.922 ± 1.004
3.339ValLys: 3.339 ± 1.14
4.591ValLeu: 4.591 ± 1.252
0.835ValMet: 0.835 ± 0.507
2.922ValAsn: 2.922 ± 1.138
2.087ValPro: 2.087 ± 0.643
2.922ValGln: 2.922 ± 0.795
0.835ValArg: 0.835 ± 0.404
3.756ValSer: 3.756 ± 0.987
2.087ValThr: 2.087 ± 0.902
1.252ValVal: 1.252 ± 0.563
0.417ValTrp: 0.417 ± 0.303
3.339ValTyr: 3.339 ± 0.947
0.0ValXaa: 0.0 ± 0.0
Trp
0.835TrpAla: 0.835 ± 0.342
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.252TrpGlu: 1.252 ± 0.754
0.417TrpPhe: 0.417 ± 0.303
0.0TrpGly: 0.0 ± 0.0
0.417TrpHis: 0.417 ± 0.37
0.835TrpIle: 0.835 ± 0.439
0.417TrpLys: 0.417 ± 0.37
0.417TrpLeu: 0.417 ± 0.355
0.417TrpMet: 0.417 ± 0.37
0.835TrpAsn: 0.835 ± 0.606
0.417TrpPro: 0.417 ± 0.545
0.835TrpGln: 0.835 ± 0.606
1.252TrpArg: 1.252 ± 0.628
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.417TrpTyr: 0.417 ± 0.503
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.922TyrAla: 2.922 ± 1.251
1.252TyrCys: 1.252 ± 0.679
1.669TyrAsp: 1.669 ± 0.604
2.504TyrGlu: 2.504 ± 0.745
3.339TyrPhe: 3.339 ± 1.264
1.669TyrGly: 1.669 ± 1.15
0.835TyrHis: 0.835 ± 0.404
2.087TyrIle: 2.087 ± 0.629
3.756TyrLys: 3.756 ± 0.788
6.678TyrLeu: 6.678 ± 1.094
0.0TyrMet: 0.0 ± 0.0
1.669TyrAsn: 1.669 ± 1.042
1.669TyrPro: 1.669 ± 1.08
2.504TyrGln: 2.504 ± 0.599
0.835TyrArg: 0.835 ± 0.501
4.591TyrSer: 4.591 ± 0.763
0.417TyrThr: 0.417 ± 0.303
2.087TyrVal: 2.087 ± 1.516
0.0TyrTrp: 0.0 ± 0.0
1.669TyrTyr: 1.669 ± 0.87
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2397 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski