Amino acid dipeptide frequency for Turnip leaf roll virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
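The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function names, the one-letter alphabet with `X` standing in for Xaa, the choice not to count pairs that span two proteins, and the use of the uniform 2.5‰ value as the over/under threshold are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs within one protein (assumption: pairs never span two proteins).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide as over- or under-represented relative to `expected`."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy = ["MAAK", "AAXG"]  # hypothetical toy sequences, not the viral proteome
freqs = dipeptide_per_mille(toy)
print(round(freqs["AA"], 1), classify(freqs["AA"]))
```

The returned dictionary always has 441 keys, so dipeptides that never occur appear with a frequency of 0.0, as in the table.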

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
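As a small worked example of the unit conversion, the following sketch turns a per-mille value into a plain fraction and into percent. The helper names are hypothetical; the input is the AlaSer value from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ser = 8.78  # AlaSer value from the table, in per mille
print(per_mille_to_fraction(ala_ser))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ser))   # same value in percent
```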

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.634AlaAla: 2.634 ± 1.849
0.878AlaCys: 0.878 ± 0.616
0.878AlaAsp: 0.878 ± 0.814
4.39AlaGlu: 4.39 ± 1.635
4.39AlaPhe: 4.39 ± 1.377
3.512AlaGly: 3.512 ± 1.924
0.0AlaHis: 0.0 ± 0.0
2.634AlaIle: 2.634 ± 1.19
2.634AlaLys: 2.634 ± 1.19
6.146AlaLeu: 6.146 ± 2.105
0.878AlaMet: 0.878 ± 0.959
0.878AlaAsn: 0.878 ± 0.814
5.268AlaPro: 5.268 ± 1.997
0.878AlaGln: 0.878 ± 0.616
3.512AlaArg: 3.512 ± 1.713
8.78AlaSer: 8.78 ± 2.156
0.878AlaThr: 0.878 ± 1.072
1.756AlaVal: 1.756 ± 0.979
0.0AlaTrp: 0.0 ± 0.0
1.756AlaTyr: 1.756 ± 2.143
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.634CysGly: 2.634 ± 1.354
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.634CysLys: 2.634 ± 2.162
0.878CysLeu: 0.878 ± 1.168
0.878CysMet: 0.878 ± 0.814
1.756CysAsn: 1.756 ± 1.233
0.878CysPro: 0.878 ± 0.616
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.756CysThr: 1.756 ± 0.812
0.0CysVal: 0.0 ± 0.0
0.878CysTrp: 0.878 ± 0.959
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.634AspAla: 2.634 ± 1.016
0.878AspCys: 0.878 ± 0.814
4.39AspAsp: 4.39 ± 2.473
2.634AspGlu: 2.634 ± 2.084
5.268AspPhe: 5.268 ± 1.337
2.634AspGly: 2.634 ± 1.849
0.878AspHis: 0.878 ± 0.959
0.878AspIle: 0.878 ± 0.814
4.39AspLys: 4.39 ± 2.944
5.268AspLeu: 5.268 ± 1.436
1.756AspMet: 1.756 ± 1.06
4.39AspAsn: 4.39 ± 2.139
2.634AspPro: 2.634 ± 1.19
0.878AspGln: 0.878 ± 0.814
2.634AspArg: 2.634 ± 1.643
0.878AspSer: 0.878 ± 0.814
1.756AspThr: 1.756 ± 1.06
3.512AspVal: 3.512 ± 1.903
2.634AspTrp: 2.634 ± 1.778
1.756AspTyr: 1.756 ± 0.812
0.0AspXaa: 0.0 ± 0.0
Glu
8.78GluAla: 8.78 ± 3.184
0.0GluCys: 0.0 ± 0.0
5.268GluAsp: 5.268 ± 1.798
7.902GluGlu: 7.902 ± 3.165
3.512GluPhe: 3.512 ± 2.163
3.512GluGly: 3.512 ± 1.491
0.0GluHis: 0.0 ± 0.0
2.634GluIle: 2.634 ± 1.482
3.512GluLys: 3.512 ± 1.144
2.634GluLeu: 2.634 ± 1.482
0.0GluMet: 0.0 ± 0.0
3.512GluAsn: 3.512 ± 1.821
2.634GluPro: 2.634 ± 1.326
0.878GluGln: 0.878 ± 0.616
1.756GluArg: 1.756 ± 0.948
2.634GluSer: 2.634 ± 1.262
3.512GluThr: 3.512 ± 2.857
2.634GluVal: 2.634 ± 1.5
2.634GluTrp: 2.634 ± 1.016
2.634GluTyr: 2.634 ± 1.15
0.0GluXaa: 0.0 ± 0.0
Phe
0.878PheAla: 0.878 ± 0.616
0.0PheCys: 0.0 ± 0.0
2.634PheAsp: 2.634 ± 0.877
1.756PheGlu: 1.756 ± 0.979
0.0PhePhe: 0.0 ± 0.0
0.878PheGly: 0.878 ± 0.814
0.878PheHis: 0.878 ± 0.616
3.512PheIle: 3.512 ± 1.624
2.634PheLys: 2.634 ± 1.482
6.146PheLeu: 6.146 ± 2.209
0.0PheMet: 0.0 ± 0.0
3.512PheAsn: 3.512 ± 2.764
3.512PhePro: 3.512 ± 1.475
3.512PheGln: 3.512 ± 1.475
1.756PheArg: 1.756 ± 1.162
3.512PheSer: 3.512 ± 2.278
6.146PheThr: 6.146 ± 1.996
1.756PheVal: 1.756 ± 1.088
0.878PheTrp: 0.878 ± 0.814
3.512PheTyr: 3.512 ± 2.163
0.0PheXaa: 0.0 ± 0.0
Gly
3.512GlyAla: 3.512 ± 1.713
0.878GlyCys: 0.878 ± 0.934
5.268GlyAsp: 5.268 ± 1.266
5.268GlyGlu: 5.268 ± 1.744
0.878GlyPhe: 0.878 ± 0.934
4.39GlyGly: 4.39 ± 2.277
0.878GlyHis: 0.878 ± 0.616
4.39GlyIle: 4.39 ± 3.191
4.39GlyLys: 4.39 ± 1.942
0.0GlyLeu: 0.0 ± 0.0
0.0GlyMet: 0.0 ± 0.0
0.878GlyAsn: 0.878 ± 1.168
4.39GlyPro: 4.39 ± 1.118
2.634GlyGln: 2.634 ± 1.326
5.268GlyArg: 5.268 ± 0.755
1.756GlySer: 1.756 ± 1.162
3.512GlyThr: 3.512 ± 1.619
6.146GlyVal: 6.146 ± 1.906
1.756GlyTrp: 1.756 ± 1.088
0.878GlyTyr: 0.878 ± 0.814
0.0GlyXaa: 0.0 ± 0.0
His
0.878HisAla: 0.878 ± 0.934
0.878HisCys: 0.878 ± 0.616
1.756HisAsp: 1.756 ± 1.233
0.878HisGlu: 0.878 ± 0.616
1.756HisPhe: 1.756 ± 0.812
0.878HisGly: 0.878 ± 0.934
0.878HisHis: 0.878 ± 0.934
0.878HisIle: 0.878 ± 0.934
0.878HisLys: 0.878 ± 0.616
6.146HisLeu: 6.146 ± 1.858
1.756HisMet: 1.756 ± 1.106
1.756HisAsn: 1.756 ± 1.233
1.756HisPro: 1.756 ± 1.233
2.634HisGln: 2.634 ± 2.108
3.512HisArg: 3.512 ± 2.999
0.0HisSer: 0.0 ± 0.0
0.878HisThr: 0.878 ± 0.616
1.756HisVal: 1.756 ± 0.979
0.878HisTrp: 0.878 ± 0.616
2.634HisTyr: 2.634 ± 0.917
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.878IleCys: 0.878 ± 0.616
1.756IleAsp: 1.756 ± 1.233
5.268IleGlu: 5.268 ± 2.248
4.39IlePhe: 4.39 ± 0.92
2.634IleGly: 2.634 ± 0.917
3.512IleHis: 3.512 ± 2.637
5.268IleIle: 5.268 ± 2.872
1.756IleLys: 1.756 ± 0.812
4.39IleLeu: 4.39 ± 1.118
0.878IleMet: 0.878 ± 0.934
2.634IleAsn: 2.634 ± 1.262
2.634IlePro: 2.634 ± 1.427
6.146IleGln: 6.146 ± 2.263
4.39IleArg: 4.39 ± 1.56
8.78IleSer: 8.78 ± 4.481
4.39IleThr: 4.39 ± 1.695
0.878IleVal: 0.878 ± 0.814
0.878IleTrp: 0.878 ± 1.072
2.634IleTyr: 2.634 ± 1.594
0.0IleXaa: 0.0 ± 0.0
Lys
2.634LysAla: 2.634 ± 2.441
0.878LysCys: 0.878 ± 0.934
7.024LysAsp: 7.024 ± 2.916
5.268LysGlu: 5.268 ± 3.261
1.756LysPhe: 1.756 ± 0.948
3.512LysGly: 3.512 ± 1.475
3.512LysHis: 3.512 ± 1.843
0.878LysIle: 0.878 ± 0.934
7.024LysLys: 7.024 ± 2.48
2.634LysLeu: 2.634 ± 0.917
1.756LysMet: 1.756 ± 0.812
1.756LysAsn: 1.756 ± 0.979
6.146LysPro: 6.146 ± 2.192
2.634LysGln: 2.634 ± 0.917
6.146LysArg: 6.146 ± 2.2
5.268LysSer: 5.268 ± 2.799
5.268LysThr: 5.268 ± 1.939
0.878LysVal: 0.878 ± 0.616
0.878LysTrp: 0.878 ± 0.814
4.39LysTyr: 4.39 ± 1.204
0.0LysXaa: 0.0 ± 0.0
Leu
7.024LeuAla: 7.024 ± 2.954
2.634LeuCys: 2.634 ± 1.15
5.268LeuAsp: 5.268 ± 1.337
3.512LeuGlu: 3.512 ± 1.491
3.512LeuPhe: 3.512 ± 1.431
4.39LeuGly: 4.39 ± 2.593
2.634LeuHis: 2.634 ± 1.299
3.512LeuIle: 3.512 ± 1.655
9.658LeuLys: 9.658 ± 1.858
1.756LeuLeu: 1.756 ± 1.162
1.756LeuMet: 1.756 ± 1.216
5.268LeuAsn: 5.268 ± 2.079
3.512LeuPro: 3.512 ± 2.677
4.39LeuGln: 4.39 ± 2.788
6.146LeuArg: 6.146 ± 2.376
5.268LeuSer: 5.268 ± 2.883
2.634LeuThr: 2.634 ± 0.917
2.634LeuVal: 2.634 ± 2.139
0.878LeuTrp: 0.878 ± 0.616
0.878LeuTyr: 0.878 ± 0.616
0.0LeuXaa: 0.0 ± 0.0
Met
1.756MetAla: 1.756 ± 1.088
0.878MetCys: 0.878 ± 1.168
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
2.634MetGly: 2.634 ± 1.349
1.756MetHis: 1.756 ± 1.216
0.878MetIle: 0.878 ± 1.072
2.634MetLys: 2.634 ± 1.194
1.756MetLeu: 1.756 ± 1.06
0.878MetMet: 0.878 ± 1.168
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.756MetSer: 1.756 ± 1.627
2.634MetThr: 2.634 ± 2.441
1.756MetVal: 1.756 ± 1.216
0.878MetTrp: 0.878 ± 0.814
2.634MetTyr: 2.634 ± 2.441
0.0MetXaa: 0.0 ± 0.0
Asn
1.756AsnAla: 1.756 ± 0.948
0.0AsnCys: 0.0 ± 0.0
4.39AsnAsp: 4.39 ± 1.846
1.756AsnGlu: 1.756 ± 1.233
2.634AsnPhe: 2.634 ± 1.549
2.634AsnGly: 2.634 ± 0.917
0.878AsnHis: 0.878 ± 0.959
4.39AsnIle: 4.39 ± 2.374
4.39AsnLys: 4.39 ± 1.495
7.902AsnLeu: 7.902 ± 3.907
0.878AsnMet: 0.878 ± 0.814
2.634AsnAsn: 2.634 ± 1.19
3.512AsnPro: 3.512 ± 1.368
0.878AsnGln: 0.878 ± 0.934
1.756AsnArg: 1.756 ± 1.216
2.634AsnSer: 2.634 ± 1.007
0.878AsnThr: 0.878 ± 0.934
2.634AsnVal: 2.634 ± 1.15
0.0AsnTrp: 0.0 ± 0.0
4.39AsnTyr: 4.39 ± 2.139
0.0AsnXaa: 0.0 ± 0.0
Pro
5.268ProAla: 5.268 ± 2.078
0.878ProCys: 0.878 ± 0.934
0.878ProAsp: 0.878 ± 1.072
1.756ProGlu: 1.756 ± 0.948
2.634ProPhe: 2.634 ± 1.15
3.512ProGly: 3.512 ± 1.401
3.512ProHis: 3.512 ± 2.466
4.39ProIle: 4.39 ± 1.851
3.512ProLys: 3.512 ± 1.717
5.268ProLeu: 5.268 ± 3.719
1.756ProMet: 1.756 ± 1.697
2.634ProAsn: 2.634 ± 1.19
4.39ProPro: 4.39 ± 2.354
1.756ProGln: 1.756 ± 0.948
3.512ProArg: 3.512 ± 1.084
7.024ProSer: 7.024 ± 2.572
4.39ProThr: 4.39 ± 1.043
1.756ProVal: 1.756 ± 2.335
0.878ProTrp: 0.878 ± 0.814
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
0.878GlnAla: 0.878 ± 0.616
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
2.634GlnGlu: 2.634 ± 1.914
3.512GlnPhe: 3.512 ± 1.713
0.878GlnGly: 0.878 ± 0.616
0.878GlnHis: 0.878 ± 0.934
4.39GlnIle: 4.39 ± 1.204
1.756GlnLys: 1.756 ± 1.132
3.512GlnLeu: 3.512 ± 1.947
0.878GlnMet: 0.878 ± 1.508
3.512GlnAsn: 3.512 ± 1.084
2.634GlnPro: 2.634 ± 2.801
0.878GlnGln: 0.878 ± 0.616
1.756GlnArg: 1.756 ± 1.088
2.634GlnSer: 2.634 ± 0.917
3.512GlnThr: 3.512 ± 1.368
0.878GlnVal: 0.878 ± 1.168
0.878GlnTrp: 0.878 ± 0.616
1.756GlnTyr: 1.756 ± 0.812
0.0GlnXaa: 0.0 ± 0.0
Arg
1.756ArgAla: 1.756 ± 1.162
0.0ArgCys: 0.0 ± 0.0
3.512ArgAsp: 3.512 ± 2.384
4.39ArgGlu: 4.39 ± 1.695
4.39ArgPhe: 4.39 ± 2.007
3.512ArgGly: 3.512 ± 1.401
3.512ArgHis: 3.512 ± 1.122
6.146ArgIle: 6.146 ± 2.117
1.756ArgLys: 1.756 ± 1.088
2.634ArgLeu: 2.634 ± 1.894
0.0ArgMet: 0.0 ± 0.0
0.878ArgAsn: 0.878 ± 0.814
2.634ArgPro: 2.634 ± 1.504
1.756ArgGln: 1.756 ± 0.979
4.39ArgArg: 4.39 ± 2.352
7.024ArgSer: 7.024 ± 0.807
5.268ArgThr: 5.268 ± 1.1
6.146ArgVal: 6.146 ± 2.21
0.878ArgTrp: 0.878 ± 0.814
1.756ArgTyr: 1.756 ± 1.428
0.0ArgXaa: 0.0 ± 0.0
Ser
5.268SerAla: 5.268 ± 3.698
0.878SerCys: 0.878 ± 0.814
2.634SerAsp: 2.634 ± 1.504
0.878SerGlu: 0.878 ± 0.616
3.512SerPhe: 3.512 ± 1.475
3.512SerGly: 3.512 ± 1.591
0.878SerHis: 0.878 ± 0.959
6.146SerIle: 6.146 ± 2.266
4.39SerLys: 4.39 ± 2.06
7.902SerLeu: 7.902 ± 2.673
3.512SerMet: 3.512 ± 2.432
2.634SerAsn: 2.634 ± 1.349
5.268SerPro: 5.268 ± 1.804
1.756SerGln: 1.756 ± 1.627
6.146SerArg: 6.146 ± 1.279
4.39SerSer: 4.39 ± 1.798
5.268SerThr: 5.268 ± 2.675
2.634SerVal: 2.634 ± 1.5
2.634SerTrp: 2.634 ± 0.917
1.756SerTyr: 1.756 ± 1.233
0.0SerXaa: 0.0 ± 0.0
Thr
2.634ThrAla: 2.634 ± 1.262
0.0ThrCys: 0.0 ± 0.0
1.756ThrAsp: 1.756 ± 1.162
3.512ThrGlu: 3.512 ± 1.07
1.756ThrPhe: 1.756 ± 0.812
6.146ThrGly: 6.146 ± 1.893
6.146ThrHis: 6.146 ± 2.307
6.146ThrIle: 6.146 ± 3.041
2.634ThrLys: 2.634 ± 0.917
3.512ThrLeu: 3.512 ± 1.624
1.756ThrMet: 1.756 ± 1.627
2.634ThrAsn: 2.634 ± 1.19
5.268ThrPro: 5.268 ± 1.874
2.634ThrGln: 2.634 ± 1.349
3.512ThrArg: 3.512 ± 1.717
5.268ThrSer: 5.268 ± 3.574
3.512ThrThr: 3.512 ± 1.907
3.512ThrVal: 3.512 ± 1.624
0.878ThrTrp: 0.878 ± 0.814
0.878ThrTyr: 0.878 ± 0.934
0.0ThrXaa: 0.0 ± 0.0
Val
3.512ValAla: 3.512 ± 2.384
1.756ValCys: 1.756 ± 1.162
1.756ValAsp: 1.756 ± 1.132
4.39ValGlu: 4.39 ± 2.365
1.756ValPhe: 1.756 ± 1.162
1.756ValGly: 1.756 ± 1.06
0.878ValHis: 0.878 ± 0.814
3.512ValIle: 3.512 ± 2.764
2.634ValLys: 2.634 ± 1.504
2.634ValLeu: 2.634 ± 0.917
0.878ValMet: 0.878 ± 0.551
4.39ValAsn: 4.39 ± 2.41
2.634ValPro: 2.634 ± 1.482
1.756ValGln: 1.756 ± 0.948
2.634ValArg: 2.634 ± 1.185
0.878ValSer: 0.878 ± 0.814
2.634ValThr: 2.634 ± 2.241
1.756ValVal: 1.756 ± 1.319
0.0ValTrp: 0.0 ± 0.0
3.512ValTyr: 3.512 ± 1.491
0.0ValXaa: 0.0 ± 0.0
Trp
0.878TrpAla: 0.878 ± 0.616
0.0TrpCys: 0.0 ± 0.0
1.756TrpAsp: 1.756 ± 1.319
0.878TrpGlu: 0.878 ± 0.814
0.878TrpPhe: 0.878 ± 0.959
0.878TrpGly: 0.878 ± 0.616
0.0TrpHis: 0.0 ± 0.0
1.756TrpIle: 1.756 ± 1.088
2.634TrpLys: 2.634 ± 1.504
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.756TrpAsn: 1.756 ± 1.162
0.0TrpPro: 0.0 ± 0.0
1.756TrpGln: 1.756 ± 0.812
1.756TrpArg: 1.756 ± 1.162
0.878TrpSer: 0.878 ± 0.814
2.634TrpThr: 2.634 ± 0.877
0.878TrpVal: 0.878 ± 0.616
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.0TyrCys: 0.0 ± 0.0
1.756TyrAsp: 1.756 ± 0.812
3.512TyrGlu: 3.512 ± 2.393
0.878TyrPhe: 0.878 ± 0.814
2.634TyrGly: 2.634 ± 1.349
1.756TyrHis: 1.756 ± 1.233
1.756TyrIle: 1.756 ± 1.233
4.39TyrLys: 4.39 ± 1.273
6.146TyrLeu: 6.146 ± 1.864
1.756TyrMet: 1.756 ± 1.003
3.512TyrAsn: 3.512 ± 1.907
0.0TyrPro: 0.0 ± 0.0
0.878TyrGln: 0.878 ± 0.814
1.756TyrArg: 1.756 ± 0.979
2.634TyrSer: 2.634 ± 2.082
2.634TyrThr: 2.634 ± 1.15
1.756TyrVal: 1.756 ± 1.627
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1140 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
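A protein-level bootstrap can be sketched as follows. This is a minimal illustration, not the database's code: it assumes that "×100" means 100 resamplings, that whole proteins are drawn with replacement, and that the reported error is the standard deviation of the frequency across replicates. The function names and the fixed seed are hypothetical.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MAAK", "GGGG", "AAAA"]  # hypothetical toy sequences, not the viral proteome
print(dipeptide_frequency(toy, "AA"), bootstrap_error(toy, "AA"))
```

Under these assumptions, a dipeptide that is absent from every protein has a frequency of 0.0 in every replicate and hence an error of 0.0, which is consistent with the "0.0 ± 0.0" entries in the table.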

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski