Amino acid dipepetide frequency for Leopards Hill virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.881AlaAla: 2.881 ± 2.605
2.373AlaCys: 2.373 ± 0.438
2.542AlaAsp: 2.542 ± 0.285
5.932AlaGlu: 5.932 ± 0.715
1.864AlaPhe: 1.864 ± 0.401
2.712AlaGly: 2.712 ± 1.282
1.017AlaHis: 1.017 ± 0.394
3.729AlaIle: 3.729 ± 0.892
3.898AlaLys: 3.898 ± 1.665
3.22AlaLeu: 3.22 ± 0.177
0.508AlaMet: 0.508 ± 0.309
2.203AlaAsn: 2.203 ± 1.156
2.034AlaPro: 2.034 ± 0.396
1.695AlaGln: 1.695 ± 0.689
2.373AlaArg: 2.373 ± 0.312
3.22AlaSer: 3.22 ± 1.429
2.542AlaThr: 2.542 ± 0.659
3.729AlaVal: 3.729 ± 0.479
0.847AlaTrp: 0.847 ± 0.633
1.864AlaTyr: 1.864 ± 0.183
0.0AlaXaa: 0.0 ± 0.0
Cys
1.525CysAla: 1.525 ± 0.507
1.017CysCys: 1.017 ± 0.297
1.017CysAsp: 1.017 ± 0.694
0.678CysGlu: 0.678 ± 0.314
1.525CysPhe: 1.525 ± 0.087
1.356CysGly: 1.356 ± 1.112
0.678CysHis: 0.678 ± 0.283
1.525CysIle: 1.525 ± 0.56
1.525CysLys: 1.525 ± 0.864
3.22CysLeu: 3.22 ± 0.407
0.508CysMet: 0.508 ± 0.111
1.356CysAsn: 1.356 ± 1.112
1.864CysPro: 1.864 ± 0.656
1.525CysGln: 1.525 ± 0.087
1.525CysArg: 1.525 ± 0.357
1.356CysSer: 1.356 ± 0.14
1.864CysThr: 1.864 ± 1.18
0.847CysVal: 0.847 ± 0.454
0.847CysTrp: 0.847 ± 0.486
0.339CysTyr: 0.339 ± 0.421
0.0CysXaa: 0.0 ± 0.0
Asp
2.373AspAla: 2.373 ± 0.72
1.525AspCys: 1.525 ± 0.334
3.051AspAsp: 3.051 ± 0.802
2.712AspGlu: 2.712 ± 1.189
1.356AspPhe: 1.356 ± 0.727
3.051AspGly: 3.051 ± 0.293
0.847AspHis: 0.847 ± 0.393
3.22AspIle: 3.22 ± 0.59
3.729AspLys: 3.729 ± 0.365
5.593AspLeu: 5.593 ± 0.063
1.525AspMet: 1.525 ± 0.897
2.712AspAsn: 2.712 ± 0.504
2.373AspPro: 2.373 ± 0.449
1.017AspGln: 1.017 ± 0.297
1.695AspArg: 1.695 ± 0.349
4.237AspSer: 4.237 ± 0.905
2.373AspThr: 2.373 ± 0.751
4.576AspVal: 4.576 ± 0.872
0.847AspTrp: 0.847 ± 0.216
1.695AspTyr: 1.695 ± 0.432
0.0AspXaa: 0.0 ± 0.0
Glu
4.237GluAla: 4.237 ± 0.619
1.525GluCys: 1.525 ± 0.291
3.729GluAsp: 3.729 ± 0.656
7.119GluGlu: 7.119 ± 0.518
2.542GluPhe: 2.542 ± 0.356
4.237GluGly: 4.237 ± 0.781
1.695GluHis: 1.695 ± 0.778
4.407GluIle: 4.407 ± 0.882
5.593GluLys: 5.593 ± 0.46
8.644GluLeu: 8.644 ± 2.986
1.864GluMet: 1.864 ± 0.636
2.034GluAsn: 2.034 ± 0.61
1.695GluPro: 1.695 ± 0.349
2.712GluGln: 2.712 ± 0.585
2.712GluArg: 2.712 ± 0.314
5.424GluSer: 5.424 ± 0.484
3.39GluThr: 3.39 ± 0.698
6.441GluVal: 6.441 ± 1.262
0.508GluTrp: 0.508 ± 0.111
1.356GluTyr: 1.356 ± 0.252
0.0GluXaa: 0.0 ± 0.0
Phe
1.356PheAla: 1.356 ± 0.14
0.847PheCys: 0.847 ± 0.238
1.864PheAsp: 1.864 ± 0.111
3.051PheGlu: 3.051 ± 0.594
2.373PhePhe: 2.373 ± 0.486
2.203PheGly: 2.203 ± 1.108
0.847PheHis: 0.847 ± 0.238
1.695PheIle: 1.695 ± 0.393
3.39PheLys: 3.39 ± 1.335
4.576PheLeu: 4.576 ± 0.845
0.678PheMet: 0.678 ± 0.363
3.051PheAsn: 3.051 ± 0.918
1.186PhePro: 1.186 ± 0.555
2.034PheGln: 2.034 ± 0.031
1.695PheArg: 1.695 ± 0.432
5.254PheSer: 5.254 ± 0.148
3.051PheThr: 3.051 ± 0.315
1.186PheVal: 1.186 ± 0.376
0.169PheTrp: 0.169 ± 0.091
1.525PheTyr: 1.525 ± 0.357
0.0PheXaa: 0.0 ± 0.0
Gly
2.712GlyAla: 2.712 ± 1.136
1.695GlyCys: 1.695 ± 1.249
4.068GlyAsp: 4.068 ± 0.063
2.712GlyGlu: 2.712 ± 1.09
1.525GlyPhe: 1.525 ± 0.291
2.203GlyGly: 2.203 ± 0.239
1.017GlyHis: 1.017 ± 0.297
4.237GlyIle: 4.237 ± 0.828
4.915GlyLys: 4.915 ± 0.882
4.237GlyLeu: 4.237 ± 0.268
1.525GlyMet: 1.525 ± 0.317
2.881GlyAsn: 2.881 ± 0.672
1.017GlyPro: 1.017 ± 0.223
1.695GlyGln: 1.695 ± 0.488
2.712GlyArg: 2.712 ± 0.762
4.915GlySer: 4.915 ± 0.691
4.746GlyThr: 4.746 ± 1.56
3.051GlyVal: 3.051 ± 0.581
0.508GlyTrp: 0.508 ± 0.309
0.508GlyTyr: 0.508 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
1.186HisAla: 1.186 ± 0.383
0.508HisCys: 0.508 ± 0.347
0.508HisAsp: 0.508 ± 0.355
1.186HisGlu: 1.186 ± 0.376
1.356HisPhe: 1.356 ± 0.344
1.525HisGly: 1.525 ± 0.334
0.847HisHis: 0.847 ± 0.238
1.356HisIle: 1.356 ± 0.381
1.186HisLys: 1.186 ± 0.243
2.373HisLeu: 2.373 ± 0.992
0.508HisMet: 0.508 ± 0.309
0.847HisAsn: 0.847 ± 0.216
0.847HisPro: 0.847 ± 0.227
0.508HisGln: 0.508 ± 0.355
1.525HisArg: 1.525 ± 0.357
1.186HisSer: 1.186 ± 0.376
1.186HisThr: 1.186 ± 0.243
1.695HisVal: 1.695 ± 0.476
0.169HisTrp: 0.169 ± 0.091
0.847HisTyr: 0.847 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
3.559IleAla: 3.559 ± 0.865
1.017IleCys: 1.017 ± 0.545
1.695IleAsp: 1.695 ± 0.778
5.763IleGlu: 5.763 ± 0.435
2.373IlePhe: 2.373 ± 0.167
1.864IleGly: 1.864 ± 0.947
1.356IleHis: 1.356 ± 0.292
3.051IleIle: 3.051 ± 0.819
4.576IleLys: 4.576 ± 1.373
6.441IleLeu: 6.441 ± 0.928
2.034IleMet: 2.034 ± 0.266
4.068IleAsn: 4.068 ± 0.42
1.356IlePro: 1.356 ± 0.14
2.542IleGln: 2.542 ± 0.854
2.203IleArg: 2.203 ± 0.527
5.085IleSer: 5.085 ± 1.296
3.898IleThr: 3.898 ± 1.009
4.237IleVal: 4.237 ± 0.817
0.847IleTrp: 0.847 ± 0.653
1.695IleTyr: 1.695 ± 0.853
0.0IleXaa: 0.0 ± 0.0
Lys
3.051LysAla: 3.051 ± 0.363
0.678LysCys: 0.678 ± 0.556
4.407LysAsp: 4.407 ± 0.699
4.915LysGlu: 4.915 ± 1.26
2.373LysPhe: 2.373 ± 1.272
5.932LysGly: 5.932 ± 1.035
1.356LysHis: 1.356 ± 0.567
4.915LysIle: 4.915 ± 1.004
6.949LysLys: 6.949 ± 1.4
7.458LysLeu: 7.458 ± 0.74
1.525LysMet: 1.525 ± 0.763
3.39LysAsn: 3.39 ± 0.785
3.898LysPro: 3.898 ± 0.447
1.186LysGln: 1.186 ± 0.208
3.729LysArg: 3.729 ± 0.797
5.424LysSer: 5.424 ± 1.537
3.898LysThr: 3.898 ± 0.135
7.627LysVal: 7.627 ± 1.874
0.508LysTrp: 0.508 ± 0.272
2.203LysTyr: 2.203 ± 0.746
0.0LysXaa: 0.0 ± 0.0
Leu
4.068LeuAla: 4.068 ± 0.52
2.034LeuCys: 2.034 ± 0.446
5.932LeuAsp: 5.932 ± 1.315
6.271LeuGlu: 6.271 ± 1.638
3.898LeuPhe: 3.898 ± 0.156
6.102LeuGly: 6.102 ± 0.752
2.373LeuHis: 2.373 ± 0.449
4.915LeuIle: 4.915 ± 1.62
7.627LeuLys: 7.627 ± 0.533
10.508LeuLeu: 10.508 ± 2.639
2.373LeuMet: 2.373 ± 0.766
6.78LeuAsn: 6.78 ± 0.713
4.407LeuPro: 4.407 ± 0.157
3.559LeuGln: 3.559 ± 0.78
4.407LeuArg: 4.407 ± 1.59
9.661LeuSer: 9.661 ± 1.019
6.271LeuThr: 6.271 ± 0.923
6.949LeuVal: 6.949 ± 0.729
0.847LeuTrp: 0.847 ± 0.393
2.881LeuTyr: 2.881 ± 0.646
0.0LeuXaa: 0.0 ± 0.0
Met
1.525MetAla: 1.525 ± 0.603
1.356MetCys: 1.356 ± 0.527
1.695MetAsp: 1.695 ± 0.455
2.203MetGlu: 2.203 ± 0.239
0.847MetPhe: 0.847 ± 0.216
0.678MetGly: 0.678 ± 0.146
0.169MetHis: 0.169 ± 0.372
1.356MetIle: 1.356 ± 0.471
1.186MetLys: 1.186 ± 0.456
2.203MetLeu: 2.203 ± 0.746
1.356MetMet: 1.356 ± 0.727
0.847MetAsn: 0.847 ± 0.216
1.017MetPro: 1.017 ± 0.545
1.017MetGln: 1.017 ± 0.198
1.695MetArg: 1.695 ± 0.432
2.203MetSer: 2.203 ± 0.812
0.847MetThr: 0.847 ± 0.216
1.017MetVal: 1.017 ± 0.599
0.0MetTrp: 0.0 ± 0.0
0.169MetTyr: 0.169 ± 0.091
0.0MetXaa: 0.0 ± 0.0
Asn
1.695AsnAla: 1.695 ± 0.572
1.695AsnCys: 1.695 ± 0.2
1.864AsnAsp: 1.864 ± 0.947
2.881AsnGlu: 2.881 ± 0.438
2.203AsnPhe: 2.203 ± 0.395
2.034AsnGly: 2.034 ± 0.396
1.525AsnHis: 1.525 ± 0.56
3.898AsnIle: 3.898 ± 0.708
3.729AsnLys: 3.729 ± 0.897
6.441AsnLeu: 6.441 ± 0.401
0.847AsnMet: 0.847 ± 0.227
3.559AsnAsn: 3.559 ± 0.362
2.034AsnPro: 2.034 ± 1.125
1.186AsnGln: 1.186 ± 0.472
3.051AsnArg: 3.051 ± 0.293
4.576AsnSer: 4.576 ± 1.234
2.034AsnThr: 2.034 ± 0.782
2.542AsnVal: 2.542 ± 0.356
1.695AsnTrp: 1.695 ± 0.572
1.695AsnTyr: 1.695 ± 0.349
0.0AsnXaa: 0.0 ± 0.0
Pro
2.034ProAla: 2.034 ± 1.198
0.847ProCys: 0.847 ± 0.227
2.712ProAsp: 2.712 ± 0.688
2.881ProGlu: 2.881 ± 0.709
1.695ProPhe: 1.695 ± 0.878
2.203ProGly: 2.203 ± 0.239
0.169ProHis: 0.169 ± 0.091
2.542ProIle: 2.542 ± 0.682
2.542ProLys: 2.542 ± 0.529
2.542ProLeu: 2.542 ± 0.557
0.508ProMet: 0.508 ± 0.272
1.356ProAsn: 1.356 ± 0.252
0.678ProPro: 0.678 ± 0.314
1.186ProGln: 1.186 ± 1.04
1.525ProArg: 1.525 ± 0.56
2.881ProSer: 2.881 ± 1.03
2.034ProThr: 2.034 ± 0.318
2.542ProVal: 2.542 ± 0.947
0.339ProTrp: 0.339 ± 0.142
1.525ProTyr: 1.525 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
2.373GlnAla: 2.373 ± 0.442
0.678GlnCys: 0.678 ± 0.283
1.864GlnAsp: 1.864 ± 0.473
2.542GlnGlu: 2.542 ± 0.532
1.186GlnPhe: 1.186 ± 0.208
1.695GlnGly: 1.695 ± 0.2
0.678GlnHis: 0.678 ± 0.363
2.542GlnIle: 2.542 ± 1.237
1.356GlnLys: 1.356 ± 0.14
3.898GlnLeu: 3.898 ± 0.929
1.186GlnMet: 1.186 ± 0.383
1.356GlnAsn: 1.356 ± 1.143
0.508GlnPro: 0.508 ± 0.111
1.864GlnGln: 1.864 ± 0.416
2.712GlnArg: 2.712 ± 0.762
2.203GlnSer: 2.203 ± 0.523
2.373GlnThr: 2.373 ± 0.173
1.695GlnVal: 1.695 ± 0.649
0.678GlnTrp: 0.678 ± 0.284
1.356GlnTyr: 1.356 ± 0.471
0.0GlnXaa: 0.0 ± 0.0
Arg
2.203ArgAla: 2.203 ± 0.361
1.186ArgCys: 1.186 ± 0.219
2.203ArgAsp: 2.203 ± 0.918
2.881ArgGlu: 2.881 ± 0.217
2.203ArgPhe: 2.203 ± 0.458
2.034ArgGly: 2.034 ± 0.031
1.356ArgHis: 1.356 ± 0.344
2.542ArgIle: 2.542 ± 0.529
4.237ArgLys: 4.237 ± 0.368
5.254ArgLeu: 5.254 ± 0.238
1.695ArgMet: 1.695 ± 0.908
2.712ArgAsn: 2.712 ± 0.707
1.695ArgPro: 1.695 ± 0.2
1.864ArgGln: 1.864 ± 0.738
3.22ArgArg: 3.22 ± 0.796
5.085ArgSer: 5.085 ± 0.695
2.542ArgThr: 2.542 ± 0.529
2.712ArgVal: 2.712 ± 1.395
0.339ArgTrp: 0.339 ± 0.182
1.356ArgTyr: 1.356 ± 0.14
0.0ArgXaa: 0.0 ± 0.0
Ser
5.932SerAla: 5.932 ± 1.039
2.712SerCys: 2.712 ± 0.87
4.068SerAsp: 4.068 ± 1.115
5.932SerGlu: 5.932 ± 1.25
5.254SerPhe: 5.254 ± 0.983
4.576SerGly: 4.576 ± 0.407
1.695SerHis: 1.695 ± 0.349
5.763SerIle: 5.763 ± 0.393
5.932SerLys: 5.932 ± 1.236
8.475SerLeu: 8.475 ± 0.971
1.356SerMet: 1.356 ± 0.568
4.407SerAsn: 4.407 ± 0.421
2.542SerPro: 2.542 ± 0.377
2.373SerGln: 2.373 ± 0.646
3.898SerArg: 3.898 ± 0.806
9.831SerSer: 9.831 ± 1.664
3.39SerThr: 3.39 ± 0.374
6.949SerVal: 6.949 ± 1.444
1.356SerTrp: 1.356 ± 0.736
2.203SerTyr: 2.203 ± 1.181
0.0SerXaa: 0.0 ± 0.0
Thr
3.559ThrAla: 3.559 ± 1.728
1.525ThrCys: 1.525 ± 0.768
2.881ThrAsp: 2.881 ± 1.03
5.254ThrGlu: 5.254 ± 1.215
2.203ThrPhe: 2.203 ± 0.501
4.068ThrGly: 4.068 ± 0.926
1.525ThrHis: 1.525 ± 1.041
2.203ThrIle: 2.203 ± 0.581
3.22ThrLys: 3.22 ± 0.619
4.237ThrLeu: 4.237 ± 0.673
1.017ThrMet: 1.017 ± 0.394
1.864ThrAsn: 1.864 ± 0.692
2.373ThrPro: 2.373 ± 0.438
1.356ThrGln: 1.356 ± 0.567
2.712ThrArg: 2.712 ± 1.09
7.119ThrSer: 7.119 ± 1.773
3.559ThrThr: 3.559 ± 1.616
3.051ThrVal: 3.051 ± 0.724
1.017ThrTrp: 1.017 ± 0.563
1.525ThrTyr: 1.525 ± 0.334
0.0ThrXaa: 0.0 ± 0.0
Val
3.051ValAla: 3.051 ± 0.587
1.356ValCys: 1.356 ± 0.567
3.051ValAsp: 3.051 ± 0.315
3.729ValGlu: 3.729 ± 0.658
3.051ValPhe: 3.051 ± 0.175
2.203ValGly: 2.203 ± 0.527
1.017ValHis: 1.017 ± 0.297
4.237ValIle: 4.237 ± 0.268
6.61ValLys: 6.61 ± 1.236
7.966ValLeu: 7.966 ± 0.949
1.186ValMet: 1.186 ± 0.376
3.559ValAsn: 3.559 ± 0.657
2.034ValPro: 2.034 ± 0.782
3.22ValGln: 3.22 ± 0.977
2.712ValArg: 2.712 ± 0.942
6.271ValSer: 6.271 ± 1.501
4.237ValThr: 4.237 ± 1.081
5.254ValVal: 5.254 ± 1.816
0.678ValTrp: 0.678 ± 0.314
2.373ValTyr: 2.373 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
0.508TrpAla: 0.508 ± 1.116
0.339TrpCys: 0.339 ± 0.142
0.339TrpAsp: 0.339 ± 0.182
1.186TrpGlu: 1.186 ± 0.627
0.678TrpPhe: 0.678 ± 0.868
1.186TrpGly: 1.186 ± 0.472
0.169TrpHis: 0.169 ± 0.372
0.339TrpIle: 0.339 ± 0.182
1.017TrpLys: 1.017 ± 0.545
1.695TrpLeu: 1.695 ± 0.572
0.339TrpMet: 0.339 ± 0.434
0.169TrpAsn: 0.169 ± 0.091
0.508TrpPro: 0.508 ± 0.272
0.508TrpGln: 0.508 ± 0.347
1.356TrpArg: 1.356 ± 0.524
0.678TrpSer: 0.678 ± 0.283
0.678TrpThr: 0.678 ± 0.284
0.678TrpVal: 0.678 ± 0.363
0.169TrpTrp: 0.169 ± 0.091
0.339TrpTyr: 0.339 ± 0.142
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.356TyrAla: 1.356 ± 0.344
1.186TyrCys: 1.186 ± 0.627
1.017TyrAsp: 1.017 ± 0.297
2.203TyrGlu: 2.203 ± 0.847
1.525TyrPhe: 1.525 ± 0.334
1.017TyrGly: 1.017 ± 0.599
1.017TyrHis: 1.017 ± 0.425
1.186TyrIle: 1.186 ± 0.636
2.203TyrLys: 2.203 ± 0.501
2.881TyrLeu: 2.881 ± 0.438
0.678TyrMet: 0.678 ± 0.314
2.034TyrAsn: 2.034 ± 0.318
0.678TyrPro: 0.678 ± 0.146
1.695TyrGln: 1.695 ± 0.853
1.864TyrArg: 1.864 ± 0.512
2.034TyrSer: 2.034 ± 0.595
1.356TyrThr: 1.356 ± 0.381
1.017TyrVal: 1.017 ± 0.305
0.508TyrTrp: 0.508 ± 0.309
0.678TyrTyr: 0.678 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5901 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski