Amino acid dipeptide frequency for Nyamanini virus (isolate Tick/Thailand/39/1968)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
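As an illustration of the idea, here is a minimal Python sketch of how dipeptide frequencies could be counted. It assumes overlapping pairs of adjacent residues, pairs that never span two proteins, and normalization by the total number of counted dipeptides; the exact procedure used for this page is not stated here, and the function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein and normalized by the total number of counted dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of two residues along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (not the Nyamanini virus proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The sketch uses one-letter codes for brevity; the table on this page uses three-letter codes (for example, KK corresponds to LysLys).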

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
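The comparison against the uniform expectation can be sketched as follows. This is only an illustration: it assumes that "more than expected" means above 1/400 (2.5‰) and ignores the reported error; the threshold actually used for the colouring is not stated here, and `classify_dipeptide` is a hypothetical helper.

```python
def classify_dipeptide(freq_permille, n_residue_types=20):
    """Compare a per-mille frequency with the uniform expectation.

    With 20 amino acids there are 20 * 20 = 400 dipeptides, so the
    uniform expectation is 1000 / 400 = 2.5 per mille.
    """
    expected = 1000.0 / (n_residue_types ** 2)
    if freq_permille > expected:
        return "over"   # would be shown in red
    if freq_permille < expected:
        return "under"  # would be shown in blue
    return "as expected"


# Two values taken from the table on this page.
print("AlaLeu", classify_dipeptide(11.382))
print("CysAla", classify_dipeptide(0.271))
```

Passing `n_residue_types=21` gives the expectation for the 441-dipeptide case that includes Xaa.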

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.
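For example, the unit conversion can be written as a short Python sketch (the helper names are hypothetical; the AlaLeu value is taken from the table on this page):

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_leu = 11.382  # AlaLeu entry from the table, in per mille
print(permille_to_fraction(ala_leu))
print(permille_to_percent(ala_leu))
```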

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.233 ± 0.902
AlaCys: 1.355 ± 0.478
AlaAsp: 2.71 ± 0.756
AlaGlu: 4.065 ± 1.159
AlaPhe: 1.355 ± 0.867
AlaGly: 4.878 ± 0.396
AlaHis: 1.626 ± 0.447
AlaIle: 2.71 ± 0.576
AlaLys: 3.523 ± 1.293
AlaLeu: 11.382 ± 2.035
AlaMet: 2.439 ± 0.637
AlaAsn: 2.168 ± 0.739
AlaPro: 3.794 ± 0.722
AlaGln: 2.439 ± 0.761
AlaArg: 4.607 ± 1.402
AlaSer: 4.878 ± 1.127
AlaThr: 4.065 ± 0.87
AlaVal: 4.607 ± 1.102
AlaTrp: 1.626 ± 0.41
AlaTyr: 1.897 ± 0.896
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.271 ± 0.164
CysCys: 0.271 ± 0.164
CysAsp: 0.271 ± 0.164
CysGlu: 1.355 ± 0.489
CysPhe: 1.084 ± 0.562
CysGly: 0.542 ± 0.243
CysHis: 0.813 ± 0.493
CysIle: 1.084 ± 0.504
CysLys: 1.626 ± 0.544
CysLeu: 2.981 ± 1.462
CysMet: 0.0 ± 0.0
CysAsn: 0.813 ± 0.272
CysPro: 1.084 ± 0.344
CysGln: 2.439 ± 0.816
CysArg: 0.542 ± 0.315
CysSer: 2.168 ± 0.422
CysThr: 1.355 ± 0.479
CysVal: 0.813 ± 0.348
CysTrp: 0.271 ± 0.164
CysTyr: 1.084 ± 0.378
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.439 ± 0.61
AspCys: 1.355 ± 0.476
AspAsp: 1.084 ± 0.715
AspGlu: 3.252 ± 1.453
AspPhe: 1.626 ± 0.816
AspGly: 0.813 ± 0.535
AspHis: 2.439 ± 1.139
AspIle: 1.084 ± 0.42
AspLys: 3.523 ± 1.374
AspLeu: 8.13 ± 1.235
AspMet: 0.542 ± 0.667
AspAsn: 1.355 ± 0.59
AspPro: 3.523 ± 0.921
AspGln: 2.168 ± 0.95
AspArg: 4.336 ± 1.223
AspSer: 2.439 ± 0.578
AspThr: 1.897 ± 0.5
AspVal: 0.542 ± 0.329
AspTrp: 0.813 ± 0.322
AspTyr: 2.439 ± 0.537
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.775 ± 0.911
GluCys: 1.084 ± 0.657
GluAsp: 4.878 ± 1.996
GluGlu: 14.092 ± 7.896
GluPhe: 2.168 ± 0.529
GluGly: 4.878 ± 2.01
GluHis: 1.084 ± 0.378
GluIle: 4.065 ± 1.509
GluLys: 8.401 ± 6.754
GluLeu: 7.317 ± 1.366
GluMet: 1.626 ± 0.604
GluAsn: 0.813 ± 0.34
GluPro: 2.981 ± 0.547
GluGln: 1.897 ± 0.468
GluArg: 7.859 ± 3.849
GluSer: 5.691 ± 1.281
GluThr: 3.523 ± 1.946
GluVal: 4.607 ± 1.161
GluTrp: 0.271 ± 0.164
GluTyr: 2.71 ± 0.617
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.626 ± 1.131
PheCys: 0.542 ± 0.329
PheAsp: 0.813 ± 0.272
PheGlu: 1.626 ± 0.578
PhePhe: 0.813 ± 0.272
PheGly: 1.626 ± 0.41
PheHis: 0.271 ± 0.31
PheIle: 1.897 ± 0.496
PheLys: 1.626 ± 0.447
PheLeu: 3.794 ± 0.943
PheMet: 0.271 ± 0.313
PheAsn: 0.813 ± 0.493
PhePro: 1.355 ± 0.479
PheGln: 0.271 ± 0.313
PheArg: 1.897 ± 0.557
PheSer: 3.252 ± 0.373
PheThr: 1.355 ± 0.516
PheVal: 0.813 ± 0.351
PheTrp: 0.542 ± 0.243
PheTyr: 0.542 ± 0.319
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.252 ± 1.145
GlyCys: 1.897 ± 0.638
GlyAsp: 3.523 ± 1.638
GlyGlu: 2.981 ± 1.341
GlyPhe: 1.084 ± 0.657
GlyGly: 3.523 ± 0.801
GlyHis: 1.355 ± 0.744
GlyIle: 1.626 ± 0.644
GlyLys: 3.523 ± 1.101
GlyLeu: 6.775 ± 0.863
GlyMet: 1.897 ± 0.458
GlyAsn: 1.084 ± 0.629
GlyPro: 4.336 ± 0.927
GlyGln: 2.71 ± 1.213
GlyArg: 3.523 ± 0.885
GlySer: 5.149 ± 0.522
GlyThr: 3.252 ± 0.615
GlyVal: 2.981 ± 0.809
GlyTrp: 1.355 ± 0.454
GlyTyr: 0.813 ± 0.272
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.084 ± 0.378
HisCys: 1.355 ± 0.516
HisAsp: 1.897 ± 0.586
HisGlu: 2.168 ± 0.728
HisPhe: 0.271 ± 0.164
HisGly: 2.71 ± 1.005
HisHis: 1.355 ± 0.316
HisIle: 0.813 ± 0.272
HisLys: 1.355 ± 0.68
HisLeu: 3.252 ± 1.625
HisMet: 0.271 ± 0.313
HisAsn: 1.084 ± 0.657
HisPro: 1.626 ± 0.59
HisGln: 0.813 ± 0.345
HisArg: 1.626 ± 0.338
HisSer: 1.626 ± 0.544
HisThr: 0.542 ± 0.243
HisVal: 0.271 ± 0.164
HisTrp: 0.542 ± 0.329
HisTyr: 0.813 ± 0.272
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.252 ± 0.634
IleCys: 1.355 ± 0.821
IleAsp: 1.084 ± 0.439
IleGlu: 2.168 ± 0.455
IlePhe: 1.897 ± 0.888
IleGly: 2.71 ± 0.689
IleHis: 1.084 ± 0.584
IleIle: 1.626 ± 0.423
IleLys: 2.981 ± 0.91
IleLeu: 4.065 ± 0.305
IleMet: 0.271 ± 0.313
IleAsn: 1.626 ± 0.768
IlePro: 1.626 ± 0.666
IleGln: 1.355 ± 0.652
IleArg: 2.168 ± 0.739
IleSer: 4.065 ± 1.12
IleThr: 2.71 ± 0.753
IleVal: 2.439 ± 0.74
IleTrp: 0.813 ± 0.426
IleTyr: 0.813 ± 0.535
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.252 ± 1.364
LysCys: 0.542 ± 0.243
LysAsp: 2.981 ± 0.879
LysGlu: 8.13 ± 5.486
LysPhe: 1.355 ± 0.602
LysGly: 3.794 ± 0.719
LysHis: 1.355 ± 0.639
LysIle: 1.626 ± 0.538
LysLys: 4.878 ± 1.975
LysLeu: 3.794 ± 1.76
LysMet: 1.355 ± 0.479
LysAsn: 1.626 ± 0.728
LysPro: 2.168 ± 0.644
LysGln: 2.168 ± 0.431
LysArg: 6.775 ± 2.758
LysSer: 2.71 ± 1.145
LysThr: 2.71 ± 0.947
LysVal: 2.71 ± 0.729
LysTrp: 0.813 ± 0.668
LysTyr: 1.355 ± 0.77
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.027 ± 1.94
LeuCys: 1.897 ± 0.532
LeuAsp: 4.607 ± 1.105
LeuGlu: 10.569 ± 1.23
LeuPhe: 3.794 ± 0.781
LeuGly: 5.962 ± 1.346
LeuHis: 1.897 ± 0.756
LeuIle: 5.42 ± 0.706
LeuLys: 6.775 ± 1.71
LeuLeu: 14.905 ± 4.254
LeuMet: 1.897 ± 0.623
LeuAsn: 3.794 ± 1.112
LeuPro: 6.775 ± 1.787
LeuGln: 6.775 ± 0.731
LeuArg: 9.214 ± 1.244
LeuSer: 9.485 ± 2.196
LeuThr: 5.149 ± 0.564
LeuVal: 5.962 ± 1.175
LeuTrp: 2.168 ± 0.756
LeuTyr: 5.42 ± 1.095
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.168 ± 0.422
MetCys: 0.0 ± 0.0
MetAsp: 1.084 ± 0.455
MetGlu: 2.168 ± 0.645
MetPhe: 0.271 ± 0.313
MetGly: 0.542 ± 0.329
MetHis: 0.813 ± 0.272
MetIle: 1.084 ± 0.274
MetLys: 0.0 ± 0.0
MetLeu: 1.626 ± 0.434
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.355 ± 0.516
MetGln: 0.813 ± 0.429
MetArg: 0.813 ± 0.322
MetSer: 1.897 ± 0.615
MetThr: 2.168 ± 0.706
MetVal: 0.813 ± 0.568
MetTrp: 0.271 ± 0.164
MetTyr: 0.271 ± 0.164
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.897 ± 0.756
AsnCys: 0.813 ± 0.535
AsnAsp: 1.084 ± 0.42
AsnGlu: 1.355 ± 1.164
AsnPhe: 0.271 ± 0.164
AsnGly: 1.084 ± 0.378
AsnHis: 0.542 ± 0.243
AsnIle: 1.626 ± 0.423
AsnLys: 0.813 ± 0.402
AsnLeu: 4.336 ± 0.812
AsnMet: 0.813 ± 0.493
AsnAsn: 0.813 ± 0.938
AsnPro: 2.439 ± 0.977
AsnGln: 1.626 ± 0.423
AsnArg: 2.439 ± 0.537
AsnSer: 1.355 ± 0.316
AsnThr: 1.626 ± 0.503
AsnVal: 1.084 ± 0.455
AsnTrp: 0.813 ± 0.348
AsnTyr: 1.084 ± 0.378
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.252 ± 2.367
ProCys: 1.626 ± 0.447
ProAsp: 3.252 ± 1.087
ProGlu: 4.065 ± 0.767
ProPhe: 2.439 ± 1.544
ProGly: 2.168 ± 0.428
ProHis: 1.355 ± 0.821
ProIle: 1.355 ± 0.478
ProLys: 2.439 ± 0.624
ProLeu: 7.317 ± 1.26
ProMet: 1.084 ± 0.549
ProAsn: 1.626 ± 0.628
ProPro: 2.71 ± 0.436
ProGln: 2.439 ± 0.741
ProArg: 3.523 ± 0.894
ProSer: 4.607 ± 1.478
ProThr: 3.794 ± 0.912
ProVal: 3.794 ± 0.962
ProTrp: 0.813 ± 0.272
ProTyr: 1.626 ± 0.708
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.065 ± 0.7
GlnCys: 1.084 ± 0.402
GlnAsp: 2.168 ± 0.742
GlnGlu: 4.607 ± 1.595
GlnPhe: 1.084 ± 0.455
GlnGly: 3.794 ± 0.847
GlnHis: 1.084 ± 0.992
GlnIle: 2.439 ± 0.319
GlnLys: 0.813 ± 0.322
GlnLeu: 4.065 ± 0.923
GlnMet: 0.271 ± 0.164
GlnAsn: 0.813 ± 0.567
GlnPro: 2.168 ± 0.965
GlnGln: 0.542 ± 0.329
GlnArg: 2.71 ± 0.956
GlnSer: 2.71 ± 0.532
GlnThr: 2.981 ± 1.534
GlnVal: 3.252 ± 0.808
GlnTrp: 1.084 ± 0.344
GlnTyr: 0.542 ± 0.281
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.504 ± 0.784
ArgCys: 0.271 ± 0.313
ArgAsp: 4.336 ± 1.779
ArgGlu: 7.317 ± 4.022
ArgPhe: 1.626 ± 0.715
ArgGly: 5.42 ± 1.932
ArgHis: 2.981 ± 0.44
ArgIle: 1.626 ± 0.423
ArgLys: 2.439 ± 1.038
ArgLeu: 6.775 ± 1.068
ArgMet: 1.897 ± 0.646
ArgAsn: 2.439 ± 0.434
ArgPro: 4.065 ± 0.888
ArgGln: 3.252 ± 0.747
ArgArg: 6.504 ± 2.548
ArgSer: 4.878 ± 0.702
ArgThr: 3.794 ± 0.362
ArgVal: 5.149 ± 1.115
ArgTrp: 1.084 ± 0.273
ArgTyr: 0.813 ± 0.438
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.233 ± 2.63
SerCys: 1.355 ± 0.294
SerAsp: 3.252 ± 1.069
SerGlu: 5.962 ± 1.27
SerPhe: 0.542 ± 0.329
SerGly: 5.42 ± 0.99
SerHis: 2.168 ± 0.549
SerIle: 2.71 ± 0.689
SerLys: 3.252 ± 0.843
SerLeu: 13.008 ± 3.31
SerMet: 0.813 ± 0.272
SerAsn: 1.626 ± 0.503
SerPro: 4.336 ± 0.739
SerGln: 4.065 ± 0.793
SerArg: 6.233 ± 0.798
SerSer: 8.13 ± 1.287
SerThr: 3.523 ± 1.252
SerVal: 3.794 ± 1.74
SerTrp: 0.813 ± 0.493
SerTyr: 1.626 ± 0.766
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.794 ± 0.553
ThrCys: 1.355 ± 0.77
ThrAsp: 2.168 ± 0.976
ThrGlu: 3.794 ± 0.614
ThrPhe: 1.084 ± 0.657
ThrGly: 2.71 ± 1.171
ThrHis: 1.355 ± 0.516
ThrIle: 1.355 ± 0.573
ThrLys: 1.626 ± 0.961
ThrLeu: 8.401 ± 1.681
ThrMet: 1.084 ± 0.455
ThrAsn: 2.439 ± 0.567
ThrPro: 4.065 ± 1.575
ThrGln: 0.813 ± 0.345
ThrArg: 3.523 ± 0.442
ThrSer: 4.607 ± 1.215
ThrThr: 3.252 ± 1.209
ThrVal: 3.523 ± 0.867
ThrTrp: 2.439 ± 0.87
ThrTyr: 1.626 ± 0.323
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.794 ± 1.428
ValCys: 1.084 ± 0.378
ValAsp: 2.168 ± 0.373
ValGlu: 4.336 ± 1.347
ValPhe: 1.626 ± 0.481
ValGly: 1.897 ± 0.558
ValHis: 0.813 ± 0.493
ValIle: 2.981 ± 1.167
ValLys: 3.252 ± 1.412
ValLeu: 5.691 ± 1.006
ValMet: 0.813 ± 0.493
ValAsn: 1.355 ± 0.479
ValPro: 2.439 ± 1.275
ValGln: 2.981 ± 0.743
ValArg: 2.439 ± 0.842
ValSer: 4.065 ± 0.837
ValThr: 3.523 ± 0.729
ValVal: 2.439 ± 0.439
ValTrp: 0.813 ± 0.938
ValTyr: 2.439 ± 0.648
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.542 ± 0.329
TrpCys: 1.355 ± 0.821
TrpAsp: 1.355 ± 0.995
TrpGlu: 1.626 ± 0.702
TrpPhe: 0.271 ± 0.164
TrpGly: 0.813 ± 0.493
TrpHis: 0.542 ± 0.329
TrpIle: 1.084 ± 0.378
TrpLys: 1.355 ± 0.489
TrpLeu: 1.626 ± 1.07
TrpMet: 0.542 ± 0.329
TrpAsn: 0.813 ± 0.535
TrpPro: 1.084 ± 0.378
TrpGln: 0.271 ± 0.367
TrpArg: 0.813 ± 0.351
TrpSer: 1.355 ± 0.316
TrpThr: 1.897 ± 0.718
TrpVal: 0.271 ± 0.313
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.271 ± 0.454
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.897 ± 0.604
TyrCys: 0.271 ± 0.164
TyrAsp: 1.355 ± 0.316
TyrGlu: 1.084 ± 0.375
TyrPhe: 1.084 ± 0.273
TyrGly: 1.355 ± 0.516
TyrHis: 0.542 ± 0.329
TyrIle: 1.897 ± 0.574
TyrLys: 1.897 ± 1.089
TyrLeu: 3.252 ± 0.667
TyrMet: 0.0 ± 0.0
TyrAsn: 0.813 ± 0.272
TyrPro: 1.355 ± 0.409
TyrGln: 2.439 ± 0.399
TyrArg: 1.626 ± 0.438
TyrSer: 3.794 ± 1.291
TyrThr: 1.897 ± 0.51
TyrVal: 1.084 ± 0.378
TyrTrp: 0.542 ± 0.329
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3691 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
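A protein-level bootstrap of this kind can be sketched as below. This is a hedged illustration, not the database's actual code: it assumes that whole proteins are resampled with replacement, that the reported error is the standard deviation over 100 replicates, and that frequencies are normalized by the total number of counted dipeptides. The function names, the seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per-mille dipeptide frequencies, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of one dipeptide's frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.pstdev(values)


# Toy proteome in one-letter code (not the Nyamanini virus proteins).
toy_proteome = ["MKKLAEEL", "MAEELLKK", "MLLSSAEK"]
print("EE:", round(dipeptide_permille(toy_proteome)["EE"], 3),
      "±", round(bootstrap_error(toy_proteome, "EE"), 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a few proteins the error can be large relative to the frequency itself.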

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski