Amino acid dipepetide frequency for Free State vervet virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.046AlaAla: 9.046 ± 1.391
3.438AlaCys: 3.438 ± 0.598
3.076AlaAsp: 3.076 ± 1.037
3.257AlaGlu: 3.257 ± 1.054
5.066AlaPhe: 5.066 ± 1.718
4.523AlaGly: 4.523 ± 1.148
2.714AlaHis: 2.714 ± 0.779
5.609AlaIle: 5.609 ± 0.808
3.438AlaLys: 3.438 ± 0.796
10.313AlaLeu: 10.313 ± 1.547
0.181AlaMet: 0.181 ± 0.292
3.076AlaAsn: 3.076 ± 0.822
5.609AlaPro: 5.609 ± 0.98
2.171AlaGln: 2.171 ± 0.467
2.895AlaArg: 2.895 ± 0.887
5.247AlaSer: 5.247 ± 1.003
4.523AlaThr: 4.523 ± 1.026
6.152AlaVal: 6.152 ± 0.927
0.724AlaTrp: 0.724 ± 0.714
1.628AlaTyr: 1.628 ± 0.559
0.0AlaXaa: 0.0 ± 0.0
Cys
2.895CysAla: 2.895 ± 0.498
1.447CysCys: 1.447 ± 0.777
1.628CysAsp: 1.628 ± 0.499
0.362CysGlu: 0.362 ± 0.243
1.086CysPhe: 1.086 ± 0.612
1.99CysGly: 1.99 ± 0.46
1.809CysHis: 1.809 ± 0.497
2.171CysIle: 2.171 ± 0.798
0.724CysLys: 0.724 ± 0.34
3.438CysLeu: 3.438 ± 1.322
1.086CysMet: 1.086 ± 0.464
1.628CysAsn: 1.628 ± 0.621
1.99CysPro: 1.99 ± 1.186
0.543CysGln: 0.543 ± 0.212
1.99CysArg: 1.99 ± 0.587
2.533CysSer: 2.533 ± 0.792
1.99CysThr: 1.99 ± 0.45
2.714CysVal: 2.714 ± 0.571
0.905CysTrp: 0.905 ± 0.478
1.99CysTyr: 1.99 ± 0.584
0.0CysXaa: 0.0 ± 0.0
Asp
3.619AspAla: 3.619 ± 0.518
1.267AspCys: 1.267 ± 0.506
1.99AspAsp: 1.99 ± 0.611
1.086AspGlu: 1.086 ± 0.471
2.533AspPhe: 2.533 ± 0.814
3.257AspGly: 3.257 ± 1.043
0.905AspHis: 0.905 ± 0.421
1.628AspIle: 1.628 ± 0.897
1.086AspLys: 1.086 ± 0.718
5.066AspLeu: 5.066 ± 0.708
0.543AspMet: 0.543 ± 0.274
0.181AspAsn: 0.181 ± 0.239
3.438AspPro: 3.438 ± 1.03
1.267AspGln: 1.267 ± 0.609
1.99AspArg: 1.99 ± 0.69
2.714AspSer: 2.714 ± 0.726
2.895AspThr: 2.895 ± 0.529
3.98AspVal: 3.98 ± 0.938
1.086AspTrp: 1.086 ± 0.407
1.628AspTyr: 1.628 ± 0.665
0.0AspXaa: 0.0 ± 0.0
Glu
3.076GluAla: 3.076 ± 0.804
1.267GluCys: 1.267 ± 0.402
1.086GluAsp: 1.086 ± 0.568
1.809GluGlu: 1.809 ± 0.561
1.086GluPhe: 1.086 ± 0.356
2.533GluGly: 2.533 ± 1.369
1.086GluHis: 1.086 ± 0.395
1.809GluIle: 1.809 ± 0.753
0.905GluLys: 0.905 ± 0.551
2.895GluLeu: 2.895 ± 0.866
0.362GluMet: 0.362 ± 0.155
1.086GluAsn: 1.086 ± 0.313
2.352GluPro: 2.352 ± 0.857
1.086GluGln: 1.086 ± 0.444
2.171GluArg: 2.171 ± 0.682
2.533GluSer: 2.533 ± 0.595
2.352GluThr: 2.352 ± 0.487
2.895GluVal: 2.895 ± 0.675
0.905GluTrp: 0.905 ± 0.259
1.267GluTyr: 1.267 ± 0.403
0.0GluXaa: 0.0 ± 0.0
Phe
3.438PheAla: 3.438 ± 1.175
2.171PheCys: 2.171 ± 0.803
2.533PheAsp: 2.533 ± 0.535
1.99PheGlu: 1.99 ± 0.446
3.257PhePhe: 3.257 ± 1.118
3.619PheGly: 3.619 ± 0.924
1.086PheHis: 1.086 ± 0.566
2.533PheIle: 2.533 ± 0.643
2.352PheLys: 2.352 ± 0.66
4.161PheLeu: 4.161 ± 1.767
0.905PheMet: 0.905 ± 0.561
1.628PheAsn: 1.628 ± 0.979
2.171PhePro: 2.171 ± 0.403
1.99PheGln: 1.99 ± 0.633
1.809PheArg: 1.809 ± 0.419
4.523PheSer: 4.523 ± 0.751
3.076PheThr: 3.076 ± 1.07
4.523PheVal: 4.523 ± 1.529
1.447PheTrp: 1.447 ± 0.773
1.447PheTyr: 1.447 ± 0.614
0.0PheXaa: 0.0 ± 0.0
Gly
4.342GlyAla: 4.342 ± 1.649
2.533GlyCys: 2.533 ± 0.556
4.885GlyAsp: 4.885 ± 1.097
2.171GlyGlu: 2.171 ± 0.738
3.619GlyPhe: 3.619 ± 0.625
3.8GlyGly: 3.8 ± 0.764
1.086GlyHis: 1.086 ± 0.416
2.895GlyIle: 2.895 ± 0.506
3.257GlyLys: 3.257 ± 0.849
7.237GlyLeu: 7.237 ± 0.848
1.086GlyMet: 1.086 ± 0.364
2.714GlyAsn: 2.714 ± 0.707
3.076GlyPro: 3.076 ± 0.871
1.809GlyGln: 1.809 ± 0.393
2.895GlyArg: 2.895 ± 0.637
4.523GlySer: 4.523 ± 0.943
4.885GlyThr: 4.885 ± 0.815
5.79GlyVal: 5.79 ± 1.499
0.905GlyTrp: 0.905 ± 0.403
4.342GlyTyr: 4.342 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
2.714HisAla: 2.714 ± 0.824
1.267HisCys: 1.267 ± 0.381
0.724HisAsp: 0.724 ± 0.448
0.362HisGlu: 0.362 ± 0.503
2.171HisPhe: 2.171 ± 0.468
1.809HisGly: 1.809 ± 0.442
1.086HisHis: 1.086 ± 0.512
1.628HisIle: 1.628 ± 0.654
0.543HisLys: 0.543 ± 0.331
2.714HisLeu: 2.714 ± 0.65
1.086HisMet: 1.086 ± 0.283
1.447HisAsn: 1.447 ± 1.048
1.809HisPro: 1.809 ± 0.69
1.086HisGln: 1.086 ± 0.544
2.171HisArg: 2.171 ± 1.192
1.628HisSer: 1.628 ± 0.994
2.352HisThr: 2.352 ± 0.829
1.267HisVal: 1.267 ± 0.375
0.724HisTrp: 0.724 ± 0.29
1.267HisTyr: 1.267 ± 0.351
0.0HisXaa: 0.0 ± 0.0
Ile
3.438IleAla: 3.438 ± 0.526
1.99IleCys: 1.99 ± 0.557
2.352IleAsp: 2.352 ± 0.885
0.905IleGlu: 0.905 ± 0.259
2.714IlePhe: 2.714 ± 1.118
2.895IleGly: 2.895 ± 0.818
1.267IleHis: 1.267 ± 0.737
3.98IleIle: 3.98 ± 1.372
1.99IleLys: 1.99 ± 0.459
5.247IleLeu: 5.247 ± 0.586
1.086IleMet: 1.086 ± 0.522
0.724IleAsn: 0.724 ± 0.437
3.619IlePro: 3.619 ± 1.211
1.628IleGln: 1.628 ± 0.644
1.809IleArg: 1.809 ± 0.384
5.428IleSer: 5.428 ± 1.075
5.066IleThr: 5.066 ± 1.203
2.895IleVal: 2.895 ± 1.044
0.181IleTrp: 0.181 ± 0.241
1.628IleTyr: 1.628 ± 0.641
0.0IleXaa: 0.0 ± 0.0
Lys
2.352LysAla: 2.352 ± 0.646
0.905LysCys: 0.905 ± 0.385
2.352LysAsp: 2.352 ± 0.81
1.628LysGlu: 1.628 ± 0.509
2.352LysPhe: 2.352 ± 0.659
2.895LysGly: 2.895 ± 1.05
1.086LysHis: 1.086 ± 0.412
1.267LysIle: 1.267 ± 0.433
1.99LysLys: 1.99 ± 0.83
4.342LysLeu: 4.342 ± 0.774
0.543LysMet: 0.543 ± 0.48
1.809LysAsn: 1.809 ± 0.577
2.714LysPro: 2.714 ± 0.942
1.267LysGln: 1.267 ± 0.363
2.352LysArg: 2.352 ± 0.628
2.533LysSer: 2.533 ± 0.637
2.895LysThr: 2.895 ± 0.672
2.533LysVal: 2.533 ± 0.71
0.0LysTrp: 0.0 ± 0.0
1.086LysTyr: 1.086 ± 0.562
0.0LysXaa: 0.0 ± 0.0
Leu
11.218LeuAla: 11.218 ± 2.449
4.161LeuCys: 4.161 ± 0.832
5.066LeuAsp: 5.066 ± 0.665
3.98LeuGlu: 3.98 ± 0.381
6.513LeuPhe: 6.513 ± 2.191
8.323LeuGly: 8.323 ± 1.346
1.99LeuHis: 1.99 ± 0.99
4.161LeuIle: 4.161 ± 1.113
3.8LeuLys: 3.8 ± 0.881
13.389LeuLeu: 13.389 ± 3.86
1.447LeuMet: 1.447 ± 0.381
3.438LeuAsn: 3.438 ± 0.625
5.428LeuPro: 5.428 ± 0.916
2.352LeuGln: 2.352 ± 0.679
5.066LeuArg: 5.066 ± 1.36
10.675LeuSer: 10.675 ± 2.607
7.599LeuThr: 7.599 ± 1.029
7.418LeuVal: 7.418 ± 0.986
1.447LeuTrp: 1.447 ± 0.704
3.076LeuTyr: 3.076 ± 0.877
0.0LeuXaa: 0.0 ± 0.0
Met
2.352MetAla: 2.352 ± 0.588
0.362MetCys: 0.362 ± 0.155
0.362MetAsp: 0.362 ± 0.318
0.362MetGlu: 0.362 ± 0.243
0.362MetPhe: 0.362 ± 0.237
0.905MetGly: 0.905 ± 0.47
0.543MetHis: 0.543 ± 0.353
0.905MetIle: 0.905 ± 0.421
0.905MetLys: 0.905 ± 0.265
0.724MetLeu: 0.724 ± 0.496
0.181MetMet: 0.181 ± 0.24
0.543MetAsn: 0.543 ± 0.359
0.905MetPro: 0.905 ± 0.302
0.543MetGln: 0.543 ± 0.381
0.905MetArg: 0.905 ± 0.276
0.905MetSer: 0.905 ± 0.622
0.181MetThr: 0.181 ± 0.12
1.809MetVal: 1.809 ± 0.633
0.181MetTrp: 0.181 ± 0.12
0.362MetTyr: 0.362 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
2.171AsnAla: 2.171 ± 0.691
1.809AsnCys: 1.809 ± 0.646
1.086AsnAsp: 1.086 ± 0.471
1.99AsnGlu: 1.99 ± 0.517
1.628AsnPhe: 1.628 ± 0.391
3.619AsnGly: 3.619 ± 0.926
1.628AsnHis: 1.628 ± 1.071
1.99AsnIle: 1.99 ± 1.153
1.447AsnLys: 1.447 ± 0.458
2.352AsnLeu: 2.352 ± 0.489
0.362AsnMet: 0.362 ± 0.242
2.352AsnAsn: 2.352 ± 1.744
2.714AsnPro: 2.714 ± 0.588
0.362AsnGln: 0.362 ± 0.44
1.086AsnArg: 1.086 ± 0.297
2.171AsnSer: 2.171 ± 0.477
2.714AsnThr: 2.714 ± 0.84
1.628AsnVal: 1.628 ± 0.534
0.0AsnTrp: 0.0 ± 0.0
2.533AsnTyr: 2.533 ± 0.813
0.0AsnXaa: 0.0 ± 0.0
Pro
6.333ProAla: 6.333 ± 1.604
1.086ProCys: 1.086 ± 0.471
2.895ProAsp: 2.895 ± 0.737
3.257ProGlu: 3.257 ± 0.983
2.533ProPhe: 2.533 ± 0.499
3.8ProGly: 3.8 ± 1.114
2.171ProHis: 2.171 ± 0.459
3.257ProIle: 3.257 ± 0.682
2.171ProLys: 2.171 ± 0.782
5.609ProLeu: 5.609 ± 1.176
0.362ProMet: 0.362 ± 0.375
2.714ProAsn: 2.714 ± 0.503
4.704ProPro: 4.704 ± 1.445
1.267ProGln: 1.267 ± 0.273
3.076ProArg: 3.076 ± 0.614
4.342ProSer: 4.342 ± 0.603
3.8ProThr: 3.8 ± 0.689
4.161ProVal: 4.161 ± 1.293
1.267ProTrp: 1.267 ± 0.532
2.714ProTyr: 2.714 ± 0.787
0.0ProXaa: 0.0 ± 0.0
Gln
2.171GlnAla: 2.171 ± 0.448
0.724GlnCys: 0.724 ± 0.362
0.543GlnAsp: 0.543 ± 0.416
0.905GlnGlu: 0.905 ± 0.273
1.086GlnPhe: 1.086 ± 0.433
2.533GlnGly: 2.533 ± 0.63
0.724GlnHis: 0.724 ± 0.463
1.628GlnIle: 1.628 ± 0.412
0.543GlnLys: 0.543 ± 0.204
3.257GlnLeu: 3.257 ± 0.776
0.0GlnMet: 0.0 ± 0.0
1.809GlnAsn: 1.809 ± 0.971
2.352GlnPro: 2.352 ± 0.602
1.086GlnGln: 1.086 ± 0.363
0.724GlnArg: 0.724 ± 0.41
1.99GlnSer: 1.99 ± 1.247
1.809GlnThr: 1.809 ± 0.454
1.99GlnVal: 1.99 ± 0.562
0.181GlnTrp: 0.181 ± 0.211
1.447GlnTyr: 1.447 ± 0.309
0.0GlnXaa: 0.0 ± 0.0
Arg
3.076ArgAla: 3.076 ± 0.694
1.267ArgCys: 1.267 ± 0.568
1.267ArgAsp: 1.267 ± 0.397
1.809ArgGlu: 1.809 ± 0.692
2.533ArgPhe: 2.533 ± 0.558
2.533ArgGly: 2.533 ± 0.788
2.714ArgHis: 2.714 ± 0.596
1.809ArgIle: 1.809 ± 0.528
1.267ArgLys: 1.267 ± 0.331
5.971ArgLeu: 5.971 ± 0.827
0.543ArgMet: 0.543 ± 0.204
1.99ArgAsn: 1.99 ± 0.425
2.895ArgPro: 2.895 ± 0.752
2.352ArgGln: 2.352 ± 0.659
3.076ArgArg: 3.076 ± 0.796
2.714ArgSer: 2.714 ± 0.497
2.714ArgThr: 2.714 ± 0.68
3.257ArgVal: 3.257 ± 1.006
0.543ArgTrp: 0.543 ± 0.334
2.533ArgTyr: 2.533 ± 0.704
0.0ArgXaa: 0.0 ± 0.0
Ser
5.79SerAla: 5.79 ± 0.972
2.714SerCys: 2.714 ± 0.514
2.352SerAsp: 2.352 ± 0.519
2.352SerGlu: 2.352 ± 0.589
3.98SerPhe: 3.98 ± 1.636
5.609SerGly: 5.609 ± 1.224
2.714SerHis: 2.714 ± 0.887
3.8SerIle: 3.8 ± 1.275
3.619SerLys: 3.619 ± 1.07
9.046SerLeu: 9.046 ± 2.471
1.447SerMet: 1.447 ± 0.33
1.267SerAsn: 1.267 ± 0.626
4.342SerPro: 4.342 ± 0.915
1.99SerGln: 1.99 ± 0.752
3.98SerArg: 3.98 ± 0.649
7.418SerSer: 7.418 ± 1.795
5.609SerThr: 5.609 ± 0.891
4.523SerVal: 4.523 ± 1.428
0.724SerTrp: 0.724 ± 0.44
2.533SerTyr: 2.533 ± 0.703
0.0SerXaa: 0.0 ± 0.0
Thr
6.513ThrAla: 6.513 ± 1.449
2.171ThrCys: 2.171 ± 0.364
2.352ThrAsp: 2.352 ± 0.815
2.533ThrGlu: 2.533 ± 0.617
2.714ThrPhe: 2.714 ± 1.005
5.79ThrGly: 5.79 ± 1.791
1.99ThrHis: 1.99 ± 0.418
2.714ThrIle: 2.714 ± 0.964
3.438ThrLys: 3.438 ± 0.817
8.323ThrLeu: 8.323 ± 1.464
1.447ThrMet: 1.447 ± 0.478
2.714ThrAsn: 2.714 ± 0.793
5.066ThrPro: 5.066 ± 0.985
1.628ThrGln: 1.628 ± 0.803
3.619ThrArg: 3.619 ± 0.613
4.523ThrSer: 4.523 ± 0.818
5.79ThrThr: 5.79 ± 1.203
3.8ThrVal: 3.8 ± 0.696
0.724ThrTrp: 0.724 ± 0.659
2.533ThrTyr: 2.533 ± 0.518
0.0ThrXaa: 0.0 ± 0.0
Val
4.704ValAla: 4.704 ± 1.042
2.171ValCys: 2.171 ± 0.842
3.8ValAsp: 3.8 ± 1.005
2.352ValGlu: 2.352 ± 0.563
2.714ValPhe: 2.714 ± 0.411
4.704ValGly: 4.704 ± 1.237
1.809ValHis: 1.809 ± 0.452
3.619ValIle: 3.619 ± 0.685
3.076ValLys: 3.076 ± 0.636
8.866ValLeu: 8.866 ± 1.693
0.543ValMet: 0.543 ± 0.285
3.076ValAsn: 3.076 ± 0.754
4.161ValPro: 4.161 ± 1.456
0.905ValGln: 0.905 ± 0.289
2.895ValArg: 2.895 ± 0.922
5.971ValSer: 5.971 ± 1.225
6.513ValThr: 6.513 ± 1.521
7.237ValVal: 7.237 ± 1.297
0.181ValTrp: 0.181 ± 0.144
2.895ValTyr: 2.895 ± 1.061
0.0ValXaa: 0.0 ± 0.0
Trp
0.543TrpAla: 0.543 ± 0.302
0.181TrpCys: 0.181 ± 0.12
0.905TrpAsp: 0.905 ± 0.386
0.0TrpGlu: 0.0 ± 0.0
0.905TrpPhe: 0.905 ± 0.255
0.543TrpGly: 0.543 ± 0.406
0.0TrpHis: 0.0 ± 0.0
0.905TrpIle: 0.905 ± 0.646
0.724TrpLys: 0.724 ± 0.647
2.352TrpLeu: 2.352 ± 1.047
0.181TrpMet: 0.181 ± 0.255
0.724TrpAsn: 0.724 ± 0.326
0.543TrpPro: 0.543 ± 0.274
0.905TrpGln: 0.905 ± 0.402
0.543TrpArg: 0.543 ± 0.409
0.362TrpSer: 0.362 ± 0.155
0.724TrpThr: 0.724 ± 0.326
0.905TrpVal: 0.905 ± 0.323
0.0TrpTrp: 0.0 ± 0.0
0.724TrpTyr: 0.724 ± 0.591
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.076TyrAla: 3.076 ± 0.91
1.99TyrCys: 1.99 ± 0.584
0.905TyrAsp: 0.905 ± 0.295
1.447TyrGlu: 1.447 ± 0.347
1.809TyrPhe: 1.809 ± 0.461
2.171TyrGly: 2.171 ± 0.741
1.447TyrHis: 1.447 ± 0.58
2.533TyrIle: 2.533 ± 0.573
1.628TyrLys: 1.628 ± 0.399
5.247TyrLeu: 5.247 ± 0.942
0.724TyrMet: 0.724 ± 0.43
0.724TyrAsn: 0.724 ± 0.371
1.628TyrPro: 1.628 ± 0.75
1.267TyrGln: 1.267 ± 0.648
1.809TyrArg: 1.809 ± 0.589
3.076TyrSer: 3.076 ± 0.791
2.895TyrThr: 2.895 ± 0.833
2.714TyrVal: 2.714 ± 0.574
0.543TyrTrp: 0.543 ± 0.236
2.533TyrTyr: 2.533 ± 0.612
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (5528 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski