Amino acid dipepetide frequency for Wongabel hapavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.855AlaAla: 1.855 ± 1.061
0.232AlaCys: 0.232 ± 0.276
2.551AlaAsp: 2.551 ± 0.44
1.16AlaGlu: 1.16 ± 0.302
0.696AlaPhe: 0.696 ± 0.548
1.16AlaGly: 1.16 ± 0.54
0.0AlaHis: 0.0 ± 0.0
2.319AlaIle: 2.319 ± 0.557
2.551AlaLys: 2.551 ± 0.799
3.479AlaLeu: 3.479 ± 0.801
0.232AlaMet: 0.232 ± 0.141
2.087AlaAsn: 2.087 ± 0.565
0.696AlaPro: 0.696 ± 0.325
1.16AlaGln: 1.16 ± 0.548
0.928AlaArg: 0.928 ± 0.428
2.087AlaSer: 2.087 ± 0.606
2.551AlaThr: 2.551 ± 0.777
1.16AlaVal: 1.16 ± 0.461
1.16AlaTrp: 1.16 ± 0.4
2.087AlaTyr: 2.087 ± 0.597
0.0AlaXaa: 0.0 ± 0.0
Cys
1.391CysAla: 1.391 ± 0.474
0.928CysCys: 0.928 ± 1.32
0.464CysAsp: 0.464 ± 0.282
0.928CysGlu: 0.928 ± 0.374
0.696CysPhe: 0.696 ± 0.255
0.696CysGly: 0.696 ± 0.439
1.391CysHis: 1.391 ± 0.925
0.928CysIle: 0.928 ± 0.451
2.319CysLys: 2.319 ± 0.788
1.16CysLeu: 1.16 ± 0.47
0.0CysMet: 0.0 ± 0.0
0.928CysAsn: 0.928 ± 0.463
1.391CysPro: 1.391 ± 0.709
0.464CysGln: 0.464 ± 0.372
1.855CysArg: 1.855 ± 0.466
1.391CysSer: 1.391 ± 0.588
0.232CysThr: 0.232 ± 0.141
0.464CysVal: 0.464 ± 0.225
0.696CysTrp: 0.696 ± 0.423
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.391AspAla: 1.391 ± 0.61
0.696AspCys: 0.696 ± 0.273
4.638AspAsp: 4.638 ± 1.419
3.942AspGlu: 3.942 ± 1.259
4.174AspPhe: 4.174 ± 0.869
2.319AspGly: 2.319 ± 0.505
1.623AspHis: 1.623 ± 0.711
3.711AspIle: 3.711 ± 1.028
4.406AspLys: 4.406 ± 0.999
6.725AspLeu: 6.725 ± 1.432
2.087AspMet: 2.087 ± 0.604
4.406AspAsn: 4.406 ± 0.696
4.174AspPro: 4.174 ± 0.718
2.087AspGln: 2.087 ± 0.801
2.087AspArg: 2.087 ± 0.539
2.551AspSer: 2.551 ± 0.643
2.319AspThr: 2.319 ± 0.977
2.551AspVal: 2.551 ± 1.269
1.855AspTrp: 1.855 ± 0.489
4.406AspTyr: 4.406 ± 0.919
0.0AspXaa: 0.0 ± 0.0
Glu
1.623GluAla: 1.623 ± 0.378
0.928GluCys: 0.928 ± 0.696
5.334GluAsp: 5.334 ± 1.031
3.711GluGlu: 3.711 ± 1.032
3.015GluPhe: 3.015 ± 0.762
2.551GluGly: 2.551 ± 0.388
1.391GluHis: 1.391 ± 0.505
6.725GluIle: 6.725 ± 0.844
3.942GluLys: 3.942 ± 1.758
5.798GluLeu: 5.798 ± 0.686
1.391GluMet: 1.391 ± 0.418
3.711GluAsn: 3.711 ± 0.82
2.319GluPro: 2.319 ± 1.189
1.16GluGln: 1.16 ± 0.442
1.16GluArg: 1.16 ± 0.565
5.334GluSer: 5.334 ± 1.036
2.551GluThr: 2.551 ± 0.732
3.015GluVal: 3.015 ± 0.87
0.928GluTrp: 0.928 ± 0.374
3.015GluTyr: 3.015 ± 0.759
0.0GluXaa: 0.0 ± 0.0
Phe
0.928PheAla: 0.928 ± 0.345
0.928PheCys: 0.928 ± 0.258
4.174PheAsp: 4.174 ± 1.246
3.711PheGlu: 3.711 ± 0.851
3.015PhePhe: 3.015 ± 1.139
4.174PheGly: 4.174 ± 0.886
0.696PheHis: 0.696 ± 0.315
2.783PheIle: 2.783 ± 0.755
3.247PheLys: 3.247 ± 0.542
4.638PheLeu: 4.638 ± 0.941
1.855PheMet: 1.855 ± 0.965
1.391PheAsn: 1.391 ± 0.382
2.551PhePro: 2.551 ± 0.865
2.783PheGln: 2.783 ± 0.822
3.015PheArg: 3.015 ± 0.519
3.247PheSer: 3.247 ± 1.331
1.855PheThr: 1.855 ± 0.687
4.174PheVal: 4.174 ± 1.219
0.464PheTrp: 0.464 ± 0.225
1.855PheTyr: 1.855 ± 0.381
0.0PheXaa: 0.0 ± 0.0
Gly
0.232GlyAla: 0.232 ± 0.276
0.232GlyCys: 0.232 ± 0.141
3.942GlyAsp: 3.942 ± 0.931
3.479GlyGlu: 3.479 ± 0.88
4.174GlyPhe: 4.174 ± 0.587
2.319GlyGly: 2.319 ± 0.681
1.16GlyHis: 1.16 ± 0.521
5.798GlyIle: 5.798 ± 1.479
3.479GlyLys: 3.479 ± 0.898
7.189GlyLeu: 7.189 ± 0.957
1.16GlyMet: 1.16 ± 0.627
2.783GlyAsn: 2.783 ± 0.852
1.623GlyPro: 1.623 ± 0.632
2.783GlyGln: 2.783 ± 0.684
2.319GlyArg: 2.319 ± 1.181
5.566GlySer: 5.566 ± 0.796
3.015GlyThr: 3.015 ± 1.103
1.391GlyVal: 1.391 ± 0.452
0.696GlyTrp: 0.696 ± 0.324
1.16GlyTyr: 1.16 ± 0.403
0.0GlyXaa: 0.0 ± 0.0
His
0.696HisAla: 0.696 ± 0.273
0.0HisCys: 0.0 ± 0.0
1.16HisAsp: 1.16 ± 0.33
1.855HisGlu: 1.855 ± 0.493
1.623HisPhe: 1.623 ± 0.695
0.696HisGly: 0.696 ± 0.439
0.928HisHis: 0.928 ± 0.328
3.015HisIle: 3.015 ± 0.989
1.623HisLys: 1.623 ± 0.411
2.551HisLeu: 2.551 ± 0.495
0.928HisMet: 0.928 ± 0.43
1.16HisAsn: 1.16 ± 0.825
1.855HisPro: 1.855 ± 0.902
0.696HisGln: 0.696 ± 0.324
1.16HisArg: 1.16 ± 0.548
1.855HisSer: 1.855 ± 0.459
0.464HisThr: 0.464 ± 0.388
2.319HisVal: 2.319 ± 0.7
0.464HisTrp: 0.464 ± 0.282
0.696HisTyr: 0.696 ± 0.423
0.0HisXaa: 0.0 ± 0.0
Ile
2.087IleAla: 2.087 ± 0.441
1.623IleCys: 1.623 ± 0.59
6.03IleAsp: 6.03 ± 0.749
6.03IleGlu: 6.03 ± 0.876
3.711IlePhe: 3.711 ± 0.996
6.03IleGly: 6.03 ± 1.378
1.391IleHis: 1.391 ± 0.421
8.349IleIle: 8.349 ± 1.209
7.189IleLys: 7.189 ± 0.944
7.885IleLeu: 7.885 ± 2.109
1.855IleMet: 1.855 ± 0.537
4.406IleAsn: 4.406 ± 1.838
4.638IlePro: 4.638 ± 1.046
1.623IleGln: 1.623 ± 0.451
4.174IleArg: 4.174 ± 0.99
5.566IleSer: 5.566 ± 1.23
4.87IleThr: 4.87 ± 0.763
0.928IleVal: 0.928 ± 0.418
3.479IleTrp: 3.479 ± 1.092
3.942IleTyr: 3.942 ± 0.862
0.0IleXaa: 0.0 ± 0.0
Lys
1.16LysAla: 1.16 ± 0.332
0.696LysCys: 0.696 ± 0.829
2.087LysAsp: 2.087 ± 0.678
5.566LysGlu: 5.566 ± 0.718
3.015LysPhe: 3.015 ± 1.21
4.174LysGly: 4.174 ± 0.93
2.319LysHis: 2.319 ± 0.742
6.957LysIle: 6.957 ± 1.358
6.03LysLys: 6.03 ± 1.664
6.957LysLeu: 6.957 ± 1.271
2.087LysMet: 2.087 ± 0.874
5.334LysAsn: 5.334 ± 0.814
1.855LysPro: 1.855 ± 0.891
0.464LysGln: 0.464 ± 0.225
3.711LysArg: 3.711 ± 1.058
6.957LysSer: 6.957 ± 1.158
5.102LysThr: 5.102 ± 1.47
3.942LysVal: 3.942 ± 1.096
1.855LysTrp: 1.855 ± 0.466
2.319LysTyr: 2.319 ± 1.023
0.0LysXaa: 0.0 ± 0.0
Leu
4.174LeuAla: 4.174 ± 0.903
2.783LeuCys: 2.783 ± 1.018
6.262LeuAsp: 6.262 ± 1.323
6.494LeuGlu: 6.494 ± 1.716
4.87LeuPhe: 4.87 ± 0.75
6.725LeuGly: 6.725 ± 1.244
1.623LeuHis: 1.623 ± 0.987
9.276LeuIle: 9.276 ± 1.798
7.189LeuLys: 7.189 ± 1.02
7.421LeuLeu: 7.421 ± 1.106
2.551LeuMet: 2.551 ± 0.368
5.798LeuAsn: 5.798 ± 1.237
1.855LeuPro: 1.855 ± 0.462
4.174LeuGln: 4.174 ± 1.154
3.479LeuArg: 3.479 ± 1.091
8.117LeuSer: 8.117 ± 0.519
6.03LeuThr: 6.03 ± 1.404
7.421LeuVal: 7.421 ± 1.423
0.928LeuTrp: 0.928 ± 0.281
2.319LeuTyr: 2.319 ± 0.657
0.0LeuXaa: 0.0 ± 0.0
Met
1.16MetAla: 1.16 ± 0.451
0.0MetCys: 0.0 ± 0.0
1.16MetAsp: 1.16 ± 0.332
1.855MetGlu: 1.855 ± 0.546
0.928MetPhe: 0.928 ± 0.519
1.623MetGly: 1.623 ± 0.773
0.464MetHis: 0.464 ± 0.312
2.319MetIle: 2.319 ± 0.846
1.855MetLys: 1.855 ± 0.872
2.551MetLeu: 2.551 ± 1.063
0.232MetMet: 0.232 ± 0.282
0.464MetAsn: 0.464 ± 0.282
0.232MetPro: 0.232 ± 0.263
0.464MetGln: 0.464 ± 0.259
1.16MetArg: 1.16 ± 0.707
1.623MetSer: 1.623 ± 0.549
1.16MetThr: 1.16 ± 0.589
1.623MetVal: 1.623 ± 0.626
0.696MetTrp: 0.696 ± 0.333
1.855MetTyr: 1.855 ± 0.77
0.0MetXaa: 0.0 ± 0.0
Asn
2.087AsnAla: 2.087 ± 0.973
1.855AsnCys: 1.855 ± 0.481
2.783AsnAsp: 2.783 ± 0.669
1.16AsnGlu: 1.16 ± 0.383
2.551AsnPhe: 2.551 ± 0.861
2.783AsnGly: 2.783 ± 1.371
1.623AsnHis: 1.623 ± 0.501
6.494AsnIle: 6.494 ± 0.976
3.479AsnLys: 3.479 ± 0.862
9.045AsnLeu: 9.045 ± 1.262
1.16AsnMet: 1.16 ± 0.475
2.783AsnAsn: 2.783 ± 1.131
4.406AsnPro: 4.406 ± 0.802
2.551AsnGln: 2.551 ± 0.697
1.855AsnArg: 1.855 ± 0.862
3.942AsnSer: 3.942 ± 1.056
1.623AsnThr: 1.623 ± 0.401
1.623AsnVal: 1.623 ± 0.803
1.623AsnTrp: 1.623 ± 0.511
3.711AsnTyr: 3.711 ± 1.059
0.0AsnXaa: 0.0 ± 0.0
Pro
1.391ProAla: 1.391 ± 0.588
0.232ProCys: 0.232 ± 0.327
3.711ProAsp: 3.711 ± 1.023
1.623ProGlu: 1.623 ± 0.485
2.087ProPhe: 2.087 ± 0.608
2.087ProGly: 2.087 ± 0.692
1.16ProHis: 1.16 ± 0.426
4.174ProIle: 4.174 ± 0.609
1.855ProLys: 1.855 ± 0.793
4.638ProLeu: 4.638 ± 0.835
0.464ProMet: 0.464 ± 0.5
2.551ProAsn: 2.551 ± 0.653
2.783ProPro: 2.783 ± 0.617
1.623ProGln: 1.623 ± 0.923
2.319ProArg: 2.319 ± 1.034
4.174ProSer: 4.174 ± 0.773
2.319ProThr: 2.319 ± 0.523
1.855ProVal: 1.855 ± 0.612
0.696ProTrp: 0.696 ± 0.377
1.623ProTyr: 1.623 ± 0.608
0.0ProXaa: 0.0 ± 0.0
Gln
1.16GlnAla: 1.16 ± 0.341
0.696GlnCys: 0.696 ± 0.339
2.087GlnAsp: 2.087 ± 0.886
2.087GlnGlu: 2.087 ± 0.643
0.928GlnPhe: 0.928 ± 0.462
1.855GlnGly: 1.855 ± 0.493
0.696GlnHis: 0.696 ± 0.324
2.783GlnIle: 2.783 ± 0.611
2.783GlnLys: 2.783 ± 0.677
2.087GlnLeu: 2.087 ± 0.624
0.464GlnMet: 0.464 ± 0.282
1.855GlnAsn: 1.855 ± 0.554
1.16GlnPro: 1.16 ± 0.622
0.0GlnGln: 0.0 ± 0.0
1.391GlnArg: 1.391 ± 0.393
3.247GlnSer: 3.247 ± 1.247
2.087GlnThr: 2.087 ± 0.557
2.087GlnVal: 2.087 ± 0.477
0.696GlnTrp: 0.696 ± 0.255
0.928GlnTyr: 0.928 ± 0.347
0.0GlnXaa: 0.0 ± 0.0
Arg
1.623ArgAla: 1.623 ± 0.746
1.16ArgCys: 1.16 ± 0.357
3.015ArgAsp: 3.015 ± 0.762
2.551ArgGlu: 2.551 ± 0.607
2.783ArgPhe: 2.783 ± 0.714
2.087ArgGly: 2.087 ± 0.836
0.928ArgHis: 0.928 ± 0.258
2.783ArgIle: 2.783 ± 0.675
3.247ArgLys: 3.247 ± 0.488
2.783ArgLeu: 2.783 ± 0.981
1.16ArgMet: 1.16 ± 0.401
2.551ArgAsn: 2.551 ± 0.65
2.551ArgPro: 2.551 ± 1.234
1.855ArgGln: 1.855 ± 0.717
2.087ArgArg: 2.087 ± 0.462
2.783ArgSer: 2.783 ± 0.557
3.015ArgThr: 3.015 ± 0.569
1.855ArgVal: 1.855 ± 0.438
0.232ArgTrp: 0.232 ± 0.141
1.16ArgTyr: 1.16 ± 0.357
0.0ArgXaa: 0.0 ± 0.0
Ser
2.551SerAla: 2.551 ± 0.69
1.391SerCys: 1.391 ± 0.66
3.711SerAsp: 3.711 ± 0.725
4.638SerGlu: 4.638 ± 0.802
3.711SerPhe: 3.711 ± 1.274
2.783SerGly: 2.783 ± 0.664
4.174SerHis: 4.174 ± 0.758
6.03SerIle: 6.03 ± 1.336
4.87SerLys: 4.87 ± 1.104
10.204SerLeu: 10.204 ± 3.456
1.16SerMet: 1.16 ± 0.427
5.334SerAsn: 5.334 ± 1.32
3.711SerPro: 3.711 ± 1.514
2.783SerGln: 2.783 ± 0.624
2.551SerArg: 2.551 ± 0.709
6.725SerSer: 6.725 ± 1.635
4.87SerThr: 4.87 ± 1.273
4.174SerVal: 4.174 ± 0.947
2.087SerTrp: 2.087 ± 0.466
2.319SerTyr: 2.319 ± 0.67
0.0SerXaa: 0.0 ± 0.0
Thr
1.391ThrAla: 1.391 ± 0.535
1.391ThrCys: 1.391 ± 0.99
3.015ThrAsp: 3.015 ± 1.036
2.319ThrGlu: 2.319 ± 0.46
2.551ThrPhe: 2.551 ± 0.779
2.551ThrGly: 2.551 ± 0.673
1.16ThrHis: 1.16 ± 0.367
3.711ThrIle: 3.711 ± 0.709
5.566ThrLys: 5.566 ± 1.411
3.015ThrLeu: 3.015 ± 0.597
2.551ThrMet: 2.551 ± 0.792
3.247ThrAsn: 3.247 ± 0.671
1.16ThrPro: 1.16 ± 0.426
1.623ThrGln: 1.623 ± 0.605
2.551ThrArg: 2.551 ± 0.715
4.174ThrSer: 4.174 ± 1.437
2.551ThrThr: 2.551 ± 0.536
2.319ThrVal: 2.319 ± 1.509
1.391ThrTrp: 1.391 ± 0.732
2.319ThrTyr: 2.319 ± 0.942
0.0ThrXaa: 0.0 ± 0.0
Val
1.16ValAla: 1.16 ± 0.739
1.623ValCys: 1.623 ± 0.555
3.015ValAsp: 3.015 ± 0.94
1.623ValGlu: 1.623 ± 0.758
2.783ValPhe: 2.783 ± 0.937
3.942ValGly: 3.942 ± 1.732
0.928ValHis: 0.928 ± 0.809
2.783ValIle: 2.783 ± 0.609
3.015ValLys: 3.015 ± 0.734
5.334ValLeu: 5.334 ± 0.999
0.928ValMet: 0.928 ± 0.328
2.551ValAsn: 2.551 ± 0.673
2.783ValPro: 2.783 ± 0.59
1.391ValGln: 1.391 ± 0.442
1.855ValArg: 1.855 ± 0.629
5.334ValSer: 5.334 ± 0.587
1.623ValThr: 1.623 ± 0.485
2.319ValVal: 2.319 ± 0.956
1.16ValTrp: 1.16 ± 0.676
2.319ValTyr: 2.319 ± 1.47
0.0ValXaa: 0.0 ± 0.0
Trp
1.16TrpAla: 1.16 ± 0.567
0.0TrpCys: 0.0 ± 0.0
1.16TrpAsp: 1.16 ± 0.591
2.087TrpGlu: 2.087 ± 0.718
1.855TrpPhe: 1.855 ± 0.608
2.319TrpGly: 2.319 ± 0.922
0.464TrpHis: 0.464 ± 0.225
1.16TrpIle: 1.16 ± 0.487
0.928TrpLys: 0.928 ± 0.564
1.16TrpLeu: 1.16 ± 0.391
0.696TrpMet: 0.696 ± 0.379
1.623TrpAsn: 1.623 ± 0.532
0.696TrpPro: 0.696 ± 0.518
0.928TrpGln: 0.928 ± 0.337
0.696TrpArg: 0.696 ± 0.315
1.391TrpSer: 1.391 ± 0.626
0.928TrpThr: 0.928 ± 0.317
1.16TrpVal: 1.16 ± 0.492
0.696TrpTrp: 0.696 ± 0.484
0.928TrpTyr: 0.928 ± 0.905
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.928TyrAla: 0.928 ± 0.406
0.928TyrCys: 0.928 ± 0.519
1.855TyrAsp: 1.855 ± 0.531
2.783TyrGlu: 2.783 ± 1.321
2.319TyrPhe: 2.319 ± 0.408
1.623TyrGly: 1.623 ± 0.616
1.623TyrHis: 1.623 ± 0.367
3.711TyrIle: 3.711 ± 0.942
3.015TyrLys: 3.015 ± 0.823
5.102TyrLeu: 5.102 ± 1.069
0.232TyrMet: 0.232 ± 0.248
4.174TyrAsn: 4.174 ± 0.845
0.928TyrPro: 0.928 ± 0.428
0.464TyrGln: 0.464 ± 0.818
1.855TyrArg: 1.855 ± 0.794
3.711TyrSer: 3.711 ± 0.55
1.391TyrThr: 1.391 ± 0.673
2.087TyrVal: 2.087 ± 0.606
0.232TyrTrp: 0.232 ± 0.423
1.16TyrTyr: 1.16 ± 0.705
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (4313 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski