Amino acid dipepetide frequency for Hantaan virus (strain 76-118) (Korean hemorrhagic fever virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.847AlaAla: 4.847 ± 0.214
1.346AlaCys: 1.346 ± 0.896
2.693AlaAsp: 2.693 ± 0.605
3.231AlaGlu: 3.231 ± 1.806
1.885AlaPhe: 1.885 ± 0.393
2.423AlaGly: 2.423 ± 1.336
2.154AlaHis: 2.154 ± 0.611
4.577AlaIle: 4.577 ± 1.236
3.231AlaLys: 3.231 ± 0.428
5.924AlaLeu: 5.924 ± 1.345
1.616AlaMet: 1.616 ± 0.973
2.693AlaAsn: 2.693 ± 1.022
2.154AlaPro: 2.154 ± 0.235
2.962AlaGln: 2.962 ± 1.169
1.885AlaArg: 1.885 ± 0.966
3.5AlaSer: 3.5 ± 1.09
3.5AlaThr: 3.5 ± 1.008
5.116AlaVal: 5.116 ± 1.379
1.077AlaTrp: 1.077 ± 0.364
2.154AlaTyr: 2.154 ± 0.728
0.0AlaXaa: 0.0 ± 0.0
Cys
1.077CysAla: 1.077 ± 0.649
0.269CysCys: 0.269 ± 0.25
1.077CysAsp: 1.077 ± 0.649
1.077CysGlu: 1.077 ± 1.0
2.154CysPhe: 2.154 ± 0.981
1.077CysGly: 1.077 ± 0.553
0.539CysHis: 0.539 ± 0.5
1.616CysIle: 1.616 ± 0.546
1.616CysLys: 1.616 ± 0.917
1.077CysLeu: 1.077 ± 0.877
0.269CysMet: 0.269 ± 0.25
1.885CysAsn: 1.885 ± 1.749
2.423CysPro: 2.423 ± 1.607
1.616CysGln: 1.616 ± 0.546
1.077CysArg: 1.077 ± 0.649
1.346CysSer: 1.346 ± 0.303
1.616CysThr: 1.616 ± 0.812
2.693CysVal: 2.693 ± 0.91
0.269CysTrp: 0.269 ± 0.25
1.346CysTyr: 1.346 ± 1.249
0.0CysXaa: 0.0 ± 0.0
Asp
2.154AspAla: 2.154 ± 0.913
1.346AspCys: 1.346 ± 0.577
3.231AspAsp: 3.231 ± 0.428
2.962AspGlu: 2.962 ± 1.267
1.885AspPhe: 1.885 ± 0.561
3.5AspGly: 3.5 ± 1.062
0.808AspHis: 0.808 ± 0.713
3.77AspIle: 3.77 ± 0.259
2.962AspLys: 2.962 ± 0.855
6.731AspLeu: 6.731 ± 1.435
2.423AspMet: 2.423 ± 0.719
2.962AspAsn: 2.962 ± 0.492
2.423AspPro: 2.423 ± 1.328
2.423AspGln: 2.423 ± 0.713
2.154AspArg: 2.154 ± 2.08
2.693AspSer: 2.693 ± 0.823
2.693AspThr: 2.693 ± 0.34
2.962AspVal: 2.962 ± 0.603
1.616AspTrp: 1.616 ± 0.965
2.154AspTyr: 2.154 ± 0.534
0.0AspXaa: 0.0 ± 0.0
Glu
4.847GluAla: 4.847 ± 0.18
1.616GluCys: 1.616 ± 0.546
3.5GluAsp: 3.5 ± 0.718
5.385GluGlu: 5.385 ± 0.479
2.693GluPhe: 2.693 ± 1.022
2.423GluGly: 2.423 ± 0.949
0.808GluHis: 0.808 ± 0.487
3.231GluIle: 3.231 ± 1.092
4.308GluLys: 4.308 ± 1.44
6.193GluLeu: 6.193 ± 1.692
0.808GluMet: 0.808 ± 0.487
2.693GluAsn: 2.693 ± 1.155
2.962GluPro: 2.962 ± 1.855
2.423GluGln: 2.423 ± 0.719
2.154GluArg: 2.154 ± 0.235
5.116GluSer: 5.116 ± 0.373
3.231GluThr: 3.231 ± 0.262
3.231GluVal: 3.231 ± 1.623
1.616GluTrp: 1.616 ± 0.475
1.346GluTyr: 1.346 ± 0.391
0.0GluXaa: 0.0 ± 0.0
Phe
2.154PheAla: 2.154 ± 0.534
0.808PheCys: 0.808 ± 0.406
1.346PheAsp: 1.346 ± 0.577
4.577PheGlu: 4.577 ± 0.934
3.5PhePhe: 3.5 ± 0.927
1.885PheGly: 1.885 ± 1.053
1.346PheHis: 1.346 ± 0.715
3.231PheIle: 3.231 ± 0.766
2.423PheLys: 2.423 ± 0.674
4.577PheLeu: 4.577 ± 0.934
1.616PheMet: 1.616 ± 0.66
3.231PheAsn: 3.231 ± 1.33
2.154PhePro: 2.154 ± 0.728
1.885PheGln: 1.885 ± 0.593
3.5PheArg: 3.5 ± 1.292
4.577PheSer: 4.577 ± 0.797
2.693PheThr: 2.693 ± 1.155
2.423PheVal: 2.423 ± 0.847
0.269PheTrp: 0.269 ± 0.162
0.808PheTyr: 0.808 ± 0.482
0.0PheXaa: 0.0 ± 0.0
Gly
3.5GlyAla: 3.5 ± 0.606
1.346GlyCys: 1.346 ± 0.715
2.962GlyAsp: 2.962 ± 0.492
4.039GlyGlu: 4.039 ± 1.276
1.885GlyPhe: 1.885 ± 0.393
2.154GlyGly: 2.154 ± 0.99
1.616GlyHis: 1.616 ± 0.475
4.308GlyIle: 4.308 ± 1.641
2.962GlyLys: 2.962 ± 1.192
5.385GlyLeu: 5.385 ± 1.137
1.885GlyMet: 1.885 ± 0.593
3.231GlyAsn: 3.231 ± 0.492
1.616GlyPro: 1.616 ± 0.917
2.693GlyGln: 2.693 ± 1.304
1.077GlyArg: 1.077 ± 1.068
2.693GlySer: 2.693 ± 1.155
3.231GlyThr: 3.231 ± 0.428
4.847GlyVal: 4.847 ± 0.555
0.808GlyTrp: 0.808 ± 0.406
2.423GlyTyr: 2.423 ± 0.738
0.0GlyXaa: 0.0 ± 0.0
His
1.346HisAla: 1.346 ± 0.715
1.077HisCys: 1.077 ± 0.364
1.346HisAsp: 1.346 ± 0.511
0.808HisGlu: 0.808 ± 0.482
0.808HisPhe: 0.808 ± 0.238
2.423HisGly: 2.423 ± 1.218
0.539HisHis: 0.539 ± 0.324
1.885HisIle: 1.885 ± 0.561
2.693HisLys: 2.693 ± 1.022
2.693HisLeu: 2.693 ± 1.917
0.539HisMet: 0.539 ± 0.324
0.808HisAsn: 0.808 ± 0.487
0.808HisPro: 0.808 ± 0.238
0.269HisGln: 0.269 ± 0.25
0.269HisArg: 0.269 ± 0.25
1.616HisSer: 1.616 ± 0.546
2.423HisThr: 2.423 ± 0.932
1.077HisVal: 1.077 ± 0.364
0.808HisTrp: 0.808 ± 0.406
0.808HisTyr: 0.808 ± 0.406
0.0HisXaa: 0.0 ± 0.0
Ile
4.847IleAla: 4.847 ± 1.438
1.616IleCys: 1.616 ± 0.546
5.116IleAsp: 5.116 ± 0.556
6.731IleGlu: 6.731 ± 0.749
3.5IlePhe: 3.5 ± 1.864
4.039IleGly: 4.039 ± 0.616
1.616IleHis: 1.616 ± 0.665
4.039IleIle: 4.039 ± 0.285
3.77IleLys: 3.77 ± 0.482
5.654IleLeu: 5.654 ± 0.382
1.346IleMet: 1.346 ± 0.303
1.616IleAsn: 1.616 ± 0.322
4.308IlePro: 4.308 ± 1.685
2.154IleGln: 2.154 ± 0.995
3.5IleArg: 3.5 ± 1.278
4.847IleSer: 4.847 ± 1.349
4.577IleThr: 4.577 ± 0.71
4.847IleVal: 4.847 ± 0.555
0.808IleTrp: 0.808 ± 0.713
1.346IleTyr: 1.346 ± 0.391
0.0IleXaa: 0.0 ± 0.0
Lys
4.577LysAla: 4.577 ± 1.37
1.616LysCys: 1.616 ± 0.812
3.77LysAsp: 3.77 ± 1.634
3.5LysGlu: 3.5 ± 2.353
3.5LysPhe: 3.5 ± 1.487
4.577LysGly: 4.577 ± 0.71
3.231LysHis: 3.231 ± 0.492
5.116LysIle: 5.116 ± 0.826
3.5LysLys: 3.5 ± 0.435
5.654LysLeu: 5.654 ± 1.594
1.077LysMet: 1.077 ± 0.364
1.885LysAsn: 1.885 ± 0.78
1.885LysPro: 1.885 ± 0.129
2.693LysGln: 2.693 ± 0.823
1.885LysArg: 1.885 ± 0.966
6.193LysSer: 6.193 ± 1.112
3.77LysThr: 3.77 ± 0.482
5.385LysVal: 5.385 ± 1.048
0.539LysTrp: 0.539 ± 0.182
2.693LysTyr: 2.693 ± 0.782
0.0LysXaa: 0.0 ± 0.0
Leu
5.116LeuAla: 5.116 ± 1.594
2.154LeuCys: 2.154 ± 0.981
6.193LeuAsp: 6.193 ± 1.372
5.385LeuGlu: 5.385 ± 2.043
5.924LeuPhe: 5.924 ± 0.858
5.116LeuGly: 5.116 ± 2.566
2.154LeuHis: 2.154 ± 0.728
7.808LeuIle: 7.808 ± 1.36
8.078LeuLys: 8.078 ± 0.483
8.078LeuLeu: 8.078 ± 1.345
1.346LeuMet: 1.346 ± 0.53
4.577LeuAsn: 4.577 ± 0.446
2.962LeuPro: 2.962 ± 0.492
3.231LeuGln: 3.231 ± 0.966
4.847LeuArg: 4.847 ± 1.389
5.654LeuSer: 5.654 ± 0.696
5.924LeuThr: 5.924 ± 2.134
4.847LeuVal: 4.847 ± 1.304
1.077LeuTrp: 1.077 ± 0.364
3.77LeuTyr: 3.77 ± 1.08
0.0LeuXaa: 0.0 ± 0.0
Met
1.616MetAla: 1.616 ± 0.951
0.269MetCys: 0.269 ± 0.25
1.616MetAsp: 1.616 ± 0.246
1.616MetGlu: 1.616 ± 0.879
0.808MetPhe: 0.808 ± 0.487
0.539MetGly: 0.539 ± 0.61
0.539MetHis: 0.539 ± 0.5
2.154MetIle: 2.154 ± 0.728
2.693MetLys: 2.693 ± 0.448
2.154MetLeu: 2.154 ± 0.784
0.808MetMet: 0.808 ± 0.238
0.808MetAsn: 0.808 ± 0.238
0.0MetPro: 0.0 ± 0.0
0.539MetGln: 0.539 ± 0.324
1.077MetArg: 1.077 ± 0.497
4.039MetSer: 4.039 ± 1.606
0.808MetThr: 0.808 ± 0.238
2.423MetVal: 2.423 ± 0.467
0.808MetTrp: 0.808 ± 0.238
0.539MetTyr: 0.539 ± 0.324
0.0MetXaa: 0.0 ± 0.0
Asn
1.616AsnAla: 1.616 ± 0.951
0.808AsnCys: 0.808 ± 0.238
1.616AsnAsp: 1.616 ± 0.546
1.077AsnGlu: 1.077 ± 0.649
2.154AsnPhe: 2.154 ± 0.235
2.154AsnGly: 2.154 ± 0.16
2.693AsnHis: 2.693 ± 0.448
4.308AsnIle: 4.308 ± 1.685
2.693AsnLys: 2.693 ± 0.818
4.577AsnLeu: 4.577 ± 0.934
1.616AsnMet: 1.616 ± 0.554
1.077AsnAsn: 1.077 ± 0.649
1.616AsnPro: 1.616 ± 0.546
0.539AsnGln: 0.539 ± 0.61
1.885AsnArg: 1.885 ± 0.129
3.231AsnSer: 3.231 ± 0.95
2.154AsnThr: 2.154 ± 0.611
1.885AsnVal: 1.885 ± 0.41
0.808AsnTrp: 0.808 ± 0.238
1.077AsnTyr: 1.077 ± 0.649
0.0AsnXaa: 0.0 ± 0.0
Pro
2.962ProAla: 2.962 ± 0.603
1.077ProCys: 1.077 ± 0.553
3.231ProAsp: 3.231 ± 1.117
1.616ProGlu: 1.616 ± 0.246
0.539ProPhe: 0.539 ± 0.324
4.577ProGly: 4.577 ± 1.455
1.616ProHis: 1.616 ± 0.812
2.154ProIle: 2.154 ± 0.235
1.346ProLys: 1.346 ± 0.303
2.423ProLeu: 2.423 ± 0.847
1.077ProMet: 1.077 ± 0.364
1.077ProAsn: 1.077 ± 0.497
1.077ProPro: 1.077 ± 0.364
1.616ProGln: 1.616 ± 0.546
1.346ProArg: 1.346 ± 0.511
2.962ProSer: 2.962 ± 0.606
2.962ProThr: 2.962 ± 1.327
2.423ProVal: 2.423 ± 0.847
0.269ProTrp: 0.269 ± 0.25
1.885ProTyr: 1.885 ± 0.593
0.0ProXaa: 0.0 ± 0.0
Gln
2.693GlnAla: 2.693 ± 0.823
1.346GlnCys: 1.346 ± 0.577
1.885GlnAsp: 1.885 ± 0.966
1.346GlnGlu: 1.346 ± 0.303
1.885GlnPhe: 1.885 ± 0.593
1.885GlnGly: 1.885 ± 0.593
1.346GlnHis: 1.346 ± 0.561
1.616GlnIle: 1.616 ± 0.322
1.885GlnLys: 1.885 ± 0.78
2.154GlnLeu: 2.154 ± 1.398
0.808GlnMet: 0.808 ± 0.475
2.423GlnAsn: 2.423 ± 0.738
0.539GlnPro: 0.539 ± 0.324
1.077GlnGln: 1.077 ± 0.364
2.154GlnArg: 2.154 ± 1.429
4.039GlnSer: 4.039 ± 2.096
2.154GlnThr: 2.154 ± 0.728
3.77GlnVal: 3.77 ± 0.435
1.077GlnTrp: 1.077 ± 0.497
1.616GlnTyr: 1.616 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
2.693ArgAla: 2.693 ± 1.622
1.346ArgCys: 1.346 ± 0.396
2.693ArgAsp: 2.693 ± 0.793
3.5ArgGlu: 3.5 ± 1.435
2.693ArgPhe: 2.693 ± 0.448
1.885ArgGly: 1.885 ± 0.129
1.346ArgHis: 1.346 ± 0.511
2.693ArgIle: 2.693 ± 1.917
3.77ArgLys: 3.77 ± 0.786
3.77ArgLeu: 3.77 ± 1.382
0.539ArgMet: 0.539 ± 0.324
2.154ArgAsn: 2.154 ± 0.534
0.269ArgPro: 0.269 ± 0.162
2.154ArgGln: 2.154 ± 4.11
2.154ArgArg: 2.154 ± 0.16
2.154ArgSer: 2.154 ± 0.611
2.693ArgThr: 2.693 ± 1.917
1.885ArgVal: 1.885 ± 0.561
0.539ArgTrp: 0.539 ± 0.324
2.693ArgTyr: 2.693 ± 0.448
0.0ArgXaa: 0.0 ± 0.0
Ser
2.423SerAla: 2.423 ± 0.932
1.616SerCys: 1.616 ± 1.144
2.962SerAsp: 2.962 ± 0.606
2.962SerGlu: 2.962 ± 0.269
4.847SerPhe: 4.847 ± 0.997
5.385SerGly: 5.385 ± 1.653
0.539SerHis: 0.539 ± 0.324
6.731SerIle: 6.731 ± 1.076
5.385SerLys: 5.385 ± 0.897
10.77SerLeu: 10.77 ± 1.929
3.77SerMet: 3.77 ± 1.637
2.154SerAsn: 2.154 ± 0.534
3.77SerPro: 3.77 ± 0.259
2.962SerGln: 2.962 ± 0.916
4.308SerArg: 4.308 ± 1.455
6.731SerSer: 6.731 ± 0.846
4.039SerThr: 4.039 ± 1.276
4.308SerVal: 4.308 ± 0.834
0.808SerTrp: 0.808 ± 0.406
3.5SerTyr: 3.5 ± 0.358
0.0SerXaa: 0.0 ± 0.0
Thr
4.847ThrAla: 4.847 ± 0.716
1.885ThrCys: 1.885 ± 1.139
1.885ThrAsp: 1.885 ± 0.129
4.577ThrGlu: 4.577 ± 1.327
3.77ThrPhe: 3.77 ± 1.185
2.693ThrGly: 2.693 ± 1.466
0.808ThrHis: 0.808 ± 0.406
3.5ThrIle: 3.5 ± 0.435
3.77ThrLys: 3.77 ± 0.445
4.308ThrLeu: 4.308 ± 1.601
1.616ThrMet: 1.616 ± 0.246
1.346ThrAsn: 1.346 ± 0.396
2.693ThrPro: 2.693 ± 0.711
2.154ThrGln: 2.154 ± 0.534
1.885ThrArg: 1.885 ± 0.817
6.193ThrSer: 6.193 ± 1.43
3.5ThrThr: 3.5 ± 1.069
4.847ThrVal: 4.847 ± 0.793
0.269ThrTrp: 0.269 ± 0.162
2.154ThrTyr: 2.154 ± 0.728
0.0ThrXaa: 0.0 ± 0.0
Val
3.5ValAla: 3.5 ± 1.087
2.423ValCys: 2.423 ± 1.607
4.308ValAsp: 4.308 ± 1.074
2.962ValGlu: 2.962 ± 0.269
1.885ValPhe: 1.885 ± 0.393
2.423ValGly: 2.423 ± 1.227
0.808ValHis: 0.808 ± 0.75
3.231ValIle: 3.231 ± 0.428
4.577ValLys: 4.577 ± 0.07
6.731ValLeu: 6.731 ± 1.229
1.346ValMet: 1.346 ± 0.396
1.885ValAsn: 1.885 ± 0.129
2.962ValPro: 2.962 ± 1.711
3.231ValGln: 3.231 ± 1.092
3.231ValArg: 3.231 ± 1.131
7.001ValSer: 7.001 ± 1.161
4.577ValThr: 4.577 ± 1.327
2.423ValVal: 2.423 ± 0.09
1.346ValTrp: 1.346 ± 0.391
3.231ValTyr: 3.231 ± 0.262
0.0ValXaa: 0.0 ± 0.0
Trp
1.077TrpAla: 1.077 ± 0.364
0.539TrpCys: 0.539 ± 0.182
0.269TrpAsp: 0.269 ± 0.162
0.539TrpGlu: 0.539 ± 0.324
2.154TrpPhe: 2.154 ± 0.728
1.616TrpGly: 1.616 ± 0.246
0.539TrpHis: 0.539 ± 0.182
0.808TrpIle: 0.808 ± 0.75
1.346TrpLys: 1.346 ± 0.391
1.885TrpLeu: 1.885 ± 0.817
0.0TrpMet: 0.0 ± 0.0
0.269TrpAsn: 0.269 ± 0.25
0.539TrpPro: 0.539 ± 0.182
0.0TrpGln: 0.0 ± 0.0
0.539TrpArg: 0.539 ± 0.182
1.616TrpSer: 1.616 ± 0.66
0.539TrpThr: 0.539 ± 0.182
1.077TrpVal: 1.077 ± 0.364
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.077TyrAla: 1.077 ± 0.649
1.616TyrCys: 1.616 ± 0.812
2.154TyrAsp: 2.154 ± 0.728
2.693TyrGlu: 2.693 ± 1.089
0.539TyrPhe: 0.539 ± 0.324
1.885TyrGly: 1.885 ± 0.129
0.0TyrHis: 0.0 ± 0.0
3.5TyrIle: 3.5 ± 1.487
4.039TyrLys: 4.039 ± 1.172
3.5TyrLeu: 3.5 ± 1.278
1.077TyrMet: 1.077 ± 0.325
0.808TyrAsn: 0.808 ± 0.487
1.077TyrPro: 1.077 ± 0.364
1.077TyrGln: 1.077 ± 0.364
2.693TyrArg: 2.693 ± 0.34
3.77TyrSer: 3.77 ± 1.185
1.616TyrThr: 1.616 ± 0.546
1.616TyrVal: 1.616 ± 0.246
0.539TyrTrp: 0.539 ± 0.324
1.885TyrTyr: 1.885 ± 0.593
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3715 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski