Amino acid dipepetide frequency for Hantaan orthohantavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.577AlaAla: 4.577 ± 0.069
1.346AlaCys: 1.346 ± 0.881
2.962AlaAsp: 2.962 ± 0.509
3.231AlaGlu: 3.231 ± 1.277
1.885AlaPhe: 1.885 ± 0.367
2.423AlaGly: 2.423 ± 0.918
2.154AlaHis: 2.154 ± 0.603
4.039AlaIle: 4.039 ± 0.784
3.231AlaLys: 3.231 ± 0.307
6.193AlaLeu: 6.193 ± 1.434
1.616AlaMet: 1.616 ± 0.86
2.423AlaAsn: 2.423 ± 0.637
2.154AlaPro: 2.154 ± 0.225
2.962AlaGln: 2.962 ± 0.786
1.885AlaArg: 1.885 ± 0.806
3.231AlaSer: 3.231 ± 0.84
3.77AlaThr: 3.77 ± 0.611
5.654AlaVal: 5.654 ± 1.245
1.077AlaTrp: 1.077 ± 0.233
2.423AlaTyr: 2.423 ± 0.637
0.0AlaXaa: 0.0 ± 0.0
Cys
1.346CysAla: 1.346 ± 0.717
0.269CysCys: 0.269 ± 0.25
1.077CysAsp: 1.077 ± 0.632
1.077CysGlu: 1.077 ± 0.999
2.154CysPhe: 2.154 ± 0.913
1.077CysGly: 1.077 ± 0.447
0.539CysHis: 0.539 ± 0.5
1.616CysIle: 1.616 ± 0.447
1.616CysLys: 1.616 ± 0.881
1.077CysLeu: 1.077 ± 0.795
0.269CysMet: 0.269 ± 0.25
1.616CysAsn: 1.616 ± 1.499
2.423CysPro: 2.423 ± 1.605
1.616CysGln: 1.616 ± 0.447
0.808CysArg: 0.808 ± 0.43
1.616CysSer: 1.616 ± 0.174
1.616CysThr: 1.616 ± 0.771
2.693CysVal: 2.693 ± 0.744
0.269CysTrp: 0.269 ± 0.25
1.346CysTyr: 1.346 ± 1.249
0.0CysXaa: 0.0 ± 0.0
Asp
2.154AspAla: 2.154 ± 0.915
1.346AspCys: 1.346 ± 0.528
3.231AspAsp: 3.231 ± 0.307
2.693AspGlu: 2.693 ± 0.933
1.885AspPhe: 1.885 ± 0.401
3.77AspGly: 3.77 ± 0.789
0.808AspHis: 0.808 ± 0.588
3.77AspIle: 3.77 ± 0.172
2.962AspLys: 2.962 ± 0.551
6.731AspLeu: 6.731 ± 0.981
2.154AspMet: 2.154 ± 0.466
2.962AspAsn: 2.962 ± 0.317
2.693AspPro: 2.693 ± 0.806
2.693AspGln: 2.693 ± 0.081
2.154AspArg: 2.154 ± 1.4
2.693AspSer: 2.693 ± 0.807
2.693AspThr: 2.693 ± 0.34
3.231AspVal: 3.231 ± 0.307
1.346AspTrp: 1.346 ± 0.709
2.154AspTyr: 2.154 ± 0.536
0.0AspXaa: 0.0 ± 0.0
Glu
4.847GluAla: 4.847 ± 0.181
1.616GluCys: 1.616 ± 0.447
3.5GluAsp: 3.5 ± 0.701
5.385GluGlu: 5.385 ± 0.307
2.423GluPhe: 2.423 ± 0.637
2.423GluGly: 2.423 ± 0.754
0.808GluHis: 0.808 ± 0.43
2.962GluIle: 2.962 ± 0.786
4.308GluLys: 4.308 ± 1.382
6.193GluLeu: 6.193 ± 1.275
1.077GluMet: 1.077 ± 0.255
3.231GluAsn: 3.231 ± 0.893
2.693GluPro: 2.693 ± 1.273
2.423GluGln: 2.423 ± 0.485
2.154GluArg: 2.154 ± 0.225
4.847GluSer: 4.847 ± 0.204
3.231GluThr: 3.231 ± 0.179
3.231GluVal: 3.231 ± 1.371
1.885GluTrp: 1.885 ± 0.394
1.346GluTyr: 1.346 ± 0.264
0.0GluXaa: 0.0 ± 0.0
Phe
2.154PheAla: 2.154 ± 0.536
0.808PheCys: 0.808 ± 0.385
1.346PheAsp: 1.346 ± 0.528
4.577PheGlu: 4.577 ± 0.736
3.5PhePhe: 3.5 ± 0.856
1.885PheGly: 1.885 ± 1.017
1.346PheHis: 1.346 ± 0.652
3.231PheIle: 3.231 ± 0.713
2.693PheLys: 2.693 ± 0.43
4.577PheLeu: 4.577 ± 0.736
1.616PheMet: 1.616 ± 0.659
2.962PheAsn: 2.962 ± 0.906
2.154PhePro: 2.154 ± 0.51
1.885PheGln: 1.885 ± 0.394
3.231PheArg: 3.231 ± 0.893
4.577PheSer: 4.577 ± 0.674
2.962PheThr: 2.962 ± 0.968
2.423PheVal: 2.423 ± 0.847
0.269PheTrp: 0.269 ± 0.143
0.808PheTyr: 0.808 ± 0.373
0.0PheXaa: 0.0 ± 0.0
Gly
3.5GlyAla: 3.5 ± 0.517
1.346GlyCys: 1.346 ± 0.652
2.962GlyAsp: 2.962 ± 0.317
4.039GlyGlu: 4.039 ± 0.984
2.154GlyPhe: 2.154 ± 0.225
2.154GlyGly: 2.154 ± 0.982
1.616GlyHis: 1.616 ± 0.304
4.308GlyIle: 4.308 ± 1.453
2.962GlyLys: 2.962 ± 0.762
5.385GlyLeu: 5.385 ± 0.848
1.885GlyMet: 1.885 ± 0.394
3.231GlyAsn: 3.231 ± 0.349
1.616GlyPro: 1.616 ± 0.881
2.423GlyGln: 2.423 ± 0.485
1.077GlyArg: 1.077 ± 0.686
2.693GlySer: 2.693 ± 1.056
3.231GlyThr: 3.231 ± 0.307
4.847GlyVal: 4.847 ± 0.557
0.808GlyTrp: 0.808 ± 0.385
2.423GlyTyr: 2.423 ± 0.545
0.0GlyXaa: 0.0 ± 0.0
His
1.616HisAla: 1.616 ± 0.509
1.077HisCys: 1.077 ± 0.298
1.346HisAsp: 1.346 ± 0.385
0.808HisGlu: 0.808 ± 0.373
0.808HisPhe: 0.808 ± 0.152
2.423HisGly: 2.423 ± 1.156
0.539HisHis: 0.539 ± 0.287
1.885HisIle: 1.885 ± 0.401
2.693HisLys: 2.693 ± 0.77
2.693HisLeu: 2.693 ± 1.273
0.539HisMet: 0.539 ± 0.287
0.808HisAsn: 0.808 ± 0.43
0.808HisPro: 0.808 ± 0.152
0.269HisGln: 0.269 ± 0.25
0.269HisArg: 0.269 ± 0.25
1.616HisSer: 1.616 ± 0.447
2.154HisThr: 2.154 ± 0.913
1.077HisVal: 1.077 ± 0.298
0.808HisTrp: 0.808 ± 0.385
0.808HisTyr: 0.808 ± 0.385
0.0HisXaa: 0.0 ± 0.0
Ile
4.847IleAla: 4.847 ± 0.97
1.346IleCys: 1.346 ± 0.528
5.385IleAsp: 5.385 ± 0.431
5.924IleGlu: 5.924 ± 0.282
3.5IlePhe: 3.5 ± 1.787
4.039IleGly: 4.039 ± 0.526
1.616IleHis: 1.616 ± 0.522
3.77IleIle: 3.77 ± 0.319
3.77IleLys: 3.77 ± 0.336
5.924IleLeu: 5.924 ± 0.234
1.616IleMet: 1.616 ± 0.298
1.616IleAsn: 1.616 ± 0.298
4.308IlePro: 4.308 ± 1.29
2.154IleGln: 2.154 ± 0.886
3.5IleArg: 3.5 ± 1.098
4.847IleSer: 4.847 ± 1.339
4.847IleThr: 4.847 ± 0.467
4.577IleVal: 4.577 ± 0.691
0.539IleTrp: 0.539 ± 0.429
1.346IleTyr: 1.346 ± 0.264
0.0IleXaa: 0.0 ± 0.0
Lys
4.577LysAla: 4.577 ± 0.891
1.885LysCys: 1.885 ± 1.017
3.5LysAsp: 3.5 ± 1.136
3.77LysGlu: 3.77 ± 1.499
3.5LysPhe: 3.5 ± 1.183
4.577LysGly: 4.577 ± 0.475
3.231LysHis: 3.231 ± 0.474
5.116LysIle: 5.116 ± 0.53
3.5LysLys: 3.5 ± 0.378
5.654LysLeu: 5.654 ± 1.601
1.077LysMet: 1.077 ± 0.255
1.885LysAsn: 1.885 ± 0.784
1.885LysPro: 1.885 ± 0.086
2.693LysGln: 2.693 ± 0.807
1.885LysArg: 1.885 ± 0.806
6.193LysSer: 6.193 ± 0.724
3.77LysThr: 3.77 ± 0.336
5.385LysVal: 5.385 ± 1.054
0.539LysTrp: 0.539 ± 0.149
2.693LysTyr: 2.693 ± 0.529
0.0LysXaa: 0.0 ± 0.0
Leu
5.116LeuAla: 5.116 ± 1.392
2.154LeuCys: 2.154 ± 0.913
6.462LeuAsp: 6.462 ± 1.23
5.385LeuGlu: 5.385 ± 1.539
5.924LeuPhe: 5.924 ± 0.775
4.847LeuGly: 4.847 ± 1.352
2.154LeuHis: 2.154 ± 0.51
7.27LeuIle: 7.27 ± 1.504
8.078LeuLys: 8.078 ± 0.425
8.078LeuLeu: 8.078 ± 1.29
1.616LeuMet: 1.616 ± 0.282
4.577LeuAsn: 4.577 ± 0.447
2.962LeuPro: 2.962 ± 0.317
3.231LeuGln: 3.231 ± 0.95
4.847LeuArg: 4.847 ± 1.049
5.924LeuSer: 5.924 ± 0.612
5.924LeuThr: 5.924 ± 2.084
4.847LeuVal: 4.847 ± 1.245
1.077LeuTrp: 1.077 ± 0.255
3.77LeuTyr: 3.77 ± 0.877
0.0LeuXaa: 0.0 ± 0.0
Met
1.616MetAla: 1.616 ± 0.629
0.269MetCys: 0.269 ± 0.25
1.346MetAsp: 1.346 ± 0.226
1.885MetGlu: 1.885 ± 0.534
0.808MetPhe: 0.808 ± 0.43
0.539MetGly: 0.539 ± 0.429
0.539MetHis: 0.539 ± 0.5
2.154MetIle: 2.154 ± 0.466
2.962MetLys: 2.962 ± 0.571
2.154MetLeu: 2.154 ± 0.598
0.808MetMet: 0.808 ± 0.152
1.077MetAsn: 1.077 ± 0.255
0.269MetPro: 0.269 ± 0.25
0.539MetGln: 0.539 ± 0.287
1.077MetArg: 1.077 ± 0.443
4.039MetSer: 4.039 ± 1.509
0.808MetThr: 0.808 ± 0.152
2.423MetVal: 2.423 ± 0.468
1.077MetTrp: 1.077 ± 0.298
0.539MetTyr: 0.539 ± 0.287
0.0MetXaa: 0.0 ± 0.0
Asn
1.616AsnAla: 1.616 ± 0.629
0.808AsnCys: 0.808 ± 0.152
1.616AsnAsp: 1.616 ± 0.447
1.077AsnGlu: 1.077 ± 0.573
2.154AsnPhe: 2.154 ± 0.225
2.154AsnGly: 2.154 ± 0.16
2.693AsnHis: 2.693 ± 0.43
4.577AsnIle: 4.577 ± 1.427
2.693AsnLys: 2.693 ± 0.74
4.847AsnLeu: 4.847 ± 0.876
1.077AsnMet: 1.077 ± 0.447
1.077AsnAsn: 1.077 ± 0.573
1.616AsnPro: 1.616 ± 0.447
0.808AsnGln: 0.808 ± 0.588
1.885AsnArg: 1.885 ± 0.086
3.231AsnSer: 3.231 ± 0.663
2.154AsnThr: 2.154 ± 0.603
1.616AsnVal: 1.616 ± 0.298
0.808AsnTrp: 0.808 ± 0.152
1.077AsnTyr: 1.077 ± 0.573
0.0AsnXaa: 0.0 ± 0.0
Pro
2.693ProAla: 2.693 ± 0.451
1.077ProCys: 1.077 ± 0.447
3.231ProAsp: 3.231 ± 0.792
1.616ProGlu: 1.616 ± 0.174
0.539ProPhe: 0.539 ± 0.287
4.577ProGly: 4.577 ± 1.449
1.616ProHis: 1.616 ± 0.771
2.154ProIle: 2.154 ± 0.225
1.346ProLys: 1.346 ± 0.226
2.423ProLeu: 2.423 ± 0.847
1.346ProMet: 1.346 ± 0.528
1.077ProAsn: 1.077 ± 0.443
1.077ProPro: 1.077 ± 0.233
1.616ProGln: 1.616 ± 0.447
1.346ProArg: 1.346 ± 0.385
2.962ProSer: 2.962 ± 0.571
3.231ProThr: 3.231 ± 0.799
2.154ProVal: 2.154 ± 0.603
0.269ProTrp: 0.269 ± 0.25
1.885ProTyr: 1.885 ± 0.394
0.0ProXaa: 0.0 ± 0.0
Gln
2.693GlnAla: 2.693 ± 0.807
1.346GlnCys: 1.346 ± 0.528
1.885GlnAsp: 1.885 ± 0.806
1.346GlnGlu: 1.346 ± 0.226
1.885GlnPhe: 1.885 ± 0.394
1.885GlnGly: 1.885 ± 0.394
1.346GlnHis: 1.346 ± 0.543
1.616GlnIle: 1.616 ± 0.298
1.885GlnLys: 1.885 ± 0.784
2.154GlnLeu: 2.154 ± 0.924
0.808GlnMet: 0.808 ± 0.314
2.154GlnAsn: 2.154 ± 0.596
0.539GlnPro: 0.539 ± 0.287
1.077GlnGln: 1.077 ± 0.255
2.154GlnArg: 2.154 ± 0.914
4.308GlnSer: 4.308 ± 1.478
2.154GlnThr: 2.154 ± 0.466
3.77GlnVal: 3.77 ± 0.319
1.077GlnTrp: 1.077 ± 0.443
1.616GlnTyr: 1.616 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
2.693ArgAla: 2.693 ± 1.185
1.077ArgCys: 1.077 ± 0.233
2.693ArgAsp: 2.693 ± 0.615
3.77ArgGlu: 3.77 ± 1.568
2.693ArgPhe: 2.693 ± 0.43
1.885ArgGly: 1.885 ± 0.086
1.346ArgHis: 1.346 ± 0.385
2.693ArgIle: 2.693 ± 1.273
3.77ArgLys: 3.77 ± 0.734
3.77ArgLeu: 3.77 ± 1.021
0.539ArgMet: 0.539 ± 0.287
2.154ArgAsn: 2.154 ± 0.536
0.269ArgPro: 0.269 ± 0.143
2.154ArgGln: 2.154 ± 2.631
2.154ArgArg: 2.154 ± 0.16
1.885ArgSer: 1.885 ± 0.367
2.693ArgThr: 2.693 ± 0.806
1.885ArgVal: 1.885 ± 0.401
0.539ArgTrp: 0.539 ± 0.287
2.423ArgTyr: 2.423 ± 0.291
0.0ArgXaa: 0.0 ± 0.0
Ser
2.423SerAla: 2.423 ± 0.82
1.616SerCys: 1.616 ± 1.13
2.962SerAsp: 2.962 ± 0.571
3.231SerGlu: 3.231 ± 0.307
4.847SerPhe: 4.847 ± 0.821
5.385SerGly: 5.385 ± 1.234
0.539SerHis: 0.539 ± 0.287
7.001SerIle: 7.001 ± 1.035
5.385SerLys: 5.385 ± 0.86
10.501SerLeu: 10.501 ± 1.495
4.308SerMet: 4.308 ± 0.959
2.693SerAsn: 2.693 ± 0.43
3.77SerPro: 3.77 ± 0.172
2.693SerGln: 2.693 ± 0.529
4.308SerArg: 4.308 ± 1.02
6.193SerSer: 6.193 ± 0.618
4.039SerThr: 4.039 ± 0.984
3.77SerVal: 3.77 ± 0.615
0.808SerTrp: 0.808 ± 0.385
2.962SerTyr: 2.962 ± 0.509
0.0SerXaa: 0.0 ± 0.0
Thr
5.116ThrAla: 5.116 ± 0.655
1.885ThrCys: 1.885 ± 1.119
1.616ThrAsp: 1.616 ± 0.174
4.577ThrGlu: 4.577 ± 0.849
3.77ThrPhe: 3.77 ± 0.789
2.693ThrGly: 2.693 ± 1.033
0.808ThrHis: 0.808 ± 0.385
3.77ThrIle: 3.77 ± 0.615
3.77ThrLys: 3.77 ± 0.375
4.039ThrLeu: 4.039 ± 1.589
1.616ThrMet: 1.616 ± 0.174
1.077ThrAsn: 1.077 ± 0.447
2.693ThrPro: 2.693 ± 0.716
2.154ThrGln: 2.154 ± 0.536
1.885ThrArg: 1.885 ± 0.569
6.193ThrSer: 6.193 ± 1.429
3.5ThrThr: 3.5 ± 1.068
4.847ThrVal: 4.847 ± 0.467
0.269ThrTrp: 0.269 ± 0.143
2.693ThrTyr: 2.693 ± 1.056
0.0ThrXaa: 0.0 ± 0.0
Val
3.5ValAla: 3.5 ± 0.823
2.693ValCys: 2.693 ± 1.472
4.577ValAsp: 4.577 ± 0.736
2.962ValGlu: 2.962 ± 0.215
1.885ValPhe: 1.885 ± 0.367
2.423ValGly: 2.423 ± 1.226
0.808ValHis: 0.808 ± 0.749
3.231ValIle: 3.231 ± 0.307
4.308ValLys: 4.308 ± 0.094
6.462ValLeu: 6.462 ± 0.948
1.346ValMet: 1.346 ± 0.307
1.885ValAsn: 1.885 ± 0.086
2.693ValPro: 2.693 ± 1.472
3.231ValGln: 3.231 ± 0.765
3.231ValArg: 3.231 ± 1.085
7.001ValSer: 7.001 ± 1.159
4.308ValThr: 4.308 ± 0.839
2.693ValVal: 2.693 ± 0.384
1.346ValTrp: 1.346 ± 0.264
3.231ValTyr: 3.231 ± 0.179
0.0ValXaa: 0.0 ± 0.0
Trp
1.077TrpAla: 1.077 ± 0.298
0.539TrpCys: 0.539 ± 0.149
0.269TrpAsp: 0.269 ± 0.143
0.539TrpGlu: 0.539 ± 0.287
2.154TrpPhe: 2.154 ± 0.51
1.616TrpGly: 1.616 ± 0.174
0.539TrpHis: 0.539 ± 0.149
0.808TrpIle: 0.808 ± 0.749
1.346TrpLys: 1.346 ± 0.264
1.885TrpLeu: 1.885 ± 0.569
0.269TrpMet: 0.269 ± 0.143
0.269TrpAsn: 0.269 ± 0.25
0.539TrpPro: 0.539 ± 0.149
0.0TrpGln: 0.0 ± 0.0
0.539TrpArg: 0.539 ± 0.149
1.616TrpSer: 1.616 ± 0.659
0.269TrpThr: 0.269 ± 0.25
1.077TrpVal: 1.077 ± 0.233
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.077TyrAla: 1.077 ± 0.573
1.616TyrCys: 1.616 ± 0.771
2.154TyrAsp: 2.154 ± 0.596
2.693TyrGlu: 2.693 ± 1.094
0.539TyrPhe: 0.539 ± 0.287
1.885TyrGly: 1.885 ± 0.086
0.0TyrHis: 0.0 ± 0.0
3.5TyrIle: 3.5 ± 1.183
4.039TyrLys: 4.039 ± 0.793
3.5TyrLeu: 3.5 ± 1.098
1.077TyrMet: 1.077 ± 0.243
0.808TyrAsn: 0.808 ± 0.43
1.077TyrPro: 1.077 ± 0.233
1.077TyrGln: 1.077 ± 0.233
2.693TyrArg: 2.693 ± 0.34
3.77TyrSer: 3.77 ± 0.789
1.616TyrThr: 1.616 ± 0.447
1.616TyrVal: 1.616 ± 0.174
0.539TyrTrp: 0.539 ± 0.287
1.885TyrTyr: 1.885 ± 0.394
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3715 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski