Amino acid dipepetide frequency for Seoul orthohantavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.388AlaAla: 5.388 ± 0.342
2.155AlaCys: 2.155 ± 1.745
2.963AlaAsp: 2.963 ± 1.039
3.772AlaGlu: 3.772 ± 1.357
1.886AlaPhe: 1.886 ± 0.4
3.233AlaGly: 3.233 ± 1.765
2.694AlaHis: 2.694 ± 0.366
4.041AlaIle: 4.041 ± 1.057
3.502AlaLys: 3.502 ± 0.545
7.274AlaLeu: 7.274 ± 1.575
1.886AlaMet: 1.886 ± 0.437
2.155AlaAsn: 2.155 ± 0.442
2.425AlaPro: 2.425 ± 0.83
2.425AlaGln: 2.425 ± 1.197
1.886AlaArg: 1.886 ± 0.614
2.963AlaSer: 2.963 ± 1.266
2.963AlaThr: 2.963 ± 0.634
5.119AlaVal: 5.119 ± 1.591
1.078AlaTrp: 1.078 ± 0.256
2.425AlaTyr: 2.425 ± 1.408
0.0AlaXaa: 0.0 ± 0.0
Cys
1.616CysAla: 1.616 ± 0.939
0.269CysCys: 0.269 ± 0.268
0.808CysAsp: 0.808 ± 0.412
1.347CysGlu: 1.347 ± 1.339
2.155CysPhe: 2.155 ± 0.976
1.078CysGly: 1.078 ± 0.492
0.808CysHis: 0.808 ± 0.803
1.886CysIle: 1.886 ± 0.433
1.347CysLys: 1.347 ± 1.109
0.808CysLeu: 0.808 ± 0.646
0.539CysMet: 0.539 ± 0.159
2.155CysAsn: 2.155 ± 2.143
1.886CysPro: 1.886 ± 1.618
1.347CysGln: 1.347 ± 0.565
0.539CysArg: 0.539 ± 0.313
0.539CysSer: 0.539 ± 0.159
2.155CysThr: 2.155 ± 1.061
2.425CysVal: 2.425 ± 0.877
0.269CysTrp: 0.269 ± 0.268
1.347CysTyr: 1.347 ± 1.339
0.0CysXaa: 0.0 ± 0.0
Asp
2.694AspAla: 2.694 ± 0.789
1.616AspCys: 1.616 ± 0.335
3.233AspAsp: 3.233 ± 1.096
2.425AspGlu: 2.425 ± 0.993
1.886AspPhe: 1.886 ± 0.433
3.772AspGly: 3.772 ± 0.874
1.078AspHis: 1.078 ± 0.492
3.233AspIle: 3.233 ± 0.774
2.694AspLys: 2.694 ± 0.598
7.274AspLeu: 7.274 ± 1.624
2.425AspMet: 2.425 ± 0.83
2.963AspAsn: 2.963 ± 0.347
2.155AspPro: 2.155 ± 1.009
2.694AspGln: 2.694 ± 0.484
2.425AspArg: 2.425 ± 1.515
3.233AspSer: 3.233 ± 1.175
1.347AspThr: 1.347 ± 0.287
3.233AspVal: 3.233 ± 0.774
1.347AspTrp: 1.347 ± 0.763
1.347AspTyr: 1.347 ± 0.242
0.0AspXaa: 0.0 ± 0.0
Glu
3.233GluAla: 3.233 ± 0.195
1.616GluCys: 1.616 ± 1.21
3.502GluAsp: 3.502 ± 0.252
5.119GluGlu: 5.119 ± 0.697
2.963GluPhe: 2.963 ± 0.619
2.425GluGly: 2.425 ± 0.505
1.078GluHis: 1.078 ± 0.472
3.772GluIle: 3.772 ± 1.182
4.31GluLys: 4.31 ± 0.777
5.927GluLeu: 5.927 ± 0.651
1.616GluMet: 1.616 ± 0.939
2.694GluAsn: 2.694 ± 0.797
3.772GluPro: 3.772 ± 1.438
1.886GluGln: 1.886 ± 0.437
2.155GluArg: 2.155 ± 0.17
5.119GluSer: 5.119 ± 0.749
3.502GluThr: 3.502 ± 0.762
3.772GluVal: 3.772 ± 1.127
1.886GluTrp: 1.886 ± 0.093
1.616GluTyr: 1.616 ± 0.478
0.0GluXaa: 0.0 ± 0.0
Phe
2.694PheAla: 2.694 ± 0.366
0.808PheCys: 0.808 ± 0.803
1.347PheAsp: 1.347 ± 0.424
3.772PheGlu: 3.772 ± 0.874
3.233PhePhe: 3.233 ± 0.774
1.616PheGly: 1.616 ± 0.824
1.616PheHis: 1.616 ± 0.193
3.233PheIle: 3.233 ± 0.846
4.31PheLys: 4.31 ± 0.5
4.31PheLeu: 4.31 ± 1.013
1.886PheMet: 1.886 ± 0.86
3.502PheAsn: 3.502 ± 1.3
2.155PhePro: 2.155 ± 0.564
1.886PheGln: 1.886 ± 0.438
2.694PheArg: 2.694 ± 0.771
4.31PheSer: 4.31 ± 0.812
2.425PheThr: 2.425 ± 0.502
2.155PheVal: 2.155 ± 0.669
0.269PheTrp: 0.269 ± 0.156
1.078PheTyr: 1.078 ± 0.472
0.0PheXaa: 0.0 ± 0.0
Gly
3.502GlyAla: 3.502 ± 1.762
1.078GlyCys: 1.078 ± 0.492
2.963GlyAsp: 2.963 ± 0.347
4.31GlyGlu: 4.31 ± 1.018
1.886GlyPhe: 1.886 ± 0.4
2.155GlyGly: 2.155 ± 1.061
1.886GlyHis: 1.886 ± 0.437
4.849GlyIle: 4.849 ± 1.419
3.502GlyLys: 3.502 ± 0.383
6.466GlyLeu: 6.466 ± 0.962
2.425GlyMet: 2.425 ± 0.934
2.963GlyAsn: 2.963 ± 0.53
1.616GlyPro: 1.616 ± 1.21
3.233GlyGln: 3.233 ± 1.475
1.078GlyArg: 1.078 ± 0.758
3.772GlySer: 3.772 ± 0.344
2.425GlyThr: 2.425 ± 0.098
3.233GlyVal: 3.233 ± 0.195
0.808GlyTrp: 0.808 ± 0.412
2.694GlyTyr: 2.694 ± 0.574
0.0GlyXaa: 0.0 ± 0.0
His
1.347HisAla: 1.347 ± 0.287
0.808HisCys: 0.808 ± 0.412
1.347HisAsp: 1.347 ± 0.242
1.616HisGlu: 1.616 ± 0.705
1.347HisPhe: 1.347 ± 0.565
1.616HisGly: 1.616 ± 0.824
0.808HisHis: 0.808 ± 0.167
1.886HisIle: 1.886 ± 0.093
1.347HisLys: 1.347 ± 0.424
2.963HisLeu: 2.963 ± 0.841
0.269HisMet: 0.269 ± 0.156
0.539HisAsn: 0.539 ± 0.159
1.078HisPro: 1.078 ± 0.282
1.078HisGln: 1.078 ± 0.319
1.347HisArg: 1.347 ± 0.287
2.155HisSer: 2.155 ± 0.245
1.886HisThr: 1.886 ± 0.72
1.078HisVal: 1.078 ± 0.626
0.808HisTrp: 0.808 ± 0.412
0.808HisTyr: 0.808 ± 0.167
0.0HisXaa: 0.0 ± 0.0
Ile
2.694IleAla: 2.694 ± 0.936
2.425IleCys: 2.425 ± 1.619
4.041IleAsp: 4.041 ± 0.602
4.849IleGlu: 4.849 ± 1.011
1.886IlePhe: 1.886 ± 0.433
4.041IleGly: 4.041 ± 0.563
1.616IleHis: 1.616 ± 0.478
5.927IleIle: 5.927 ± 1.851
4.041IleLys: 4.041 ± 1.241
7.543IleLeu: 7.543 ± 0.562
1.616IleMet: 1.616 ± 0.705
1.886IleAsn: 1.886 ± 0.433
4.849IlePro: 4.849 ± 1.407
3.502IleGln: 3.502 ± 0.721
2.963IleArg: 2.963 ± 1.329
6.466IleSer: 6.466 ± 0.778
5.119IleThr: 5.119 ± 0.626
3.772IleVal: 3.772 ± 0.186
1.078IleTrp: 1.078 ± 0.492
1.078IleTyr: 1.078 ± 0.282
0.0IleXaa: 0.0 ± 0.0
Lys
3.772LysAla: 3.772 ± 0.862
1.616LysCys: 1.616 ± 1.21
4.31LysAsp: 4.31 ± 1.972
4.31LysGlu: 4.31 ± 1.085
3.772LysPhe: 3.772 ± 1.085
3.772LysGly: 3.772 ± 0.371
2.694LysHis: 2.694 ± 0.465
4.58LysIle: 4.58 ± 0.482
4.041LysLys: 4.041 ± 0.92
4.849LysLeu: 4.849 ± 1.443
1.078LysMet: 1.078 ± 0.626
2.694LysAsn: 2.694 ± 0.089
2.155LysPro: 2.155 ± 0.245
2.425LysGln: 2.425 ± 0.721
1.616LysArg: 1.616 ± 0.697
4.31LysSer: 4.31 ± 0.658
5.119LysThr: 5.119 ± 0.585
6.196LysVal: 6.196 ± 0.417
0.269LysTrp: 0.269 ± 0.156
2.425LysTyr: 2.425 ± 0.502
0.0LysXaa: 0.0 ± 0.0
Leu
7.004LeuAla: 7.004 ± 0.949
2.155LeuCys: 2.155 ± 0.976
5.657LeuAsp: 5.657 ± 1.313
5.119LeuGlu: 5.119 ± 2.225
5.657LeuPhe: 5.657 ± 1.082
4.849LeuGly: 4.849 ± 0.951
1.886LeuHis: 1.886 ± 0.093
7.274LeuIle: 7.274 ± 1.122
6.466LeuLys: 6.466 ± 0.736
8.89LeuLeu: 8.89 ± 1.842
1.616LeuMet: 1.616 ± 0.742
4.58LeuAsn: 4.58 ± 0.887
2.425LeuPro: 2.425 ± 0.877
3.772LeuGln: 3.772 ± 0.7
5.927LeuArg: 5.927 ± 1.422
6.466LeuSer: 6.466 ± 0.778
5.657LeuThr: 5.657 ± 2.004
5.388LeuVal: 5.388 ± 0.53
1.347LeuTrp: 1.347 ± 0.287
3.233LeuTyr: 3.233 ± 1.096
0.0LeuXaa: 0.0 ± 0.0
Met
1.616MetAla: 1.616 ± 1.596
0.269MetCys: 0.269 ± 0.268
2.155MetAsp: 2.155 ± 0.17
1.886MetGlu: 1.886 ± 0.591
1.078MetPhe: 1.078 ± 0.472
1.078MetGly: 1.078 ± 0.867
0.539MetHis: 0.539 ± 0.159
1.078MetIle: 1.078 ± 0.282
2.694MetLys: 2.694 ± 0.789
1.886MetLeu: 1.886 ± 0.093
0.808MetMet: 0.808 ± 0.167
0.539MetAsn: 0.539 ± 0.159
0.269MetPro: 0.269 ± 0.422
0.539MetGln: 0.539 ± 0.313
1.886MetArg: 1.886 ± 0.614
4.041MetSer: 4.041 ± 2.038
1.616MetThr: 1.616 ± 0.939
1.616MetVal: 1.616 ± 0.697
0.808MetTrp: 0.808 ± 0.412
0.808MetTyr: 0.808 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
1.347AsnAla: 1.347 ± 0.339
0.539AsnCys: 0.539 ± 0.313
1.616AsnAsp: 1.616 ± 0.335
1.886AsnGlu: 1.886 ± 0.727
2.425AsnPhe: 2.425 ± 0.313
2.425AsnGly: 2.425 ± 0.098
0.539AsnHis: 0.539 ± 0.159
5.388AsnIle: 5.388 ± 0.633
1.886AsnLys: 1.886 ± 0.614
4.58AsnLeu: 4.58 ± 0.933
1.347AsnMet: 1.347 ± 0.339
2.155AsnAsn: 2.155 ± 0.564
2.694AsnPro: 2.694 ± 0.089
1.347AsnGln: 1.347 ± 0.711
2.425AsnArg: 2.425 ± 0.098
2.963AsnSer: 2.963 ± 0.248
2.425AsnThr: 2.425 ± 0.83
2.694AsnVal: 2.694 ± 0.849
0.539AsnTrp: 0.539 ± 0.159
0.808AsnTyr: 0.808 ± 0.469
0.0AsnXaa: 0.0 ± 0.0
Pro
2.425ProAla: 2.425 ± 0.313
1.078ProCys: 1.078 ± 0.492
3.502ProAsp: 3.502 ± 0.383
2.694ProGlu: 2.694 ± 0.089
0.808ProPhe: 0.808 ± 0.167
4.58ProGly: 4.58 ± 1.565
1.616ProHis: 1.616 ± 0.824
1.886ProIle: 1.886 ± 0.614
1.616ProLys: 1.616 ± 0.628
1.886ProLeu: 1.886 ± 0.72
1.616ProMet: 1.616 ± 0.555
1.347ProAsn: 1.347 ± 0.242
1.347ProPro: 1.347 ± 0.339
0.808ProGln: 0.808 ± 0.167
1.078ProArg: 1.078 ± 0.626
2.963ProSer: 2.963 ± 0.619
3.233ProThr: 3.233 ± 0.768
2.425ProVal: 2.425 ± 1.193
0.269ProTrp: 0.269 ± 0.268
1.886ProTyr: 1.886 ± 0.437
0.0ProXaa: 0.0 ± 0.0
Gln
4.041GlnAla: 4.041 ± 0.511
1.078GlnCys: 1.078 ± 0.319
1.616GlnAsp: 1.616 ± 0.798
1.886GlnGlu: 1.886 ± 0.4
1.886GlnPhe: 1.886 ± 0.437
2.694GlnGly: 2.694 ± 0.771
1.886GlnHis: 1.886 ± 0.842
2.425GlnIle: 2.425 ± 0.538
3.233GlnLys: 3.233 ± 0.634
2.963GlnLeu: 2.963 ± 0.853
0.539GlnMet: 0.539 ± 0.313
2.963GlnAsn: 2.963 ± 0.604
0.269GlnPro: 0.269 ± 0.156
1.347GlnGln: 1.347 ± 0.287
1.616GlnArg: 1.616 ± 1.1
3.502GlnSer: 3.502 ± 1.282
1.616GlnThr: 1.616 ± 0.798
1.886GlnVal: 1.886 ± 0.437
0.539GlnTrp: 0.539 ± 0.313
1.347GlnTyr: 1.347 ± 0.242
0.0GlnXaa: 0.0 ± 0.0
Arg
2.425ArgAla: 2.425 ± 0.313
1.347ArgCys: 1.347 ± 0.242
2.963ArgAsp: 2.963 ± 0.542
2.694ArgGlu: 2.694 ± 0.789
3.502ArgPhe: 3.502 ± 0.26
2.425ArgGly: 2.425 ± 0.098
2.425ArgHis: 2.425 ± 0.502
2.963ArgIle: 2.963 ± 1.348
3.502ArgLys: 3.502 ± 0.568
3.772ArgLeu: 3.772 ± 1.813
0.808ArgMet: 0.808 ± 0.167
1.886ArgAsn: 1.886 ± 0.86
0.539ArgPro: 0.539 ± 0.313
1.616ArgGln: 1.616 ± 2.534
1.616ArgArg: 1.616 ± 0.317
1.886ArgSer: 1.886 ± 0.614
2.694ArgThr: 2.694 ± 0.484
1.347ArgVal: 1.347 ± 0.424
0.539ArgTrp: 0.539 ± 0.313
2.425ArgTyr: 2.425 ± 0.703
0.0ArgXaa: 0.0 ± 0.0
Ser
4.041SerAla: 4.041 ± 0.645
1.347SerCys: 1.347 ± 0.943
2.425SerAsp: 2.425 ± 1.408
3.772SerGlu: 3.772 ± 0.874
4.58SerPhe: 4.58 ± 0.748
6.196SerGly: 6.196 ± 0.792
0.269SerHis: 0.269 ± 0.156
7.004SerIle: 7.004 ± 0.791
5.927SerLys: 5.927 ± 0.693
10.237SerLeu: 10.237 ± 1.127
1.616SerMet: 1.616 ± 1.111
2.155SerAsn: 2.155 ± 0.17
3.502SerPro: 3.502 ± 0.721
2.963SerGln: 2.963 ± 0.716
3.233SerArg: 3.233 ± 0.389
5.388SerSer: 5.388 ± 0.849
3.233SerThr: 3.233 ± 0.67
4.31SerVal: 4.31 ± 1.305
0.808SerTrp: 0.808 ± 0.167
2.425SerTyr: 2.425 ± 0.934
0.0SerXaa: 0.0 ± 0.0
Thr
6.196ThrAla: 6.196 ± 0.173
1.347ThrCys: 1.347 ± 0.711
2.155ThrAsp: 2.155 ± 0.638
3.233ThrGlu: 3.233 ± 0.67
3.772ThrPhe: 3.772 ± 1.127
2.963ThrGly: 2.963 ± 0.88
1.078ThrHis: 1.078 ± 0.319
4.31ThrIle: 4.31 ± 0.491
3.233ThrLys: 3.233 ± 0.717
4.041ThrLeu: 4.041 ± 1.455
1.347ThrMet: 1.347 ± 0.242
0.808ThrAsn: 0.808 ± 0.167
1.886ThrPro: 1.886 ± 0.093
1.886ThrGln: 1.886 ± 0.842
2.425ThrArg: 2.425 ± 0.721
5.657ThrSer: 5.657 ± 2.004
2.155ThrThr: 2.155 ± 0.17
4.041ThrVal: 4.041 ± 0.35
0.269ThrTrp: 0.269 ± 0.156
2.425ThrTyr: 2.425 ± 0.586
0.0ThrXaa: 0.0 ± 0.0
Val
2.963ValAla: 2.963 ± 0.895
2.155ValCys: 2.155 ± 1.469
4.041ValAsp: 4.041 ± 1.637
3.772ValGlu: 3.772 ± 0.875
1.886ValPhe: 1.886 ± 0.803
2.694ValGly: 2.694 ± 1.442
0.808ValHis: 0.808 ± 0.412
2.963ValIle: 2.963 ± 0.235
3.502ValLys: 3.502 ± 1.143
4.849ValLeu: 4.849 ± 1.005
1.616ValMet: 1.616 ± 0.697
2.963ValAsn: 2.963 ± 0.347
2.425ValPro: 2.425 ± 1.322
3.502ValGln: 3.502 ± 0.984
3.772ValArg: 3.772 ± 1.453
6.196ValSer: 6.196 ± 0.417
2.963ValThr: 2.963 ± 0.604
3.233ValVal: 3.233 ± 0.387
0.808ValTrp: 0.808 ± 0.469
3.502ValTyr: 3.502 ± 0.405
0.0ValXaa: 0.0 ± 0.0
Trp
1.616TrpAla: 1.616 ± 0.478
0.269TrpCys: 0.269 ± 0.268
0.269TrpAsp: 0.269 ± 0.156
0.269TrpGlu: 0.269 ± 0.156
1.886TrpPhe: 1.886 ± 0.727
1.347TrpGly: 1.347 ± 0.242
0.539TrpHis: 0.539 ± 0.159
0.0TrpIle: 0.0 ± 0.0
0.808TrpLys: 0.808 ± 0.469
1.347TrpLeu: 1.347 ± 0.763
0.269TrpMet: 0.269 ± 0.156
0.0TrpAsn: 0.0 ± 0.0
0.269TrpPro: 0.269 ± 0.156
0.0TrpGln: 0.0 ± 0.0
1.078TrpArg: 1.078 ± 0.677
1.616TrpSer: 1.616 ± 0.574
1.078TrpThr: 1.078 ± 0.492
1.616TrpVal: 1.616 ± 0.555
0.0TrpTrp: 0.0 ± 0.0
0.269TrpTyr: 0.269 ± 0.268
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.616TyrAla: 1.616 ± 0.335
1.347TyrCys: 1.347 ± 0.565
2.155TyrAsp: 2.155 ± 0.638
3.233TyrGlu: 3.233 ± 0.724
0.808TyrPhe: 0.808 ± 0.469
1.886TyrGly: 1.886 ± 0.093
0.269TyrHis: 0.269 ± 0.156
2.694TyrIle: 2.694 ± 0.465
4.041TyrLys: 4.041 ± 0.861
3.502TyrLeu: 3.502 ± 0.748
1.616TyrMet: 1.616 ± 0.302
1.078TyrAsn: 1.078 ± 0.626
1.347TyrPro: 1.347 ± 0.242
1.078TyrGln: 1.078 ± 0.492
1.616TyrArg: 1.616 ± 0.335
2.155TyrSer: 2.155 ± 0.564
1.616TyrThr: 1.616 ± 0.478
1.078TyrVal: 1.078 ± 0.472
0.808TyrTrp: 0.808 ± 0.469
1.616TyrTyr: 1.616 ± 0.335
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3713 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski