Amino acid dipeptide frequency for Puumala orthohantavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
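As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping adjacent residue pairs within each protein and normalises by the total number of pairs. The function name `dipeptide_frequencies`, the use of one-letter codes, and the toy sequences are illustrative assumptions; the exact counting and normalisation used for the table on this page are not stated here.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings.

    Assumption: pairs overlap (positions i and i+1) and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille so the numbers are comparable to the table's units.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences (one-letter codes), not the Puumala proteins.
toy = ["MKLLA", "MKA"]
freqs = dipeptide_frequencies(toy)
```

For the toy input there are six pairs in total, two of which are `MK`, so `freqs["MK"]` is one third of 1000‰.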

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
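The uniform expectation and the over/under labelling can be sketched as follows. The helper names `expected_permille` and `classify` are hypothetical, and comparing each value against the plain uniform expectation is an assumption about how the colouring is decided.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(freq_permille, n_residue_types=20):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    expected = expected_permille(n_residue_types)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


uniform_20 = expected_permille(20)   # 400 possible dipeptides
uniform_21 = expected_permille(21)   # 441 when Xaa is included
leu_leu = classify(9.368)            # LeuLeu value from the table
leu_trp = classify(0.268)            # LeuTrp value from the table
```

With 20 residue types the expectation is 2.5‰, so LeuLeu (9.368‰) comes out as overrepresented and LeuTrp (0.268‰) as underrepresented.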

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
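A small sketch of the unit conversion, assuming table entries of the form `AlaAla: 4.015 ± 0.536` as they appear on this page. The regular expression and the function name `parse_cell` are illustrative, not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, then "value ± error" in per mille.
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")


def parse_cell(text):
    """Return (first, second, fraction, fraction_error) for one table entry."""
    match = CELL.search(text)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {text!r}")
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3


first, second, fraction, fraction_error = parse_cell("AlaAla: 4.015 ± 0.536")
```

Here `fraction` is about 0.004015, i.e. roughly 0.4% of all dipeptides.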

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.015AlaAla: 4.015 ± 0.536
1.874AlaCys: 1.874 ± 0.761
3.747AlaAsp: 3.747 ± 1.127
3.747AlaGlu: 3.747 ± 2.667
3.212AlaPhe: 3.212 ± 0.819
2.677AlaGly: 2.677 ± 0.455
1.874AlaHis: 1.874 ± 0.407
2.409AlaIle: 2.409 ± 0.689
3.48AlaLys: 3.48 ± 1.023
5.621AlaLeu: 5.621 ± 0.853
2.141AlaMet: 2.141 ± 0.439
1.338AlaAsn: 1.338 ± 0.277
2.677AlaPro: 2.677 ± 1.019
2.944AlaGln: 2.944 ± 1.211
2.409AlaArg: 2.409 ± 0.642
4.55AlaSer: 4.55 ± 0.463
2.677AlaThr: 2.677 ± 0.555
3.48AlaVal: 3.48 ± 0.691
0.803AlaTrp: 0.803 ± 0.172
3.212AlaTyr: 3.212 ± 0.689
0.0AlaXaa: 0.0 ± 0.0
Cys
1.606CysAla: 1.606 ± 0.291
0.535CysCys: 0.535 ± 0.491
0.535CysAsp: 0.535 ± 0.144
1.338CysGlu: 1.338 ± 1.227
2.141CysPhe: 2.141 ± 0.878
1.338CysGly: 1.338 ± 0.341
0.268CysHis: 0.268 ± 0.245
1.874CysIle: 1.874 ± 0.382
1.338CysLys: 1.338 ± 0.277
3.212CysLeu: 3.212 ± 2.202
0.0CysMet: 0.0 ± 0.0
1.338CysAsn: 1.338 ± 0.857
2.409CysPro: 2.409 ± 1.536
2.677CysGln: 2.677 ± 1.015
0.0CysArg: 0.0 ± 0.0
1.874CysSer: 1.874 ± 0.647
2.141CysThr: 2.141 ± 0.878
1.606CysVal: 1.606 ± 0.344
0.268CysTrp: 0.268 ± 0.245
0.803CysTyr: 0.803 ± 0.736
0.0CysXaa: 0.0 ± 0.0
Asp
1.874AspAla: 1.874 ± 0.786
1.071AspCys: 1.071 ± 0.613
4.55AspAsp: 4.55 ± 1.033
1.606AspGlu: 1.606 ± 0.931
1.338AspPhe: 1.338 ± 0.438
3.212AspGly: 3.212 ± 0.689
1.606AspHis: 1.606 ± 0.718
5.086AspIle: 5.086 ± 1.326
2.677AspLys: 2.677 ± 0.872
6.156AspLeu: 6.156 ± 0.89
1.606AspMet: 1.606 ± 0.618
3.212AspAsn: 3.212 ± 0.762
3.212AspPro: 3.212 ± 1.283
2.944AspGln: 2.944 ± 0.82
2.409AspArg: 2.409 ± 0.689
5.086AspSer: 5.086 ± 1.044
2.141AspThr: 2.141 ± 1.33
3.48AspVal: 3.48 ± 1.38
1.071AspTrp: 1.071 ± 0.736
2.141AspTyr: 2.141 ± 0.514
0.0AspXaa: 0.0 ± 0.0
Glu
3.212GluAla: 3.212 ± 0.762
2.409GluCys: 2.409 ± 0.465
2.409GluAsp: 2.409 ± 0.952
4.55GluGlu: 4.55 ± 1.832
2.677GluPhe: 2.677 ± 1.061
2.677GluGly: 2.677 ± 0.876
1.071GluHis: 1.071 ± 0.294
3.48GluIle: 3.48 ± 0.917
5.889GluLys: 5.889 ± 1.369
6.692GluLeu: 6.692 ± 0.242
0.803GluMet: 0.803 ± 0.783
1.606GluAsn: 1.606 ± 0.291
2.677GluPro: 2.677 ± 0.961
3.48GluGln: 3.48 ± 0.266
2.677GluArg: 2.677 ± 1.061
3.212GluSer: 3.212 ± 0.453
4.283GluThr: 4.283 ± 0.877
4.818GluVal: 4.818 ± 0.986
1.606GluTrp: 1.606 ± 0.196
1.338GluTyr: 1.338 ± 0.277
0.0GluXaa: 0.0 ± 0.0
Phe
2.141PheAla: 2.141 ± 0.228
1.071PheCys: 1.071 ± 0.289
2.141PheAsp: 2.141 ± 0.578
3.48PheGlu: 3.48 ± 1.291
3.48PhePhe: 3.48 ± 0.691
1.338PheGly: 1.338 ± 0.277
1.606PheHis: 1.606 ± 0.196
3.48PheIle: 3.48 ± 0.266
4.818PheLys: 4.818 ± 0.986
4.55PheLeu: 4.55 ± 0.464
1.338PheMet: 1.338 ± 0.684
2.944PheAsn: 2.944 ± 0.685
1.071PhePro: 1.071 ± 0.43
2.944PheGln: 2.944 ± 0.931
2.409PheArg: 2.409 ± 0.083
2.944PheSer: 2.944 ± 0.345
2.409PheThr: 2.409 ± 0.465
1.338PheVal: 1.338 ± 0.341
0.268PheTrp: 0.268 ± 0.155
1.338PheTyr: 1.338 ± 0.531
0.0PheXaa: 0.0 ± 0.0
Gly
2.944GlyAla: 2.944 ± 0.574
1.606GlyCys: 1.606 ± 1.101
3.747GlyAsp: 3.747 ± 0.4
2.677GlyGlu: 2.677 ± 0.624
2.944GlyPhe: 2.944 ± 0.534
1.338GlyGly: 1.338 ± 0.277
1.874GlyHis: 1.874 ± 0.457
3.212GlyIle: 3.212 ± 1.166
2.677GlyLys: 2.677 ± 0.328
7.227GlyLeu: 7.227 ± 0.653
2.141GlyMet: 2.141 ± 0.544
3.48GlyAsn: 3.48 ± 0.266
1.071GlyPro: 1.071 ± 0.613
1.874GlyGln: 1.874 ± 0.761
1.071GlyArg: 1.071 ± 0.777
4.818GlySer: 4.818 ± 1.577
2.944GlyThr: 2.944 ± 1.04
4.818GlyVal: 4.818 ± 0.547
1.338GlyTrp: 1.338 ± 0.857
2.409GlyTyr: 2.409 ± 0.517
0.0GlyXaa: 0.0 ± 0.0
His
2.944HisAla: 2.944 ± 1.355
0.268HisCys: 0.268 ± 0.245
1.606HisAsp: 1.606 ± 0.344
0.803HisGlu: 0.803 ± 0.369
1.606HisPhe: 1.606 ± 0.291
1.338HisGly: 1.338 ± 0.507
0.268HisHis: 0.268 ± 0.155
1.874HisIle: 1.874 ± 0.407
0.535HisLys: 0.535 ± 0.144
3.212HisLeu: 3.212 ± 0.771
0.268HisMet: 0.268 ± 0.155
0.268HisAsn: 0.268 ± 0.155
0.803HisPro: 0.803 ± 0.466
0.0HisGln: 0.0 ± 0.0
0.803HisArg: 0.803 ± 0.172
1.874HisSer: 1.874 ± 0.407
1.071HisThr: 1.071 ± 0.982
0.535HisVal: 0.535 ± 0.368
0.803HisTrp: 0.803 ± 0.172
0.268HisTyr: 0.268 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
2.141IleAla: 2.141 ± 0.578
1.071IleCys: 1.071 ± 0.613
4.818IleAsp: 4.818 ± 1.283
5.889IleGlu: 5.889 ± 0.292
2.141IlePhe: 2.141 ± 0.878
3.212IleGly: 3.212 ± 1.166
1.338IleHis: 1.338 ± 0.438
3.48IleIle: 3.48 ± 1.061
3.747IleLys: 3.747 ± 1.071
6.156IleLeu: 6.156 ± 0.733
2.409IleMet: 2.409 ± 0.787
1.874IleAsn: 1.874 ± 0.091
3.747IlePro: 3.747 ± 1.011
5.086IleGln: 5.086 ± 0.402
2.677IleArg: 2.677 ± 1.15
6.424IleSer: 6.424 ± 1.265
2.677IleThr: 2.677 ± 0.455
4.283IleVal: 4.283 ± 0.725
0.803IleTrp: 0.803 ± 0.172
1.874IleTyr: 1.874 ± 0.408
0.0IleXaa: 0.0 ± 0.0
Lys
4.283LysAla: 4.283 ± 0.549
1.071LysCys: 1.071 ± 0.613
2.944LysAsp: 2.944 ± 0.857
5.353LysGlu: 5.353 ± 1.396
2.944LysPhe: 2.944 ± 0.606
2.944LysGly: 2.944 ± 0.987
1.606LysHis: 1.606 ± 0.653
5.621LysIle: 5.621 ± 1.121
4.818LysLys: 4.818 ± 0.433
5.621LysLeu: 5.621 ± 1.213
1.338LysMet: 1.338 ± 0.226
2.409LysAsn: 2.409 ± 0.459
2.677LysPro: 2.677 ± 0.945
1.874LysGln: 1.874 ± 0.407
2.677LysArg: 2.677 ± 0.72
5.086LysSer: 5.086 ± 0.791
4.55LysThr: 4.55 ± 0.958
6.959LysVal: 6.959 ± 0.532
0.535LysTrp: 0.535 ± 0.144
3.212LysTyr: 3.212 ± 1.175
0.0LysXaa: 0.0 ± 0.0
Leu
5.086LeuAla: 5.086 ± 0.759
2.944LeuCys: 2.944 ± 0.685
5.889LeuAsp: 5.889 ± 0.978
7.227LeuGlu: 7.227 ± 1.264
4.55LeuPhe: 4.55 ± 0.834
6.424LeuGly: 6.424 ± 0.374
2.409LeuHis: 2.409 ± 0.73
8.833LeuIle: 8.833 ± 1.094
8.298LeuLys: 8.298 ± 1.387
9.368LeuLeu: 9.368 ± 1.76
2.944LeuMet: 2.944 ± 1.355
4.015LeuAsn: 4.015 ± 1.042
3.747LeuPro: 3.747 ± 0.319
3.747LeuGln: 3.747 ± 0.449
5.621LeuArg: 5.621 ± 0.853
5.353LeuSer: 5.353 ± 0.743
4.283LeuThr: 4.283 ± 1.222
5.353LeuVal: 5.353 ± 1.329
0.268LeuTrp: 0.268 ± 0.245
4.015LeuTyr: 4.015 ± 0.679
0.0LeuXaa: 0.0 ± 0.0
Met
2.677MetAla: 2.677 ± 0.872
0.535MetCys: 0.535 ± 0.491
2.409MetAsp: 2.409 ± 0.941
2.677MetGlu: 2.677 ± 0.099
1.071MetPhe: 1.071 ± 0.621
2.141MetGly: 2.141 ± 0.685
0.0MetHis: 0.0 ± 0.0
0.803MetIle: 0.803 ± 0.372
1.606MetLys: 1.606 ± 0.196
1.071MetLeu: 1.071 ± 0.777
0.268MetMet: 0.268 ± 0.245
0.535MetAsn: 0.535 ± 0.31
0.268MetPro: 0.268 ± 0.428
0.803MetGln: 0.803 ± 0.369
1.874MetArg: 1.874 ± 0.091
2.409MetSer: 2.409 ± 0.642
1.071MetThr: 1.071 ± 0.621
1.606MetVal: 1.606 ± 0.536
0.268MetTrp: 0.268 ± 0.155
0.268MetTyr: 0.268 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
1.338AsnAla: 1.338 ± 0.684
1.071AsnCys: 1.071 ± 0.621
1.071AsnAsp: 1.071 ± 0.294
2.944AsnGlu: 2.944 ± 1.025
1.606AsnPhe: 1.606 ± 0.588
1.338AsnGly: 1.338 ± 0.691
1.071AsnHis: 1.071 ± 0.257
4.015AsnIle: 4.015 ± 1.331
2.944AsnLys: 2.944 ± 1.211
5.353AsnLeu: 5.353 ± 1.117
0.535AsnMet: 0.535 ± 0.31
1.071AsnAsn: 1.071 ± 0.621
3.48AsnPro: 3.48 ± 0.713
1.071AsnGln: 1.071 ± 0.493
0.803AsnArg: 0.803 ± 0.172
0.803AsnSer: 0.803 ± 0.172
1.874AsnThr: 1.874 ± 1.139
2.677AsnVal: 2.677 ± 0.099
0.535AsnTrp: 0.535 ± 0.144
1.338AsnTyr: 1.338 ± 0.438
0.0AsnXaa: 0.0 ± 0.0
Pro
1.874ProAla: 1.874 ± 0.382
0.535ProCys: 0.535 ± 0.144
2.409ProAsp: 2.409 ± 1.4
2.409ProGlu: 2.409 ± 0.459
1.606ProPhe: 1.606 ± 0.536
4.283ProGly: 4.283 ± 0.839
1.071ProHis: 1.071 ± 0.982
1.606ProIle: 1.606 ± 0.291
2.677ProLys: 2.677 ± 0.837
3.747ProLeu: 3.747 ± 0.939
0.803ProMet: 0.803 ± 0.681
1.606ProAsn: 1.606 ± 0.653
1.071ProPro: 1.071 ± 0.493
0.535ProGln: 0.535 ± 0.31
1.606ProArg: 1.606 ± 0.588
3.747ProSer: 3.747 ± 0.319
4.015ProThr: 4.015 ± 2.053
3.48ProVal: 3.48 ± 0.695
0.535ProTrp: 0.535 ± 0.49
1.071ProTyr: 1.071 ± 0.294
0.0ProXaa: 0.0 ± 0.0
Gln
4.55GlnAla: 4.55 ± 0.463
1.071GlnCys: 1.071 ± 0.289
2.409GlnAsp: 2.409 ± 0.459
2.409GlnGlu: 2.409 ± 0.952
0.535GlnPhe: 0.535 ± 0.491
2.677GlnGly: 2.677 ± 0.722
1.606GlnHis: 1.606 ± 0.588
2.141GlnIle: 2.141 ± 0.162
1.874GlnLys: 1.874 ± 0.605
3.747GlnLeu: 3.747 ± 0.449
0.535GlnMet: 0.535 ± 0.31
2.141GlnAsn: 2.141 ± 0.439
2.409GlnPro: 2.409 ± 0.517
2.141GlnGln: 2.141 ± 0.514
2.409GlnArg: 2.409 ± 0.552
2.409GlnSer: 2.409 ± 0.952
3.747GlnThr: 3.747 ± 0.881
2.141GlnVal: 2.141 ± 0.611
1.338GlnTrp: 1.338 ± 0.438
1.606GlnTyr: 1.606 ± 0.588
0.0GlnXaa: 0.0 ± 0.0
Arg
0.535ArgAla: 0.535 ± 0.31
1.071ArgCys: 1.071 ± 0.257
2.409ArgAsp: 2.409 ± 0.73
2.677ArgGlu: 2.677 ± 1.22
3.48ArgPhe: 3.48 ± 0.354
3.212ArgGly: 3.212 ± 0.391
1.071ArgHis: 1.071 ± 0.43
2.141ArgIle: 2.141 ± 1.029
4.818ArgLys: 4.818 ± 0.688
2.677ArgLeu: 2.677 ± 1.552
1.071ArgMet: 1.071 ± 0.493
3.212ArgAsn: 3.212 ± 1.005
1.338ArgPro: 1.338 ± 0.226
1.606ArgGln: 1.606 ± 1.628
1.874ArgArg: 1.874 ± 1.132
2.409ArgSer: 2.409 ± 0.305
2.944ArgThr: 2.944 ± 0.345
3.212ArgVal: 3.212 ± 0.883
0.535ArgTrp: 0.535 ± 0.31
2.141ArgTyr: 2.141 ± 0.162
0.0ArgXaa: 0.0 ± 0.0
Ser
5.086SerAla: 5.086 ± 0.592
1.874SerCys: 1.874 ± 1.718
3.747SerAsp: 3.747 ± 0.393
2.944SerGlu: 2.944 ± 0.606
4.283SerPhe: 4.283 ± 0.875
5.353SerGly: 5.353 ± 1.416
0.535SerHis: 0.535 ± 0.491
4.818SerIle: 4.818 ± 0.688
5.353SerLys: 5.353 ± 0.973
9.636SerLeu: 9.636 ± 1.89
1.606SerMet: 1.606 ± 0.718
1.338SerAsn: 1.338 ± 0.226
2.409SerPro: 2.409 ± 0.952
2.409SerGln: 2.409 ± 0.73
3.747SerArg: 3.747 ± 0.393
5.889SerSer: 5.889 ± 1.904
5.889SerThr: 5.889 ± 0.435
2.677SerVal: 2.677 ± 0.555
0.535SerTrp: 0.535 ± 0.31
2.141SerTyr: 2.141 ± 0.162
0.0SerXaa: 0.0 ± 0.0
Thr
5.353ThrAla: 5.353 ± 0.932
1.874ThrCys: 1.874 ± 1.143
2.677ThrAsp: 2.677 ± 0.682
3.747ThrGlu: 3.747 ± 0.913
2.944ThrPhe: 2.944 ± 0.606
4.283ThrGly: 4.283 ± 1.429
0.268ThrHis: 0.268 ± 0.155
3.747ThrIle: 3.747 ± 0.676
3.212ThrLys: 3.212 ± 1.197
4.55ThrLeu: 4.55 ± 1.075
1.874ThrMet: 1.874 ± 0.382
0.535ThrAsn: 0.535 ± 0.31
2.141ThrPro: 2.141 ± 0.685
3.48ThrGln: 3.48 ± 0.354
2.944ThrArg: 2.944 ± 0.857
5.086ThrSer: 5.086 ± 1.556
5.086ThrThr: 5.086 ± 2.126
4.818ThrVal: 4.818 ± 0.892
1.071ThrTrp: 1.071 ± 0.289
1.874ThrTyr: 1.874 ± 0.407
0.0ThrXaa: 0.0 ± 0.0
Val
3.48ValAla: 3.48 ± 0.79
2.944ValCys: 2.944 ± 1.717
3.747ValAsp: 3.747 ± 0.709
2.677ValGlu: 2.677 ± 0.099
2.409ValPhe: 2.409 ± 0.305
2.944ValGly: 2.944 ± 1.717
0.535ValHis: 0.535 ± 0.491
3.747ValIle: 3.747 ± 1.141
4.818ValLys: 4.818 ± 0.917
6.959ValLeu: 6.959 ± 0.78
1.071ValMet: 1.071 ± 0.257
2.409ValAsn: 2.409 ± 0.459
2.141ValPro: 2.141 ± 0.439
2.677ValGln: 2.677 ± 0.624
3.48ValArg: 3.48 ± 0.917
5.086ValSer: 5.086 ± 0.642
4.55ValThr: 4.55 ± 0.946
2.409ValVal: 2.409 ± 0.689
1.338ValTrp: 1.338 ± 0.277
3.212ValTyr: 3.212 ± 0.313
0.0ValXaa: 0.0 ± 0.0
Trp
1.874TrpAla: 1.874 ± 0.382
1.071TrpCys: 1.071 ± 0.289
0.535TrpAsp: 0.535 ± 0.31
0.268TrpGlu: 0.268 ± 0.245
1.071TrpPhe: 1.071 ± 0.294
1.338TrpGly: 1.338 ± 0.341
0.535TrpHis: 0.535 ± 0.31
1.071TrpIle: 1.071 ± 0.613
0.535TrpLys: 0.535 ± 0.31
1.071TrpLeu: 1.071 ± 0.621
0.535TrpMet: 0.535 ± 0.491
0.0TrpAsn: 0.0 ± 0.0
0.268TrpPro: 0.268 ± 0.155
0.268TrpGln: 0.268 ± 0.245
0.535TrpArg: 0.535 ± 0.31
1.606TrpSer: 1.606 ± 0.588
0.803TrpThr: 0.803 ± 0.736
0.803TrpVal: 0.803 ± 0.369
0.0TrpTrp: 0.0 ± 0.0
0.268TrpTyr: 0.268 ± 0.428
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.874TyrAla: 1.874 ± 0.091
1.338TyrCys: 1.338 ± 0.507
2.677TyrAsp: 2.677 ± 1.2
2.141TyrGlu: 2.141 ± 0.878
1.071TyrPhe: 1.071 ± 0.43
2.141TyrGly: 2.141 ± 0.162
0.535TyrHis: 0.535 ± 0.31
2.409TyrIle: 2.409 ± 1.046
2.409TyrLys: 2.409 ± 0.689
3.747TyrLeu: 3.747 ± 0.709
0.803TyrMet: 0.803 ± 0.762
1.606TyrAsn: 1.606 ± 0.344
1.071TyrPro: 1.071 ± 0.294
1.338TyrGln: 1.338 ± 0.226
2.409TyrArg: 2.409 ± 0.545
1.606TyrSer: 1.606 ± 0.588
2.409TyrThr: 2.409 ± 0.465
2.141TyrVal: 2.141 ± 1.029
0.535TyrTrp: 0.535 ± 0.31
1.071TyrTyr: 1.071 ± 0.294
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3737 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
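A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. Taking the sample standard deviation as the error, the overlapping-pair counting, and the names `permille` and `bootstrap_error` are assumptions; the exact procedure behind the ± values in the table is not described on this page.

```python
import random
import statistics
from collections import Counter


def permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping adjacent pairs."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences (one-letter codes), not the Puumala proteins.
identical = bootstrap_error(["AAAA", "AAAA"], "AA")
mixed = bootstrap_error(["AAAA", "CCCC"], "AA", seed=1)
```

When every protein is identical, all resamples give the same frequency and the error is zero; with a proteome of only a few proteins, as here, the resamples differ strongly, which is consistent with the relatively large errors in the table.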

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski