Amino acid dipepetide frequency for Flanders hapavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.889AlaAla: 1.889 ± 1.017
0.944AlaCys: 0.944 ± 0.573
1.416AlaAsp: 1.416 ± 1.269
2.833AlaGlu: 2.833 ± 1.105
3.777AlaPhe: 3.777 ± 1.049
2.833AlaGly: 2.833 ± 1.47
1.416AlaHis: 1.416 ± 0.617
1.889AlaIle: 1.889 ± 0.776
3.305AlaLys: 3.305 ± 0.901
2.833AlaLeu: 2.833 ± 1.097
0.472AlaMet: 0.472 ± 0.419
1.889AlaAsn: 1.889 ± 0.941
1.889AlaPro: 1.889 ± 1.211
0.944AlaGln: 0.944 ± 0.571
0.944AlaArg: 0.944 ± 0.431
2.833AlaSer: 2.833 ± 1.403
1.416AlaThr: 1.416 ± 0.895
1.889AlaVal: 1.889 ± 0.848
1.416AlaTrp: 1.416 ± 0.551
2.833AlaTyr: 2.833 ± 1.454
0.0AlaXaa: 0.0 ± 0.0
Cys
0.944CysAla: 0.944 ± 0.656
0.472CysCys: 0.472 ± 0.512
0.472CysAsp: 0.472 ± 0.334
0.944CysGlu: 0.944 ± 0.668
0.472CysPhe: 0.472 ± 0.512
1.416CysGly: 1.416 ± 1.089
0.944CysHis: 0.944 ± 0.431
0.944CysIle: 0.944 ± 0.512
3.305CysLys: 3.305 ± 1.068
0.944CysLeu: 0.944 ± 0.691
0.0CysMet: 0.0 ± 0.0
0.944CysAsn: 0.944 ± 0.668
0.944CysPro: 0.944 ± 0.431
0.944CysGln: 0.944 ± 0.668
0.944CysArg: 0.944 ± 0.568
1.889CysSer: 1.889 ± 0.765
0.472CysThr: 0.472 ± 0.545
0.472CysVal: 0.472 ± 0.559
0.472CysTrp: 0.472 ± 0.559
0.944CysTyr: 0.944 ± 0.431
0.0CysXaa: 0.0 ± 0.0
Asp
2.833AspAla: 2.833 ± 1.656
2.361AspCys: 2.361 ± 0.73
5.194AspAsp: 5.194 ± 2.466
2.833AspGlu: 2.833 ± 0.887
0.472AspPhe: 0.472 ± 0.553
2.833AspGly: 2.833 ± 1.23
0.472AspHis: 0.472 ± 0.334
4.721AspIle: 4.721 ± 1.796
5.666AspLys: 5.666 ± 1.26
5.194AspLeu: 5.194 ± 1.258
0.944AspMet: 0.944 ± 0.846
3.305AspAsn: 3.305 ± 0.725
3.305AspPro: 3.305 ± 0.804
3.777AspGln: 3.777 ± 0.904
1.416AspArg: 1.416 ± 0.96
3.777AspSer: 3.777 ± 0.953
2.833AspThr: 2.833 ± 1.135
3.777AspVal: 3.777 ± 1.52
1.416AspTrp: 1.416 ± 0.989
3.777AspTyr: 3.777 ± 1.474
0.0AspXaa: 0.0 ± 0.0
Glu
4.249GluAla: 4.249 ± 1.123
0.472GluCys: 0.472 ± 0.334
4.721GluAsp: 4.721 ± 1.147
5.666GluGlu: 5.666 ± 1.224
0.944GluPhe: 0.944 ± 0.668
1.416GluGly: 1.416 ± 0.746
0.472GluHis: 0.472 ± 0.559
2.833GluIle: 2.833 ± 2.301
3.305GluLys: 3.305 ± 1.31
5.666GluLeu: 5.666 ± 1.089
1.889GluMet: 1.889 ± 0.766
3.777GluAsn: 3.777 ± 1.639
1.416GluPro: 1.416 ± 0.551
1.416GluGln: 1.416 ± 0.511
0.944GluArg: 0.944 ± 0.571
5.666GluSer: 5.666 ± 1.878
1.416GluThr: 1.416 ± 1.001
3.305GluVal: 3.305 ± 0.999
1.416GluTrp: 1.416 ± 0.617
1.889GluTyr: 1.889 ± 0.67
0.0GluXaa: 0.0 ± 0.0
Phe
2.361PheAla: 2.361 ± 1.735
0.472PheCys: 0.472 ± 0.423
5.194PheAsp: 5.194 ± 1.176
1.416PheGlu: 1.416 ± 0.759
3.305PhePhe: 3.305 ± 1.402
4.249PheGly: 4.249 ± 1.039
2.361PheHis: 2.361 ± 1.227
2.361PheIle: 2.361 ± 0.816
2.833PheLys: 2.833 ± 1.176
4.249PheLeu: 4.249 ± 1.001
0.0PheMet: 0.0 ± 0.0
1.416PheAsn: 1.416 ± 0.849
2.361PhePro: 2.361 ± 0.708
2.833PheGln: 2.833 ± 0.909
1.416PheArg: 1.416 ± 0.511
3.305PheSer: 3.305 ± 0.785
0.944PheThr: 0.944 ± 0.683
3.305PheVal: 3.305 ± 1.122
0.0PheTrp: 0.0 ± 0.0
1.416PheTyr: 1.416 ± 0.727
0.0PheXaa: 0.0 ± 0.0
Gly
0.472GlyAla: 0.472 ± 0.423
0.0GlyCys: 0.0 ± 0.0
3.305GlyAsp: 3.305 ± 0.972
2.361GlyGlu: 2.361 ± 0.73
2.833GlyPhe: 2.833 ± 1.24
2.361GlyGly: 2.361 ± 0.646
0.0GlyHis: 0.0 ± 0.0
8.026GlyIle: 8.026 ± 1.313
8.026GlyLys: 8.026 ± 1.731
3.305GlyLeu: 3.305 ± 1.446
0.472GlyMet: 0.472 ± 0.334
2.361GlyAsn: 2.361 ± 1.146
0.472GlyPro: 0.472 ± 0.334
1.416GlyGln: 1.416 ± 0.617
3.305GlyArg: 3.305 ± 1.529
3.777GlySer: 3.777 ± 1.042
5.194GlyThr: 5.194 ± 1.226
5.194GlyVal: 5.194 ± 0.979
0.944GlyTrp: 0.944 ± 0.431
2.833GlyTyr: 2.833 ± 1.02
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
2.361HisAsp: 2.361 ± 1.044
0.472HisGlu: 0.472 ± 0.572
1.416HisPhe: 1.416 ± 0.96
2.833HisGly: 2.833 ± 0.919
0.472HisHis: 0.472 ± 0.334
0.944HisIle: 0.944 ± 0.578
0.472HisLys: 0.472 ± 0.334
3.305HisLeu: 3.305 ± 0.763
0.472HisMet: 0.472 ± 0.572
0.944HisAsn: 0.944 ± 0.593
2.833HisPro: 2.833 ± 0.916
0.0HisGln: 0.0 ± 0.0
0.472HisArg: 0.472 ± 0.334
1.416HisSer: 1.416 ± 0.763
0.0HisThr: 0.0 ± 0.0
1.889HisVal: 1.889 ± 1.13
0.0HisTrp: 0.0 ± 0.0
0.472HisTyr: 0.472 ± 0.559
0.0HisXaa: 0.0 ± 0.0
Ile
2.361IleAla: 2.361 ± 0.848
2.361IleCys: 2.361 ± 0.896
4.249IleAsp: 4.249 ± 1.62
4.249IleGlu: 4.249 ± 0.936
2.361IlePhe: 2.361 ± 1.276
6.61IleGly: 6.61 ± 1.478
1.416IleHis: 1.416 ± 1.081
4.249IleIle: 4.249 ± 1.769
5.194IleLys: 5.194 ± 2.235
4.721IleLeu: 4.721 ± 1.527
1.416IleMet: 1.416 ± 0.666
2.361IleAsn: 2.361 ± 1.265
2.361IlePro: 2.361 ± 1.048
2.361IleGln: 2.361 ± 0.604
4.249IleArg: 4.249 ± 0.907
9.915IleSer: 9.915 ± 2.353
5.666IleThr: 5.666 ± 1.434
5.666IleVal: 5.666 ± 1.663
0.472IleTrp: 0.472 ± 0.545
6.61IleTyr: 6.61 ± 1.26
0.0IleXaa: 0.0 ± 0.0
Lys
0.944LysAla: 0.944 ± 0.683
2.833LysCys: 2.833 ± 1.573
4.249LysAsp: 4.249 ± 1.103
4.249LysGlu: 4.249 ± 1.349
2.361LysPhe: 2.361 ± 0.958
4.249LysGly: 4.249 ± 0.879
1.889LysHis: 1.889 ± 0.876
7.082LysIle: 7.082 ± 1.373
6.138LysLys: 6.138 ± 1.903
5.666LysLeu: 5.666 ± 1.734
2.833LysMet: 2.833 ± 1.669
4.249LysAsn: 4.249 ± 1.247
1.889LysPro: 1.889 ± 1.033
1.416LysGln: 1.416 ± 0.645
7.554LysArg: 7.554 ± 2.146
6.138LysSer: 6.138 ± 2.098
4.249LysThr: 4.249 ± 1.311
4.721LysVal: 4.721 ± 1.263
0.944LysTrp: 0.944 ± 0.725
2.361LysTyr: 2.361 ± 0.785
0.0LysXaa: 0.0 ± 0.0
Leu
3.305LeuAla: 3.305 ± 1.067
1.416LeuCys: 1.416 ± 1.089
4.721LeuAsp: 4.721 ± 1.193
5.666LeuGlu: 5.666 ± 1.238
3.777LeuPhe: 3.777 ± 1.364
4.249LeuGly: 4.249 ± 1.545
1.889LeuHis: 1.889 ± 0.974
5.666LeuIle: 5.666 ± 2.109
5.666LeuLys: 5.666 ± 1.912
6.138LeuLeu: 6.138 ± 1.551
2.361LeuMet: 2.361 ± 0.708
7.082LeuAsn: 7.082 ± 2.029
4.721LeuPro: 4.721 ± 0.755
4.249LeuGln: 4.249 ± 1.372
4.249LeuArg: 4.249 ± 1.49
4.721LeuSer: 4.721 ± 1.451
3.777LeuThr: 3.777 ± 1.04
2.833LeuVal: 2.833 ± 1.289
3.305LeuTrp: 3.305 ± 1.108
5.194LeuTyr: 5.194 ± 1.779
0.0LeuXaa: 0.0 ± 0.0
Met
1.889MetAla: 1.889 ± 0.947
0.0MetCys: 0.0 ± 0.0
2.361MetAsp: 2.361 ± 0.721
0.944MetGlu: 0.944 ± 1.0
1.889MetPhe: 1.889 ± 1.364
1.889MetGly: 1.889 ± 0.649
0.0MetHis: 0.0 ± 0.0
2.833MetIle: 2.833 ± 0.788
2.361MetLys: 2.361 ± 1.214
1.889MetLeu: 1.889 ± 0.848
0.472MetMet: 0.472 ± 0.423
1.889MetAsn: 1.889 ± 0.617
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.416MetArg: 1.416 ± 1.131
1.889MetSer: 1.889 ± 1.152
1.416MetThr: 1.416 ± 0.874
2.361MetVal: 2.361 ± 1.617
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.889AsnAla: 1.889 ± 0.862
0.0AsnCys: 0.0 ± 0.0
0.944AsnAsp: 0.944 ± 0.621
0.944AsnGlu: 0.944 ± 0.431
1.416AsnPhe: 1.416 ± 0.676
4.721AsnGly: 4.721 ± 1.695
0.944AsnHis: 0.944 ± 0.573
6.138AsnIle: 6.138 ± 1.658
5.666AsnLys: 5.666 ± 1.496
4.249AsnLeu: 4.249 ± 2.241
2.361AsnMet: 2.361 ± 1.126
4.249AsnAsn: 4.249 ± 1.211
4.249AsnPro: 4.249 ± 0.606
0.944AsnGln: 0.944 ± 0.593
0.944AsnArg: 0.944 ± 0.431
5.194AsnSer: 5.194 ± 1.442
2.833AsnThr: 2.833 ± 1.003
2.833AsnVal: 2.833 ± 1.187
0.944AsnTrp: 0.944 ± 0.803
2.361AsnTyr: 2.361 ± 1.229
0.0AsnXaa: 0.0 ± 0.0
Pro
0.472ProAla: 0.472 ± 0.423
0.472ProCys: 0.472 ± 0.334
3.305ProAsp: 3.305 ± 1.422
1.416ProGlu: 1.416 ± 0.7
2.361ProPhe: 2.361 ± 0.847
3.777ProGly: 3.777 ± 1.776
1.416ProHis: 1.416 ± 1.001
3.777ProIle: 3.777 ± 1.89
7.554ProLys: 7.554 ± 1.062
3.777ProLeu: 3.777 ± 1.208
0.944ProMet: 0.944 ± 0.789
1.889ProAsn: 1.889 ± 0.909
3.777ProPro: 3.777 ± 1.632
2.833ProGln: 2.833 ± 0.66
0.944ProArg: 0.944 ± 0.683
4.721ProSer: 4.721 ± 0.869
1.889ProThr: 1.889 ± 0.947
1.416ProVal: 1.416 ± 0.763
0.944ProTrp: 0.944 ± 0.749
1.416ProTyr: 1.416 ± 0.874
0.0ProXaa: 0.0 ± 0.0
Gln
0.472GlnAla: 0.472 ± 0.559
0.0GlnCys: 0.0 ± 0.0
1.889GlnAsp: 1.889 ± 0.766
3.777GlnGlu: 3.777 ± 1.552
1.416GlnPhe: 1.416 ± 0.778
0.472GlnGly: 0.472 ± 0.559
0.472GlnHis: 0.472 ± 0.559
2.361GlnIle: 2.361 ± 1.265
1.416GlnLys: 1.416 ± 0.624
4.249GlnLeu: 4.249 ± 1.227
1.889GlnMet: 1.889 ± 0.605
1.889GlnAsn: 1.889 ± 0.57
1.416GlnPro: 1.416 ± 1.716
0.944GlnGln: 0.944 ± 0.431
0.944GlnArg: 0.944 ± 0.712
1.416GlnSer: 1.416 ± 0.874
3.305GlnThr: 3.305 ± 1.067
1.416GlnVal: 1.416 ± 0.617
0.472GlnTrp: 0.472 ± 0.334
3.777GlnTyr: 3.777 ± 1.06
0.0GlnXaa: 0.0 ± 0.0
Arg
4.249ArgAla: 4.249 ± 1.389
0.472ArgCys: 0.472 ± 0.334
2.361ArgAsp: 2.361 ± 1.044
2.833ArgGlu: 2.833 ± 1.102
4.249ArgPhe: 4.249 ± 1.393
2.833ArgGly: 2.833 ± 1.044
0.472ArgHis: 0.472 ± 0.423
3.305ArgIle: 3.305 ± 0.807
2.833ArgLys: 2.833 ± 1.113
1.889ArgLeu: 1.889 ± 0.766
0.944ArgMet: 0.944 ± 0.621
3.305ArgAsn: 3.305 ± 0.901
1.889ArgPro: 1.889 ± 0.67
0.944ArgGln: 0.944 ± 0.512
2.833ArgArg: 2.833 ± 1.734
4.249ArgSer: 4.249 ± 1.143
1.889ArgThr: 1.889 ± 1.481
2.361ArgVal: 2.361 ± 1.26
0.0ArgTrp: 0.0 ± 0.0
0.944ArgTyr: 0.944 ± 0.431
0.0ArgXaa: 0.0 ± 0.0
Ser
2.361SerAla: 2.361 ± 1.113
0.472SerCys: 0.472 ± 0.334
5.666SerAsp: 5.666 ± 1.516
2.361SerGlu: 2.361 ± 1.192
4.721SerPhe: 4.721 ± 1.709
3.305SerGly: 3.305 ± 1.235
1.416SerHis: 1.416 ± 0.486
8.499SerIle: 8.499 ± 1.954
4.721SerLys: 4.721 ± 1.538
7.082SerLeu: 7.082 ± 1.536
1.416SerMet: 1.416 ± 1.169
3.305SerAsn: 3.305 ± 2.048
3.777SerPro: 3.777 ± 1.798
3.777SerGln: 3.777 ± 0.821
4.721SerArg: 4.721 ± 1.282
7.082SerSer: 7.082 ± 1.884
5.194SerThr: 5.194 ± 1.105
2.833SerVal: 2.833 ± 1.682
2.833SerTrp: 2.833 ± 1.06
6.138SerTyr: 6.138 ± 2.015
0.0SerXaa: 0.0 ± 0.0
Thr
2.833ThrAla: 2.833 ± 2.164
0.472ThrCys: 0.472 ± 0.5
1.416ThrAsp: 1.416 ± 1.269
2.361ThrGlu: 2.361 ± 0.821
3.777ThrPhe: 3.777 ± 1.507
3.305ThrGly: 3.305 ± 1.567
0.944ThrHis: 0.944 ± 0.579
2.833ThrIle: 2.833 ± 0.917
2.361ThrLys: 2.361 ± 1.659
5.666ThrLeu: 5.666 ± 1.405
0.944ThrMet: 0.944 ± 0.578
3.305ThrAsn: 3.305 ± 1.115
4.721ThrPro: 4.721 ± 0.675
1.416ThrGln: 1.416 ± 0.681
2.361ThrArg: 2.361 ± 0.891
3.777ThrSer: 3.777 ± 1.991
2.361ThrThr: 2.361 ± 1.133
2.833ThrVal: 2.833 ± 1.076
2.361ThrTrp: 2.361 ± 1.242
0.472ThrTyr: 0.472 ± 0.572
0.0ThrXaa: 0.0 ± 0.0
Val
3.777ValAla: 3.777 ± 1.631
1.889ValCys: 1.889 ± 0.743
4.721ValAsp: 4.721 ± 2.344
4.249ValGlu: 4.249 ± 1.081
1.416ValPhe: 1.416 ± 0.763
1.416ValGly: 1.416 ± 0.764
0.944ValHis: 0.944 ± 0.593
4.249ValIle: 4.249 ± 1.115
3.777ValLys: 3.777 ± 1.289
4.721ValLeu: 4.721 ± 1.329
0.0ValMet: 0.0 ± 0.0
3.305ValAsn: 3.305 ± 1.436
3.777ValPro: 3.777 ± 1.365
0.944ValGln: 0.944 ± 0.846
2.361ValArg: 2.361 ± 0.902
6.138ValSer: 6.138 ± 2.441
1.889ValThr: 1.889 ± 0.932
3.777ValVal: 3.777 ± 1.096
0.472ValTrp: 0.472 ± 0.334
3.305ValTyr: 3.305 ± 1.52
0.0ValXaa: 0.0 ± 0.0
Trp
0.944TrpAla: 0.944 ± 1.0
1.416TrpCys: 1.416 ± 0.624
0.472TrpAsp: 0.472 ± 0.423
1.889TrpGlu: 1.889 ± 1.146
0.944TrpPhe: 0.944 ± 0.668
0.944TrpGly: 0.944 ± 0.578
0.472TrpHis: 0.472 ± 0.559
3.305TrpIle: 3.305 ± 1.36
0.0TrpLys: 0.0 ± 0.0
1.889TrpLeu: 1.889 ± 0.947
1.889TrpMet: 1.889 ± 0.987
0.472TrpAsn: 0.472 ± 0.559
0.0TrpPro: 0.0 ± 0.0
0.944TrpGln: 0.944 ± 0.431
0.0TrpArg: 0.0 ± 0.0
0.472TrpSer: 0.472 ± 0.512
0.472TrpThr: 0.472 ± 0.334
0.472TrpVal: 0.472 ± 0.334
0.944TrpTrp: 0.944 ± 0.668
1.416TrpTyr: 1.416 ± 0.617
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.416TyrAla: 1.416 ± 0.874
2.361TyrCys: 2.361 ± 0.816
1.889TyrAsp: 1.889 ± 0.641
1.416TyrGlu: 1.416 ± 0.511
1.889TyrPhe: 1.889 ± 1.335
0.944TyrGly: 0.944 ± 0.668
2.361TyrHis: 2.361 ± 0.901
2.833TyrIle: 2.833 ± 0.876
1.416TyrLys: 1.416 ± 0.941
7.554TyrLeu: 7.554 ± 2.279
3.305TyrMet: 3.305 ± 0.941
2.361TyrAsn: 2.361 ± 1.266
3.305TyrPro: 3.305 ± 1.621
1.889TyrGln: 1.889 ± 1.186
2.833TyrArg: 2.833 ± 0.971
3.305TyrSer: 3.305 ± 0.975
3.305TyrThr: 3.305 ± 1.408
3.777TyrVal: 3.777 ± 1.149
0.0TyrTrp: 0.0 ± 0.0
0.472TyrTyr: 0.472 ± 0.572
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2119 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski