Amino acid dipepetide frequency for Hubei rhabdo-like virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.158AlaAla: 7.158 ± 2.536
2.386AlaCys: 2.386 ± 0.785
3.579AlaAsp: 3.579 ± 1.81
6.263AlaGlu: 6.263 ± 2.355
1.491AlaPhe: 1.491 ± 0.381
3.877AlaGly: 3.877 ± 0.73
2.386AlaHis: 2.386 ± 0.855
3.579AlaIle: 3.579 ± 0.5
3.281AlaLys: 3.281 ± 1.989
6.561AlaLeu: 6.561 ± 1.299
2.088AlaMet: 2.088 ± 0.543
1.789AlaAsn: 1.789 ± 0.899
4.772AlaPro: 4.772 ± 0.575
3.877AlaGln: 3.877 ± 0.367
5.07AlaArg: 5.07 ± 1.181
5.07AlaSer: 5.07 ± 2.297
4.175AlaThr: 4.175 ± 0.749
5.965AlaVal: 5.965 ± 1.696
1.789AlaTrp: 1.789 ± 0.914
3.281AlaTyr: 3.281 ± 0.569
0.0AlaXaa: 0.0 ± 0.0
Cys
1.789CysAla: 1.789 ± 0.381
0.596CysCys: 0.596 ± 0.309
0.596CysAsp: 0.596 ± 0.429
0.895CysGlu: 0.895 ± 0.264
0.298CysPhe: 0.298 ± 0.15
0.895CysGly: 0.895 ± 0.45
0.596CysHis: 0.596 ± 0.3
1.193CysIle: 1.193 ± 0.6
0.596CysLys: 0.596 ± 0.3
1.193CysLeu: 1.193 ± 0.298
0.298CysMet: 0.298 ± 0.15
0.596CysAsn: 0.596 ± 0.3
1.193CysPro: 1.193 ± 0.406
0.298CysGln: 0.298 ± 0.407
1.491CysArg: 1.491 ± 0.555
1.193CysSer: 1.193 ± 0.6
0.596CysThr: 0.596 ± 0.3
0.895CysVal: 0.895 ± 0.264
0.298CysTrp: 0.298 ± 0.15
0.596CysTyr: 0.596 ± 0.453
0.0CysXaa: 0.0 ± 0.0
Asp
4.474AspAla: 4.474 ± 0.927
0.596AspCys: 0.596 ± 0.3
2.088AspAsp: 2.088 ± 0.956
3.579AspGlu: 3.579 ± 0.538
1.789AspPhe: 1.789 ± 0.582
0.895AspGly: 0.895 ± 0.457
1.789AspHis: 1.789 ± 0.699
1.789AspIle: 1.789 ± 0.899
2.386AspLys: 2.386 ± 0.371
4.772AspLeu: 4.772 ± 1.423
1.193AspMet: 1.193 ± 0.6
2.982AspAsn: 2.982 ± 1.163
2.982AspPro: 2.982 ± 0.796
2.088AspGln: 2.088 ± 0.934
2.684AspArg: 2.684 ± 0.781
4.175AspSer: 4.175 ± 1.559
2.982AspThr: 2.982 ± 1.165
2.386AspVal: 2.386 ± 0.53
0.895AspTrp: 0.895 ± 0.533
3.579AspTyr: 3.579 ± 1.044
0.0AspXaa: 0.0 ± 0.0
Glu
1.789GluAla: 1.789 ± 0.528
1.193GluCys: 1.193 ± 0.618
2.088GluAsp: 2.088 ± 1.301
4.474GluGlu: 4.474 ± 1.746
2.982GluPhe: 2.982 ± 0.345
3.281GluGly: 3.281 ± 0.666
2.982GluHis: 2.982 ± 0.67
2.982GluIle: 2.982 ± 0.345
2.386GluLys: 2.386 ± 1.516
8.947GluLeu: 8.947 ± 1.564
2.088GluMet: 2.088 ± 0.708
0.895GluAsn: 0.895 ± 0.457
1.789GluPro: 1.789 ± 1.356
2.386GluGln: 2.386 ± 0.597
2.386GluArg: 2.386 ± 0.374
6.561GluSer: 6.561 ± 1.333
4.175GluThr: 4.175 ± 2.359
1.491GluVal: 1.491 ± 0.592
0.895GluTrp: 0.895 ± 0.45
0.596GluTyr: 0.596 ± 0.3
0.0GluXaa: 0.0 ± 0.0
Phe
2.684PheAla: 2.684 ± 0.768
0.596PheCys: 0.596 ± 0.669
2.088PheAsp: 2.088 ± 0.646
1.789PheGlu: 1.789 ± 1.359
1.491PhePhe: 1.491 ± 0.555
1.789PheGly: 1.789 ± 0.512
0.895PheHis: 0.895 ± 0.264
1.491PheIle: 1.491 ± 0.392
2.386PheLys: 2.386 ± 0.813
2.684PheLeu: 2.684 ± 0.974
0.298PheMet: 0.298 ± 0.497
0.895PheAsn: 0.895 ± 0.707
1.491PhePro: 1.491 ± 0.392
1.193PheGln: 1.193 ± 0.298
1.193PheArg: 1.193 ± 0.298
2.982PheSer: 2.982 ± 0.621
2.088PheThr: 2.088 ± 0.28
1.491PheVal: 1.491 ± 0.392
0.0PheTrp: 0.0 ± 0.0
0.895PheTyr: 0.895 ± 0.45
0.0PheXaa: 0.0 ± 0.0
Gly
4.772GlyAla: 4.772 ± 2.618
0.596GlyCys: 0.596 ± 0.3
3.281GlyAsp: 3.281 ± 0.957
2.088GlyGlu: 2.088 ± 1.525
1.789GlyPhe: 1.789 ± 0.25
3.579GlyGly: 3.579 ± 0.483
0.895GlyHis: 0.895 ± 0.264
3.281GlyIle: 3.281 ± 1.155
1.789GlyLys: 1.789 ± 0.593
5.368GlyLeu: 5.368 ± 0.65
2.088GlyMet: 2.088 ± 0.437
2.386GlyAsn: 2.386 ± 0.474
1.789GlyPro: 1.789 ± 0.604
2.684GlyGln: 2.684 ± 0.7
3.281GlyArg: 3.281 ± 0.592
2.982GlySer: 2.982 ± 1.221
2.386GlyThr: 2.386 ± 1.696
4.175GlyVal: 4.175 ± 0.553
0.596GlyTrp: 0.596 ± 0.3
2.088GlyTyr: 2.088 ± 0.28
0.0GlyXaa: 0.0 ± 0.0
His
0.596HisAla: 0.596 ± 0.3
0.596HisCys: 0.596 ± 0.3
2.088HisAsp: 2.088 ± 0.522
0.298HisGlu: 0.298 ± 0.15
0.895HisPhe: 0.895 ± 0.264
2.386HisGly: 2.386 ± 0.53
1.789HisHis: 1.789 ± 0.699
1.491HisIle: 1.491 ± 0.555
1.789HisLys: 1.789 ± 0.58
3.877HisLeu: 3.877 ± 0.656
1.193HisMet: 1.193 ± 0.618
0.895HisAsn: 0.895 ± 0.457
2.088HisPro: 2.088 ± 0.646
1.789HisGln: 1.789 ± 1.287
0.596HisArg: 0.596 ± 0.309
1.789HisSer: 1.789 ± 0.699
1.193HisThr: 1.193 ± 1.338
1.193HisVal: 1.193 ± 0.906
1.193HisTrp: 1.193 ± 0.436
1.491HisTyr: 1.491 ± 0.592
0.0HisXaa: 0.0 ± 0.0
Ile
3.281IleAla: 3.281 ± 0.77
1.193IleCys: 1.193 ± 0.6
3.281IleAsp: 3.281 ± 0.592
2.088IleGlu: 2.088 ± 0.858
0.895IlePhe: 0.895 ± 0.264
3.877IleGly: 3.877 ± 1.67
1.789IleHis: 1.789 ± 1.451
3.281IleIle: 3.281 ± 1.136
2.982IleLys: 2.982 ± 0.514
5.667IleLeu: 5.667 ± 0.604
2.088IleMet: 2.088 ± 0.28
0.895IleAsn: 0.895 ± 0.45
2.684IlePro: 2.684 ± 0.727
1.789IleGln: 1.789 ± 1.287
3.281IleArg: 3.281 ± 0.906
2.982IleSer: 2.982 ± 1.499
5.07IleThr: 5.07 ± 1.142
2.982IleVal: 2.982 ± 0.279
0.895IleTrp: 0.895 ± 0.45
3.579IleTyr: 3.579 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
4.474LysAla: 4.474 ± 0.902
0.298LysCys: 0.298 ± 0.15
1.193LysAsp: 1.193 ± 0.6
3.281LysGlu: 3.281 ± 0.77
1.193LysPhe: 1.193 ± 0.406
2.982LysGly: 2.982 ± 1.007
0.596LysHis: 0.596 ± 0.309
3.281LysIle: 3.281 ± 1.026
1.491LysLys: 1.491 ± 0.302
4.772LysLeu: 4.772 ± 1.29
1.491LysMet: 1.491 ± 1.194
1.789LysAsn: 1.789 ± 0.504
1.491LysPro: 1.491 ± 0.709
1.789LysGln: 1.789 ± 0.796
1.491LysArg: 1.491 ± 0.75
1.789LysSer: 1.789 ± 0.58
2.088LysThr: 2.088 ± 0.402
2.386LysVal: 2.386 ± 0.843
1.193LysTrp: 1.193 ± 0.6
2.386LysTyr: 2.386 ± 0.371
0.0LysXaa: 0.0 ± 0.0
Leu
9.842LeuAla: 9.842 ± 1.024
1.491LeuCys: 1.491 ± 0.392
5.667LeuAsp: 5.667 ± 1.299
5.667LeuGlu: 5.667 ± 1.038
3.281LeuPhe: 3.281 ± 0.906
6.263LeuGly: 6.263 ± 0.98
2.386LeuHis: 2.386 ± 0.839
6.263LeuIle: 6.263 ± 0.991
4.474LeuLys: 4.474 ± 0.674
12.228LeuLeu: 12.228 ± 2.135
2.386LeuMet: 2.386 ± 1.169
5.667LeuAsn: 5.667 ± 0.953
6.263LeuPro: 6.263 ± 1.118
5.965LeuGln: 5.965 ± 1.382
6.86LeuArg: 6.86 ± 0.632
9.544LeuSer: 9.544 ± 2.322
8.052LeuThr: 8.052 ± 2.687
5.07LeuVal: 5.07 ± 0.726
1.491LeuTrp: 1.491 ± 0.709
4.474LeuTyr: 4.474 ± 1.429
0.0LeuXaa: 0.0 ± 0.0
Met
2.982MetAla: 2.982 ± 1.219
0.298MetCys: 0.298 ± 0.15
0.298MetAsp: 0.298 ± 0.505
1.491MetGlu: 1.491 ± 0.626
1.193MetPhe: 1.193 ± 0.298
1.789MetGly: 1.789 ± 0.512
0.596MetHis: 0.596 ± 0.78
1.491MetIle: 1.491 ± 0.492
0.596MetLys: 0.596 ± 0.429
3.579MetLeu: 3.579 ± 1.209
1.193MetMet: 1.193 ± 0.6
0.298MetAsn: 0.298 ± 0.15
2.684MetPro: 2.684 ± 0.768
1.789MetGln: 1.789 ± 0.25
1.491MetArg: 1.491 ± 0.582
1.491MetSer: 1.491 ± 0.492
2.982MetThr: 2.982 ± 1.499
2.982MetVal: 2.982 ± 1.221
0.298MetTrp: 0.298 ± 0.15
1.193MetTyr: 1.193 ± 0.436
0.0MetXaa: 0.0 ± 0.0
Asn
2.386AsnAla: 2.386 ± 0.597
0.895AsnCys: 0.895 ± 0.264
1.789AsnAsp: 1.789 ± 0.899
2.088AsnGlu: 2.088 ± 0.28
0.895AsnPhe: 0.895 ± 0.707
0.596AsnGly: 0.596 ± 0.78
0.298AsnHis: 0.298 ± 0.15
1.193AsnIle: 1.193 ± 0.979
1.491AsnLys: 1.491 ± 0.592
4.474AsnLeu: 4.474 ± 0.644
1.193AsnMet: 1.193 ± 0.6
0.895AsnAsn: 0.895 ± 0.264
1.789AsnPro: 1.789 ± 0.528
0.895AsnGln: 0.895 ± 0.398
3.281AsnArg: 3.281 ± 0.9
2.982AsnSer: 2.982 ± 1.221
1.491AsnThr: 1.491 ± 0.392
2.088AsnVal: 2.088 ± 0.646
0.596AsnTrp: 0.596 ± 0.429
1.789AsnTyr: 1.789 ± 0.528
0.0AsnXaa: 0.0 ± 0.0
Pro
3.281ProAla: 3.281 ± 0.666
0.298ProCys: 0.298 ± 0.407
3.579ProAsp: 3.579 ± 1.044
2.684ProGlu: 2.684 ± 1.234
1.491ProPhe: 1.491 ± 0.492
1.789ProGly: 1.789 ± 0.855
0.895ProHis: 0.895 ± 0.45
2.684ProIle: 2.684 ± 0.727
2.386ProLys: 2.386 ± 0.371
7.158ProLeu: 7.158 ± 1.465
2.088ProMet: 2.088 ± 0.519
1.193ProAsn: 1.193 ± 0.436
2.982ProPro: 2.982 ± 2.525
1.789ProGln: 1.789 ± 1.065
1.789ProArg: 1.789 ± 0.528
5.667ProSer: 5.667 ± 1.596
2.684ProThr: 2.684 ± 0.931
3.281ProVal: 3.281 ± 1.443
0.298ProTrp: 0.298 ± 0.15
2.386ProTyr: 2.386 ± 0.597
0.0ProXaa: 0.0 ± 0.0
Gln
4.175GlnAla: 4.175 ± 1.158
0.596GlnCys: 0.596 ± 0.453
1.491GlnAsp: 1.491 ± 0.626
2.386GlnGlu: 2.386 ± 0.813
1.789GlnPhe: 1.789 ± 0.593
1.193GlnGly: 1.193 ± 0.298
1.491GlnHis: 1.491 ± 0.302
3.877GlnIle: 3.877 ± 0.912
2.386GlnLys: 2.386 ± 0.689
5.667GlnLeu: 5.667 ± 0.604
1.491GlnMet: 1.491 ± 0.626
0.596GlnAsn: 0.596 ± 0.669
1.491GlnPro: 1.491 ± 0.71
1.789GlnGln: 1.789 ± 0.58
3.877GlnArg: 3.877 ± 0.547
2.982GlnSer: 2.982 ± 1.174
1.491GlnThr: 1.491 ± 0.392
5.07GlnVal: 5.07 ± 1.237
1.789GlnTrp: 1.789 ± 0.25
0.596GlnTyr: 0.596 ± 0.3
0.0GlnXaa: 0.0 ± 0.0
Arg
3.579ArgAla: 3.579 ± 1.004
0.895ArgCys: 0.895 ± 0.264
2.684ArgAsp: 2.684 ± 0.927
4.175ArgGlu: 4.175 ± 0.896
1.193ArgPhe: 1.193 ± 0.298
2.982ArgGly: 2.982 ± 0.39
0.895ArgHis: 0.895 ± 0.457
2.088ArgIle: 2.088 ± 0.522
2.386ArgLys: 2.386 ± 0.53
7.456ArgLeu: 7.456 ± 1.826
1.491ArgMet: 1.491 ± 0.492
1.491ArgAsn: 1.491 ± 0.75
1.789ArgPro: 1.789 ± 1.071
3.281ArgGln: 3.281 ± 0.748
4.175ArgArg: 4.175 ± 1.202
5.368ArgSer: 5.368 ± 1.201
3.281ArgThr: 3.281 ± 1.603
3.877ArgVal: 3.877 ± 0.69
1.193ArgTrp: 1.193 ± 0.6
3.877ArgTyr: 3.877 ± 1.004
0.0ArgXaa: 0.0 ± 0.0
Ser
4.772SerAla: 4.772 ± 1.57
0.895SerCys: 0.895 ± 0.45
5.07SerAsp: 5.07 ± 1.223
4.175SerGlu: 4.175 ± 1.907
2.386SerPhe: 2.386 ± 0.836
4.772SerGly: 4.772 ± 1.06
2.982SerHis: 2.982 ± 0.279
6.263SerIle: 6.263 ± 1.148
1.789SerLys: 1.789 ± 0.528
9.544SerLeu: 9.544 ± 2.322
2.684SerMet: 2.684 ± 1.628
2.088SerAsn: 2.088 ± 0.543
2.684SerPro: 2.684 ± 0.471
3.281SerGln: 3.281 ± 0.421
4.474SerArg: 4.474 ± 0.995
4.772SerSer: 4.772 ± 0.748
6.561SerThr: 6.561 ± 1.738
3.579SerVal: 3.579 ± 1.044
1.789SerTrp: 1.789 ± 0.381
1.789SerTyr: 1.789 ± 0.512
0.0SerXaa: 0.0 ± 0.0
Thr
5.965ThrAla: 5.965 ± 3.75
0.895ThrCys: 0.895 ± 0.457
3.877ThrAsp: 3.877 ± 1.322
2.684ThrGlu: 2.684 ± 0.887
1.491ThrPhe: 1.491 ± 0.709
2.088ThrGly: 2.088 ± 1.212
1.193ThrHis: 1.193 ± 0.298
2.684ThrIle: 2.684 ± 0.801
1.789ThrLys: 1.789 ± 0.699
6.561ThrLeu: 6.561 ± 0.501
2.386ThrMet: 2.386 ± 0.843
3.579ThrAsn: 3.579 ± 1.799
4.474ThrPro: 4.474 ± 0.37
4.772ThrGln: 4.772 ± 1.525
3.579ThrArg: 3.579 ± 0.993
5.368ThrSer: 5.368 ± 0.52
3.281ThrThr: 3.281 ± 0.666
4.772ThrVal: 4.772 ± 0.646
0.895ThrTrp: 0.895 ± 0.45
1.789ThrTyr: 1.789 ± 0.796
0.0ThrXaa: 0.0 ± 0.0
Val
5.368ValAla: 5.368 ± 1.159
1.193ValCys: 1.193 ± 0.6
2.684ValAsp: 2.684 ± 0.953
3.877ValGlu: 3.877 ± 0.914
1.491ValPhe: 1.491 ± 0.592
3.281ValGly: 3.281 ± 0.615
1.789ValHis: 1.789 ± 0.965
2.386ValIle: 2.386 ± 1.199
0.895ValLys: 0.895 ± 0.533
6.86ValLeu: 6.86 ± 1.296
1.193ValMet: 1.193 ± 0.62
1.193ValAsn: 1.193 ± 0.6
2.982ValPro: 2.982 ± 0.801
2.088ValGln: 2.088 ± 0.935
3.579ValArg: 3.579 ± 0.5
4.772ValSer: 4.772 ± 1.208
6.561ValThr: 6.561 ± 1.332
4.175ValVal: 4.175 ± 0.379
0.596ValTrp: 0.596 ± 0.3
2.386ValTyr: 2.386 ± 0.948
0.0ValXaa: 0.0 ± 0.0
Trp
1.491TrpAla: 1.491 ± 0.71
0.0TrpCys: 0.0 ± 0.0
0.298TrpAsp: 0.298 ± 0.15
0.895TrpGlu: 0.895 ± 0.45
0.895TrpPhe: 0.895 ± 0.45
1.193TrpGly: 1.193 ± 0.406
0.596TrpHis: 0.596 ± 0.309
0.596TrpIle: 0.596 ± 0.429
1.789TrpLys: 1.789 ± 0.899
2.088TrpLeu: 2.088 ± 0.819
0.895TrpMet: 0.895 ± 0.45
1.193TrpAsn: 1.193 ± 0.298
0.596TrpPro: 0.596 ± 0.453
0.298TrpGln: 0.298 ± 0.15
1.789TrpArg: 1.789 ± 0.381
1.193TrpSer: 1.193 ± 0.436
1.491TrpThr: 1.491 ± 0.592
0.298TrpVal: 0.298 ± 0.15
0.596TrpTrp: 0.596 ± 0.3
0.298TrpTyr: 0.298 ± 0.407
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.579TyrAla: 3.579 ± 1.133
0.596TyrCys: 0.596 ± 0.429
2.386TyrAsp: 2.386 ± 1.199
1.491TyrGlu: 1.491 ± 0.302
1.491TyrPhe: 1.491 ± 0.392
2.386TyrGly: 2.386 ± 0.371
2.386TyrHis: 2.386 ± 0.785
2.684TyrIle: 2.684 ± 0.768
2.684TyrLys: 2.684 ± 0.417
3.877TyrLeu: 3.877 ± 0.812
0.596TyrMet: 0.596 ± 0.3
1.789TyrAsn: 1.789 ± 0.512
2.386TyrPro: 2.386 ± 0.689
2.386TyrGln: 2.386 ± 0.371
1.789TyrArg: 1.789 ± 0.25
2.684TyrSer: 2.684 ± 0.68
1.491TyrThr: 1.491 ± 0.492
1.193TyrVal: 1.193 ± 0.507
1.193TyrTrp: 1.193 ± 0.618
2.088TyrTyr: 2.088 ± 0.646
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3354 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski