Amino acid dipeptide frequency for Bujaru virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
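As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the use of one-letter codes are assumptions made for illustration. The page does not say whether pairs spanning protein boundaries are counted, so this sketch counts within each protein only.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted within each protein, which is an
    assumption and not necessarily how the database computes its values.
    """
    counts = Counter()
    for seq in proteins:
        # Slide an overlapping window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not Bujaru virus data): 5 + 3 = 8 dipeptides, "SL" occurs twice.
toy_proteins = ["MKKLSL", "SLKA"]
freqs = dipeptide_frequencies(toy_proteins)
print(freqs["SL"])
```

The order of the two residues matters here, in line with the table, where for example AlaCys and CysAla have different values.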

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
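The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the colouring rule is an assumption: the page does not state the exact threshold it uses, so the sketch simply compares each value with the uniform baseline.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def mark(freq_permille, expected_permille):
    """Assumed colouring rule: above the baseline is red, below is blue."""
    if freq_permille > expected_permille:
        return "red"
    if freq_permille < expected_permille:
        return "blue"
    return "none"


expected = uniform_expectation_permille()       # 1000 / 400 = 2.5 per mille
expected_with_xaa = uniform_expectation_permille(21)  # 1000 / 441

# Two values taken from the table: SerLeu (11.161) and CysAla (0.248).
print(mark(11.161, expected), mark(0.248, expected))
```

With 21 symbols the baseline drops to roughly 2.27‰, which would change the classification only for values lying between the two baselines.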

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
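A short, purely illustrative conversion (the function names are not from the source) shows how a per-mille value relates to a fraction and to a percentage:

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


# Example with the SerLeu value from the table, 11.161 per mille.
fraction = permille_to_fraction(11.161)
percent = permille_to_percent(11.161)
print(fraction, percent)
```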

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.696 ± 4.402
AlaCys: 1.488 ± 0.806
AlaAsp: 2.976 ± 0.685
AlaGlu: 4.712 ± 1.501
AlaPhe: 2.976 ± 0.906
AlaGly: 2.232 ± 0.459
AlaHis: 1.736 ± 0.984
AlaIle: 4.712 ± 0.687
AlaLys: 1.488 ± 0.521
AlaLeu: 5.208 ± 1.267
AlaMet: 1.984 ± 0.757
AlaAsn: 1.488 ± 0.699
AlaPro: 2.232 ± 0.975
AlaGln: 2.232 ± 0.633
AlaArg: 2.728 ± 1.207
AlaSer: 5.456 ± 0.57
AlaThr: 3.224 ± 0.908
AlaVal: 2.976 ± 1.672
AlaTrp: 0.744 ± 0.5
AlaTyr: 1.736 ± 0.982
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.248 ± 0.17
CysCys: 0.496 ± 0.339
CysAsp: 0.744 ± 0.231
CysGlu: 0.248 ± 0.223
CysPhe: 1.736 ± 0.391
CysGly: 0.496 ± 0.446
CysHis: 0.744 ± 0.669
CysIle: 1.488 ± 0.453
CysLys: 1.736 ± 0.621
CysLeu: 1.984 ± 0.331
CysMet: 0.744 ± 0.231
CysAsn: 1.488 ± 0.503
CysPro: 1.984 ± 0.604
CysGln: 1.984 ± 0.56
CysArg: 0.992 ± 0.892
CysSer: 3.224 ± 1.209
CysThr: 3.224 ± 0.885
CysVal: 1.24 ± 0.776
CysTrp: 0.0 ± 0.0
CysTyr: 0.744 ± 0.341
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.728 ± 0.948
AspCys: 1.24 ± 1.115
AspAsp: 5.208 ± 2.327
AspGlu: 3.72 ± 0.665
AspPhe: 2.232 ± 0.903
AspGly: 4.216 ± 1.225
AspHis: 1.24 ± 0.438
AspIle: 3.472 ± 1.029
AspLys: 3.968 ± 0.796
AspLeu: 6.696 ± 2.106
AspMet: 1.984 ± 1.1
AspAsn: 1.736 ± 1.187
AspPro: 2.48 ± 0.459
AspGln: 0.992 ± 0.792
AspArg: 2.232 ± 0.856
AspSer: 4.464 ± 0.69
AspThr: 2.48 ± 1.68
AspVal: 2.232 ± 0.633
AspTrp: 0.744 ± 0.341
AspTyr: 2.232 ± 0.459
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.712 ± 0.714
GluCys: 1.24 ± 0.351
GluAsp: 4.712 ± 1.247
GluGlu: 4.464 ± 1.372
GluPhe: 4.216 ± 1.444
GluGly: 2.48 ± 0.443
GluHis: 1.736 ± 0.371
GluIle: 5.952 ± 0.993
GluLys: 4.464 ± 1.276
GluLeu: 6.2 ± 1.238
GluMet: 1.24 ± 0.776
GluAsn: 2.728 ± 1.168
GluPro: 1.736 ± 0.732
GluGln: 2.48 ± 0.374
GluArg: 3.224 ± 1.235
GluSer: 6.696 ± 1.936
GluThr: 3.472 ± 0.978
GluVal: 3.72 ± 0.811
GluTrp: 0.496 ± 0.541
GluTyr: 1.736 ± 0.601
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.984 ± 0.928
PheCys: 1.24 ± 0.776
PheAsp: 2.232 ± 1.154
PheGlu: 1.984 ± 0.862
PhePhe: 1.736 ± 0.625
PheGly: 1.736 ± 0.389
PheHis: 0.496 ± 0.151
PheIle: 2.728 ± 0.422
PheLys: 3.968 ± 1.049
PheLeu: 5.704 ± 1.0
PheMet: 0.992 ± 0.849
PheAsn: 3.472 ± 0.58
PhePro: 1.984 ± 1.004
PheGln: 0.496 ± 0.151
PheArg: 2.232 ± 1.202
PheSer: 5.208 ± 0.735
PheThr: 2.48 ± 1.056
PheVal: 3.72 ± 1.054
PheTrp: 0.744 ± 0.231
PheTyr: 0.744 ± 0.549
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.968 ± 0.796
GlyCys: 1.488 ± 0.453
GlyAsp: 1.984 ± 0.604
GlyGlu: 2.48 ± 0.824
GlyPhe: 4.464 ± 1.363
GlyGly: 4.216 ± 0.72
GlyHis: 1.24 ± 0.536
GlyIle: 2.232 ± 0.633
GlyLys: 3.224 ± 0.887
GlyLeu: 5.208 ± 1.928
GlyMet: 1.984 ± 0.868
GlyAsn: 1.736 ± 1.023
GlyPro: 2.976 ± 1.129
GlyGln: 1.736 ± 0.895
GlyArg: 2.728 ± 1.195
GlySer: 5.952 ± 1.727
GlyThr: 2.976 ± 0.496
GlyVal: 3.72 ± 0.622
GlyTrp: 0.248 ± 0.17
GlyTyr: 1.24 ± 0.351
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.248 ± 0.17
HisCys: 0.496 ± 0.151
HisAsp: 1.488 ± 0.682
HisGlu: 0.992 ± 0.556
HisPhe: 0.992 ± 0.679
HisGly: 2.232 ± 0.693
HisHis: 0.248 ± 0.17
HisIle: 1.984 ± 0.758
HisLys: 1.736 ± 0.621
HisLeu: 2.48 ± 0.703
HisMet: 0.992 ± 0.302
HisAsn: 0.992 ± 0.376
HisPro: 0.744 ± 0.509
HisGln: 1.24 ± 0.848
HisArg: 0.992 ± 0.607
HisSer: 2.48 ± 1.532
HisThr: 1.736 ± 0.61
HisVal: 0.992 ± 0.431
HisTrp: 0.0 ± 0.0
HisTyr: 2.232 ± 0.925
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.976 ± 0.884
IleCys: 1.24 ± 0.478
IleAsp: 4.96 ± 1.132
IleGlu: 3.968 ± 1.853
IlePhe: 2.976 ± 0.716
IleGly: 2.976 ± 0.715
IleHis: 2.232 ± 0.768
IleIle: 4.712 ± 1.003
IleLys: 5.208 ± 0.77
IleLeu: 4.216 ± 1.209
IleMet: 1.488 ± 0.515
IleAsn: 3.224 ± 0.926
IlePro: 2.728 ± 1.386
IleGln: 2.976 ± 0.716
IleArg: 4.216 ± 0.866
IleSer: 5.704 ± 1.106
IleThr: 3.472 ± 1.163
IleVal: 3.472 ± 1.082
IleTrp: 0.248 ± 0.17
IleTyr: 1.736 ± 0.391
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.456 ± 0.958
LysCys: 1.984 ± 0.816
LysAsp: 1.736 ± 0.489
LysGlu: 6.448 ± 1.625
LysPhe: 1.736 ± 0.489
LysGly: 3.472 ± 1.243
LysHis: 1.24 ± 0.848
LysIle: 5.208 ± 1.663
LysLys: 6.448 ± 1.57
LysLeu: 4.464 ± 1.386
LysMet: 3.968 ± 1.084
LysAsn: 2.48 ± 0.829
LysPro: 1.984 ± 0.551
LysGln: 1.984 ± 0.604
LysArg: 3.224 ± 2.077
LysSer: 6.944 ± 2.184
LysThr: 4.464 ± 0.884
LysVal: 4.712 ± 0.782
LysTrp: 1.488 ± 0.7
LysTyr: 1.984 ± 0.722
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.2 ± 0.56
LeuCys: 1.736 ± 0.601
LeuAsp: 4.216 ± 0.939
LeuGlu: 4.216 ± 0.725
LeuPhe: 3.968 ± 1.505
LeuGly: 5.208 ± 0.365
LeuHis: 2.48 ± 1.395
LeuIle: 4.96 ± 0.379
LeuLys: 7.688 ± 0.809
LeuLeu: 7.688 ± 1.132
LeuMet: 2.232 ± 0.983
LeuAsn: 4.464 ± 1.266
LeuPro: 2.48 ± 0.582
LeuGln: 2.728 ± 0.963
LeuArg: 7.937 ± 1.922
LeuSer: 9.425 ± 1.115
LeuThr: 4.216 ± 1.007
LeuVal: 3.968 ± 0.752
LeuTrp: 0.248 ± 0.223
LeuTyr: 2.728 ± 0.8
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.488 ± 0.453
MetCys: 0.248 ± 0.223
MetAsp: 1.984 ± 0.363
MetGlu: 2.48 ± 0.722
MetPhe: 1.736 ± 0.625
MetGly: 1.984 ± 0.987
MetHis: 0.992 ± 0.608
MetIle: 3.968 ± 1.231
MetLys: 2.976 ± 0.924
MetLeu: 1.488 ± 0.964
MetMet: 2.232 ± 0.9
MetAsn: 1.24 ± 0.702
MetPro: 0.496 ± 0.549
MetGln: 1.24 ± 0.454
MetArg: 0.744 ± 0.509
MetSer: 2.976 ± 1.036
MetThr: 1.488 ± 0.503
MetVal: 0.992 ± 0.302
MetTrp: 0.248 ± 0.584
MetTyr: 0.744 ± 0.509
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.24 ± 0.776
AsnCys: 0.744 ± 0.231
AsnAsp: 2.728 ± 0.364
AsnGlu: 3.968 ± 0.482
AsnPhe: 1.736 ± 0.66
AsnGly: 1.736 ± 0.66
AsnHis: 0.992 ± 0.607
AsnIle: 1.736 ± 0.371
AsnLys: 2.48 ± 0.723
AsnLeu: 3.72 ± 0.113
AsnMet: 1.736 ± 1.221
AsnAsn: 2.232 ± 0.712
AsnPro: 2.728 ± 0.963
AsnGln: 0.992 ± 0.679
AsnArg: 1.984 ± 0.752
AsnSer: 3.472 ± 0.81
AsnThr: 1.984 ± 0.82
AsnVal: 1.736 ± 0.933
AsnTrp: 0.496 ± 0.151
AsnTyr: 0.496 ± 0.151
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.232 ± 1.1
ProCys: 0.496 ± 0.151
ProAsp: 2.728 ± 0.803
ProGlu: 3.968 ± 1.718
ProPhe: 0.992 ± 0.431
ProGly: 3.72 ± 0.679
ProHis: 0.744 ± 0.231
ProIle: 2.48 ± 0.624
ProLys: 1.488 ± 0.353
ProLeu: 3.224 ± 1.857
ProMet: 0.744 ± 0.554
ProAsn: 1.24 ± 0.536
ProPro: 0.496 ± 0.339
ProGln: 0.496 ± 0.151
ProArg: 1.736 ± 0.732
ProSer: 3.224 ± 0.601
ProThr: 1.736 ± 1.151
ProVal: 2.976 ± 1.057
ProTrp: 1.24 ± 0.454
ProTyr: 1.488 ± 0.7
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.232 ± 0.885
GlnCys: 1.488 ± 0.966
GlnAsp: 1.24 ± 0.776
GlnGlu: 2.728 ± 0.916
GlnPhe: 0.496 ± 0.599
GlnGly: 1.984 ± 0.757
GlnHis: 2.232 ± 0.633
GlnIle: 1.984 ± 0.758
GlnLys: 2.976 ± 0.838
GlnLeu: 3.472 ± 0.779
GlnMet: 0.744 ± 0.74
GlnAsn: 0.992 ± 0.792
GlnPro: 1.736 ± 0.389
GlnGln: 0.992 ± 0.302
GlnArg: 1.488 ± 1.0
GlnSer: 0.992 ± 0.556
GlnThr: 1.736 ± 0.621
GlnVal: 1.24 ± 0.702
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.496 ± 0.151
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.712 ± 0.544
ArgCys: 1.24 ± 0.478
ArgAsp: 3.472 ± 1.568
ArgGlu: 4.464 ± 1.51
ArgPhe: 1.488 ± 0.462
ArgGly: 3.72 ± 1.225
ArgHis: 0.744 ± 0.341
ArgIle: 2.728 ± 1.572
ArgLys: 2.976 ± 0.479
ArgLeu: 3.72 ± 1.213
ArgMet: 2.48 ± 1.004
ArgAsn: 1.488 ± 0.99
ArgPro: 1.736 ± 0.621
ArgGln: 0.992 ± 0.608
ArgArg: 2.48 ± 0.403
ArgSer: 4.464 ± 1.473
ArgThr: 2.232 ± 0.479
ArgVal: 4.96 ± 0.393
ArgTrp: 0.744 ± 0.509
ArgTyr: 0.744 ± 0.509
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.216 ± 1.225
SerCys: 3.968 ± 1.524
SerAsp: 5.208 ± 0.908
SerGlu: 6.944 ± 1.428
SerPhe: 3.968 ± 1.788
SerGly: 4.712 ± 2.24
SerHis: 2.728 ± 0.571
SerIle: 4.464 ± 1.631
SerLys: 7.44 ± 1.879
SerLeu: 11.161 ± 2.183
SerMet: 2.976 ± 0.479
SerAsn: 1.488 ± 1.0
SerPro: 4.96 ± 1.149
SerGln: 2.976 ± 0.449
SerArg: 4.712 ± 0.233
SerSer: 9.673 ± 2.837
SerThr: 4.464 ± 0.871
SerVal: 5.208 ± 1.172
SerTrp: 1.736 ± 0.899
SerTyr: 2.232 ± 0.91
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.232 ± 0.377
ThrCys: 1.736 ± 0.422
ThrAsp: 4.464 ± 1.303
ThrGlu: 3.472 ± 0.742
ThrPhe: 2.232 ± 0.858
ThrGly: 4.464 ± 1.073
ThrHis: 0.496 ± 0.339
ThrIle: 2.976 ± 0.833
ThrLys: 2.976 ± 0.906
ThrLeu: 5.952 ± 0.636
ThrMet: 0.0 ± 0.0
ThrAsn: 2.48 ± 0.505
ThrPro: 1.736 ± 0.962
ThrGln: 1.984 ± 1.168
ThrArg: 2.48 ± 0.505
ThrSer: 4.464 ± 1.073
ThrThr: 3.472 ± 0.978
ThrVal: 4.712 ± 1.028
ThrTrp: 0.744 ± 0.554
ThrTyr: 1.24 ± 1.09
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.968 ± 1.861
ValCys: 1.488 ± 0.503
ValAsp: 2.728 ± 0.505
ValGlu: 3.72 ± 1.516
ValPhe: 3.472 ± 0.742
ValGly: 2.232 ± 0.541
ValHis: 1.736 ± 0.371
ValIle: 4.464 ± 1.093
ValLys: 4.712 ± 1.242
ValLeu: 3.968 ± 1.233
ValMet: 1.736 ± 0.489
ValAsn: 1.984 ± 0.594
ValPro: 0.744 ± 0.482
ValGln: 1.736 ± 0.895
ValArg: 4.216 ± 1.52
ValSer: 7.192 ± 0.496
ValThr: 2.728 ± 0.873
ValVal: 5.208 ± 1.204
ValTrp: 0.496 ± 0.151
ValTyr: 1.984 ± 0.331
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.496 ± 0.541
TrpCys: 0.0 ± 0.0
TrpAsp: 0.248 ± 0.17
TrpGlu: 0.992 ± 0.376
TrpPhe: 0.496 ± 0.151
TrpGly: 0.744 ± 0.482
TrpHis: 0.0 ± 0.0
TrpIle: 1.488 ± 0.453
TrpLys: 0.744 ± 0.5
TrpLeu: 0.496 ± 0.541
TrpMet: 0.496 ± 0.151
TrpAsn: 0.744 ± 0.231
TrpPro: 0.496 ± 0.541
TrpGln: 0.496 ± 0.151
TrpArg: 0.248 ± 0.17
TrpSer: 0.496 ± 0.339
TrpThr: 0.992 ± 0.447
TrpVal: 0.992 ± 0.447
TrpTrp: 0.248 ± 0.17
TrpTyr: 0.248 ± 0.17
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.992 ± 0.607
TyrCys: 1.488 ± 0.997
TyrAsp: 1.24 ± 0.536
TyrGlu: 1.736 ± 0.601
TyrPhe: 2.232 ± 1.106
TyrGly: 1.24 ± 0.641
TyrHis: 1.24 ± 0.536
TyrIle: 0.992 ± 0.376
TyrLys: 2.728 ± 0.571
TyrLeu: 1.736 ± 0.389
TyrMet: 0.744 ± 0.509
TyrAsn: 1.24 ± 0.454
TyrPro: 0.992 ± 0.892
TyrGln: 0.744 ± 1.786
TyrArg: 0.992 ± 0.447
TyrSer: 2.728 ± 0.803
TyrThr: 1.736 ± 0.391
TyrVal: 1.736 ± 0.69
TyrTrp: 0.248 ± 0.223
TyrTyr: 0.496 ± 0.549
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4033 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
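One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. Taking the sample standard deviation as the "error", counting pairs within each protein, and all function names are assumptions made for illustration; the page does not describe the procedure in more detail.

```python
import random
from statistics import stdev


def dipeptide_permille(proteins, pair):
    """Per-mille frequency of one dipeptide, counted within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Assumed error measure: standard deviation over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille(resample, pair))
    return stdev(estimates)


# Toy one-letter sequences, not Bujaru virus data.
toy_proteins = ["MKKLSL", "SLKA", "MSLSLK", "AKKA"]
error = bootstrap_error(toy_proteins, "SL")
print(error)
```

With only four proteins, as in this proteome, each resample often omits one or more proteins entirely, which may explain why some errors in the table are large relative to the values themselves.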

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski