Amino acid dipepetide frequency for Bimiti virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.291AlaAla: 2.291 ± 5.681
1.527AlaCys: 1.527 ± 0.693
2.291AlaAsp: 2.291 ± 0.571
3.563AlaGlu: 3.563 ± 2.282
2.545AlaPhe: 2.545 ± 0.732
1.018AlaGly: 1.018 ± 0.79
1.273AlaHis: 1.273 ± 0.464
3.309AlaIle: 3.309 ± 0.519
4.836AlaLys: 4.836 ± 1.228
4.072AlaLeu: 4.072 ± 0.661
0.0AlaMet: 0.0 ± 0.0
4.327AlaAsn: 4.327 ± 1.016
1.527AlaPro: 1.527 ± 0.439
2.545AlaGln: 2.545 ± 1.501
2.8AlaArg: 2.8 ± 2.456
2.545AlaSer: 2.545 ± 0.45
3.309AlaThr: 3.309 ± 0.853
2.545AlaVal: 2.545 ± 0.965
0.255AlaTrp: 0.255 ± 0.153
2.545AlaTyr: 2.545 ± 1.488
0.0AlaXaa: 0.0 ± 0.0
Cys
1.273CysAla: 1.273 ± 0.483
0.255CysCys: 0.255 ± 0.153
1.273CysAsp: 1.273 ± 0.313
0.764CysGlu: 0.764 ± 0.673
2.036CysPhe: 2.036 ± 1.131
2.8CysGly: 2.8 ± 1.475
1.018CysHis: 1.018 ± 0.565
1.782CysIle: 1.782 ± 0.447
3.309CysLys: 3.309 ± 2.243
2.036CysLeu: 2.036 ± 0.586
0.764CysMet: 0.764 ± 0.347
2.291CysAsn: 2.291 ± 0.587
1.018CysPro: 1.018 ± 0.322
1.273CysGln: 1.273 ± 0.483
0.509CysArg: 0.509 ± 0.449
1.782CysSer: 1.782 ± 0.623
1.273CysThr: 1.273 ± 0.787
1.782CysVal: 1.782 ± 1.233
0.255CysTrp: 0.255 ± 0.153
1.527CysTyr: 1.527 ± 0.693
0.0CysXaa: 0.0 ± 0.0
Asp
1.527AspAla: 1.527 ± 0.741
1.273AspCys: 1.273 ± 0.483
1.782AspAsp: 1.782 ± 0.447
4.327AspGlu: 4.327 ± 1.427
4.327AspPhe: 4.327 ± 1.509
2.036AspGly: 2.036 ± 0.827
0.764AspHis: 0.764 ± 0.347
4.327AspIle: 4.327 ± 1.345
3.818AspLys: 3.818 ± 0.939
8.145AspLeu: 8.145 ± 1.205
2.291AspMet: 2.291 ± 1.375
3.818AspAsn: 3.818 ± 0.939
2.545AspPro: 2.545 ± 0.927
1.782AspGln: 1.782 ± 0.447
3.054AspArg: 3.054 ± 1.671
1.527AspSer: 1.527 ± 1.01
3.054AspThr: 3.054 ± 1.202
2.545AspVal: 2.545 ± 0.979
0.509AspTrp: 0.509 ± 0.306
2.036AspTyr: 2.036 ± 0.503
0.0AspXaa: 0.0 ± 0.0
Glu
2.8GluAla: 2.8 ± 0.526
1.782GluCys: 1.782 ± 0.911
3.818GluAsp: 3.818 ± 0.53
5.345GluGlu: 5.345 ± 1.039
7.126GluPhe: 7.126 ± 2.16
1.527GluGly: 1.527 ± 0.396
2.036GluHis: 2.036 ± 0.548
8.399GluIle: 8.399 ± 2.093
3.563GluLys: 3.563 ± 0.894
6.363GluLeu: 6.363 ± 1.314
3.563GluMet: 3.563 ± 1.514
2.036GluAsn: 2.036 ± 0.705
1.527GluPro: 1.527 ± 0.741
2.8GluGln: 2.8 ± 0.698
3.054GluArg: 3.054 ± 1.202
3.818GluSer: 3.818 ± 0.214
3.054GluThr: 3.054 ± 0.758
1.273GluVal: 1.273 ± 0.483
0.255GluTrp: 0.255 ± 0.153
2.8GluTyr: 2.8 ± 0.833
0.0GluXaa: 0.0 ± 0.0
Phe
1.782PheAla: 1.782 ± 0.513
2.036PheCys: 2.036 ± 0.91
3.309PheAsp: 3.309 ± 0.873
3.054PheGlu: 3.054 ± 1.361
3.054PhePhe: 3.054 ± 1.671
3.563PheGly: 3.563 ± 1.691
1.018PheHis: 1.018 ± 0.322
3.054PheIle: 3.054 ± 0.54
4.836PheLys: 4.836 ± 1.476
5.345PheLeu: 5.345 ± 5.958
2.036PheMet: 2.036 ± 0.644
3.054PheAsn: 3.054 ± 0.793
1.273PhePro: 1.273 ± 0.751
1.782PheGln: 1.782 ± 0.513
2.291PheArg: 2.291 ± 0.887
5.854PheSer: 5.854 ± 1.439
2.291PheThr: 2.291 ± 0.784
1.782PheVal: 1.782 ± 0.513
0.764PheTrp: 0.764 ± 0.458
2.036PheTyr: 2.036 ± 0.827
0.0PheXaa: 0.0 ± 0.0
Gly
1.527GlyAla: 1.527 ± 0.439
2.8GlyCys: 2.8 ± 1.797
3.563GlyAsp: 3.563 ± 0.414
3.309GlyGlu: 3.309 ± 0.638
0.255GlyPhe: 0.255 ± 0.224
0.255GlyGly: 0.255 ± 0.153
0.509GlyHis: 0.509 ± 0.146
5.854GlyIle: 5.854 ± 2.254
2.291GlyLys: 2.291 ± 0.766
3.054GlyLeu: 3.054 ± 0.306
1.273GlyMet: 1.273 ± 0.483
3.563GlyAsn: 3.563 ± 1.025
1.273GlyPro: 1.273 ± 0.313
1.273GlyGln: 1.273 ± 0.764
2.291GlyArg: 2.291 ± 0.571
2.036GlySer: 2.036 ± 1.457
2.545GlyThr: 2.545 ± 1.488
2.291GlyVal: 2.291 ± 0.766
0.764GlyTrp: 0.764 ± 0.198
2.036GlyTyr: 2.036 ± 0.705
0.0GlyXaa: 0.0 ± 0.0
His
1.018HisAla: 1.018 ± 1.027
0.764HisCys: 0.764 ± 0.673
0.509HisAsp: 0.509 ± 0.922
1.273HisGlu: 1.273 ± 0.787
1.527HisPhe: 1.527 ± 0.917
2.291HisGly: 2.291 ± 0.617
0.764HisHis: 0.764 ± 0.856
0.764HisIle: 0.764 ± 0.198
2.036HisLys: 2.036 ± 0.91
1.782HisLeu: 1.782 ± 0.763
1.273HisMet: 1.273 ± 0.744
1.527HisAsn: 1.527 ± 0.611
0.764HisPro: 0.764 ± 0.347
0.509HisGln: 0.509 ± 0.306
1.273HisArg: 1.273 ± 0.892
1.018HisSer: 1.018 ± 0.293
1.782HisThr: 1.782 ± 0.911
1.018HisVal: 1.018 ± 0.322
0.509HisTrp: 0.509 ± 0.306
1.273HisTyr: 1.273 ± 0.483
0.0HisXaa: 0.0 ± 0.0
Ile
5.345IleAla: 5.345 ± 0.5
2.036IleCys: 2.036 ± 1.131
3.309IleAsp: 3.309 ± 1.325
6.872IleGlu: 6.872 ± 2.074
4.327IlePhe: 4.327 ± 1.866
4.581IleGly: 4.581 ± 1.651
2.036IleHis: 2.036 ± 0.91
6.363IleIle: 6.363 ± 1.619
6.363IleLys: 6.363 ± 1.591
8.145IleLeu: 8.145 ± 2.065
2.291IleMet: 2.291 ± 1.04
3.054IleAsn: 3.054 ± 0.793
3.309IlePro: 3.309 ± 0.519
3.818IleGln: 3.818 ± 0.904
4.327IleArg: 4.327 ± 1.154
4.581IleSer: 4.581 ± 0.489
4.327IleThr: 4.327 ± 1.055
4.581IleVal: 4.581 ± 0.489
0.509IleTrp: 0.509 ± 0.306
1.782IleTyr: 1.782 ± 1.64
0.0IleXaa: 0.0 ± 0.0
Lys
4.072LysAla: 4.072 ± 1.212
2.8LysCys: 2.8 ± 1.475
6.108LysAsp: 6.108 ± 1.585
5.599LysGlu: 5.599 ± 1.495
3.054LysPhe: 3.054 ± 1.671
3.309LysGly: 3.309 ± 1.669
3.054LysHis: 3.054 ± 1.387
5.599LysIle: 5.599 ± 1.395
4.836LysLys: 4.836 ± 0.899
8.654LysLeu: 8.654 ± 0.887
2.291LysMet: 2.291 ± 0.587
4.327LysAsn: 4.327 ± 1.685
2.8LysPro: 2.8 ± 0.698
3.309LysGln: 3.309 ± 1.34
1.782LysArg: 1.782 ± 0.513
4.327LysSer: 4.327 ± 1.587
5.09LysThr: 5.09 ± 1.414
5.345LysVal: 5.345 ± 2.427
0.509LysTrp: 0.509 ± 0.306
2.8LysTyr: 2.8 ± 0.698
0.0LysXaa: 0.0 ± 0.0
Leu
4.072LeuAla: 4.072 ± 3.158
1.782LeuCys: 1.782 ± 0.623
4.836LeuAsp: 4.836 ± 1.981
7.89LeuGlu: 7.89 ± 1.188
3.309LeuPhe: 3.309 ± 1.105
2.8LeuGly: 2.8 ± 0.825
2.545LeuHis: 2.545 ± 0.45
6.617LeuIle: 6.617 ± 0.736
7.381LeuLys: 7.381 ± 3.409
7.89LeuLeu: 7.89 ± 1.941
2.036LeuMet: 2.036 ± 0.259
6.363LeuAsn: 6.363 ± 0.999
4.327LeuPro: 4.327 ± 1.274
2.036LeuGln: 2.036 ± 0.91
3.563LeuArg: 3.563 ± 0.165
8.145LeuSer: 8.145 ± 0.813
4.836LeuThr: 4.836 ± 2.032
5.345LeuVal: 5.345 ± 1.802
0.255LeuTrp: 0.255 ± 0.153
4.072LeuTyr: 4.072 ± 1.033
0.0LeuXaa: 0.0 ± 0.0
Met
1.527MetAla: 1.527 ± 0.396
1.018MetCys: 1.018 ± 0.565
2.545MetAsp: 2.545 ± 0.707
2.036MetGlu: 2.036 ± 0.813
1.527MetPhe: 1.527 ± 0.741
1.018MetGly: 1.018 ± 0.293
0.255MetHis: 0.255 ± 0.153
3.054MetIle: 3.054 ± 0.879
2.545MetLys: 2.545 ± 0.449
3.309MetLeu: 3.309 ± 0.33
1.018MetMet: 1.018 ± 0.322
1.527MetAsn: 1.527 ± 0.693
1.782MetPro: 1.782 ± 0.612
0.255MetGln: 0.255 ± 0.153
0.764MetArg: 0.764 ± 0.892
3.563MetSer: 3.563 ± 1.103
1.018MetThr: 1.018 ± 0.887
1.018MetVal: 1.018 ± 0.611
0.0MetTrp: 0.0 ± 0.0
0.764MetTyr: 0.764 ± 0.347
0.0MetXaa: 0.0 ± 0.0
Asn
3.818AsnAla: 3.818 ± 1.139
1.273AsnCys: 1.273 ± 0.787
4.327AsnAsp: 4.327 ± 0.445
3.818AsnGlu: 3.818 ± 0.214
2.291AsnPhe: 2.291 ± 0.477
1.018AsnGly: 1.018 ± 0.897
2.036AsnHis: 2.036 ± 0.813
5.09AsnIle: 5.09 ± 0.9
4.836AsnLys: 4.836 ± 1.2
4.836AsnLeu: 4.836 ± 0.373
1.782AsnMet: 1.782 ± 0.612
4.072AsnAsn: 4.072 ± 1.19
3.054AsnPro: 3.054 ± 0.306
2.036AsnGln: 2.036 ± 0.503
3.054AsnArg: 3.054 ± 0.758
2.8AsnSer: 2.8 ± 0.729
3.563AsnThr: 3.563 ± 0.754
3.054AsnVal: 3.054 ± 0.383
0.255AsnTrp: 0.255 ± 0.153
2.8AsnTyr: 2.8 ± 0.729
0.0AsnXaa: 0.0 ± 0.0
Pro
2.036ProAla: 2.036 ± 0.595
0.509ProCys: 0.509 ± 0.306
1.018ProAsp: 1.018 ± 0.322
2.545ProGlu: 2.545 ± 0.927
1.782ProPhe: 1.782 ± 0.447
2.036ProGly: 2.036 ± 0.548
0.764ProHis: 0.764 ± 0.347
3.818ProIle: 3.818 ± 0.904
2.291ProLys: 2.291 ± 0.587
1.527ProLeu: 1.527 ± 0.664
0.509ProMet: 0.509 ± 0.146
2.036ProAsn: 2.036 ± 0.548
0.764ProPro: 0.764 ± 0.198
1.018ProGln: 1.018 ± 0.611
1.018ProArg: 1.018 ± 0.293
3.054ProSer: 3.054 ± 0.54
2.545ProThr: 2.545 ± 0.449
2.545ProVal: 2.545 ± 0.626
0.764ProTrp: 0.764 ± 0.892
1.018ProTyr: 1.018 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
2.291GlnAla: 2.291 ± 0.594
0.509GlnCys: 0.509 ± 0.306
1.527GlnAsp: 1.527 ± 0.396
2.036GlnGlu: 2.036 ± 0.503
1.527GlnPhe: 1.527 ± 0.396
2.036GlnGly: 2.036 ± 0.705
0.509GlnHis: 0.509 ± 0.146
2.545GlnIle: 2.545 ± 0.979
3.818GlnLys: 3.818 ± 1.569
2.545GlnLeu: 2.545 ± 0.626
0.764GlnMet: 0.764 ± 0.892
2.036GlnAsn: 2.036 ± 1.69
0.764GlnPro: 0.764 ± 0.347
1.273GlnGln: 1.273 ± 2.814
1.782GlnArg: 1.782 ± 1.07
4.072GlnSer: 4.072 ± 1.171
2.036GlnThr: 2.036 ± 0.644
2.036GlnVal: 2.036 ± 0.503
0.509GlnTrp: 0.509 ± 0.942
1.018GlnTyr: 1.018 ± 0.79
0.0GlnXaa: 0.0 ± 0.0
Arg
1.273ArgAla: 1.273 ± 1.765
2.036ArgCys: 2.036 ± 0.827
3.818ArgAsp: 3.818 ± 2.292
2.545ArgGlu: 2.545 ± 0.927
2.036ArgPhe: 2.036 ± 0.595
0.255ArgGly: 0.255 ± 0.224
0.764ArgHis: 0.764 ± 0.198
2.8ArgIle: 2.8 ± 0.758
3.309ArgLys: 3.309 ± 1.055
4.072ArgLeu: 4.072 ± 1.37
2.036ArgMet: 2.036 ± 0.573
3.309ArgAsn: 3.309 ± 3.449
0.0ArgPro: 0.0 ± 0.0
2.291ArgGln: 2.291 ± 0.477
1.527ArgArg: 1.527 ± 0.611
2.545ArgSer: 2.545 ± 0.979
1.782ArgThr: 1.782 ± 0.763
2.8ArgVal: 2.8 ± 1.642
0.255ArgTrp: 0.255 ± 0.224
2.036ArgTyr: 2.036 ± 0.644
0.0ArgXaa: 0.0 ± 0.0
Ser
3.818SerAla: 3.818 ± 1.139
1.782SerCys: 1.782 ± 0.911
5.854SerAsp: 5.854 ± 0.938
2.545SerGlu: 2.545 ± 0.674
3.309SerPhe: 3.309 ± 0.519
3.054SerGly: 3.054 ± 0.672
1.273SerHis: 1.273 ± 0.751
5.854SerIle: 5.854 ± 1.599
7.126SerLys: 7.126 ± 0.729
6.108SerLeu: 6.108 ± 0.463
2.545SerMet: 2.545 ± 1.213
2.291SerAsn: 2.291 ± 1.352
2.036SerPro: 2.036 ± 0.91
1.527SerGln: 1.527 ± 0.741
3.054SerArg: 3.054 ± 1.221
3.309SerSer: 3.309 ± 0.903
4.581SerThr: 4.581 ± 1.189
5.854SerVal: 5.854 ± 1.599
0.764SerTrp: 0.764 ± 0.458
1.782SerTyr: 1.782 ± 0.911
0.0SerXaa: 0.0 ± 0.0
Thr
4.581ThrAla: 4.581 ± 1.045
1.527ThrCys: 1.527 ± 0.693
2.545ThrAsp: 2.545 ± 0.449
2.545ThrGlu: 2.545 ± 0.979
2.8ThrPhe: 2.8 ± 1.484
3.563ThrGly: 3.563 ± 1.52
0.255ThrHis: 0.255 ± 0.224
4.072ThrIle: 4.072 ± 1.171
3.818ThrLys: 3.818 ± 1.154
4.327ThrLeu: 4.327 ± 4.133
1.018ThrMet: 1.018 ± 0.322
3.054ThrAsn: 3.054 ± 0.879
1.782ThrPro: 1.782 ± 0.76
2.291ThrGln: 2.291 ± 0.766
2.545ThrArg: 2.545 ± 1.309
3.818ThrSer: 3.818 ± 1.154
2.545ThrThr: 2.545 ± 0.965
3.818ThrVal: 3.818 ± 1.139
0.764ThrTrp: 0.764 ± 0.856
4.072ThrTyr: 4.072 ± 1.033
0.0ThrXaa: 0.0 ± 0.0
Val
1.782ValAla: 1.782 ± 0.612
1.782ValCys: 1.782 ± 0.911
1.782ValAsp: 1.782 ± 0.911
3.054ValGlu: 3.054 ± 0.306
4.581ValPhe: 4.581 ± 1.045
2.291ValGly: 2.291 ± 0.766
1.018ValHis: 1.018 ± 0.293
2.8ValIle: 2.8 ± 0.833
4.581ValLys: 4.581 ± 1.173
3.563ValLeu: 3.563 ± 1.025
1.018ValMet: 1.018 ± 0.322
3.054ValAsn: 3.054 ± 1.548
2.036ValPro: 2.036 ± 0.548
2.291ValGln: 2.291 ± 1.585
2.036ValArg: 2.036 ± 0.548
5.854ValSer: 5.854 ± 0.697
3.054ValThr: 3.054 ± 1.387
1.273ValVal: 1.273 ± 0.313
0.764ValTrp: 0.764 ± 0.673
3.054ValTyr: 3.054 ± 1.387
0.0ValXaa: 0.0 ± 0.0
Trp
0.255TrpAla: 0.255 ± 0.153
0.0TrpCys: 0.0 ± 0.0
0.255TrpAsp: 0.255 ± 0.153
0.255TrpGlu: 0.255 ± 0.153
0.509TrpPhe: 0.509 ± 0.146
1.527TrpGly: 1.527 ± 0.439
0.255TrpHis: 0.255 ± 0.224
0.255TrpIle: 0.255 ± 0.153
0.0TrpLys: 0.0 ± 0.0
1.018TrpLeu: 1.018 ± 0.293
0.255TrpMet: 0.255 ± 0.976
0.764TrpAsn: 0.764 ± 0.458
0.0TrpPro: 0.0 ± 0.0
0.764TrpGln: 0.764 ± 0.458
0.0TrpArg: 0.0 ± 0.0
1.273TrpSer: 1.273 ± 0.908
0.255TrpThr: 0.255 ± 0.976
0.509TrpVal: 0.509 ± 0.306
0.0TrpTrp: 0.0 ± 0.0
0.509TrpTyr: 0.509 ± 0.306
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.036TyrAla: 2.036 ± 1.457
1.527TyrCys: 1.527 ± 0.693
1.018TyrAsp: 1.018 ± 0.79
3.563TyrGlu: 3.563 ± 1.025
2.291TyrPhe: 2.291 ± 0.594
1.782TyrGly: 1.782 ± 0.845
1.527TyrHis: 1.527 ± 0.693
5.599TyrIle: 5.599 ± 1.495
4.072TyrLys: 4.072 ± 1.949
3.309TyrLeu: 3.309 ± 1.292
1.527TyrMet: 1.527 ± 0.396
3.309TyrAsn: 3.309 ± 0.814
1.018TyrPro: 1.018 ± 0.293
0.764TyrGln: 0.764 ± 0.198
1.018TyrArg: 1.018 ± 0.79
2.545TyrSer: 2.545 ± 0.707
2.8TyrThr: 2.8 ± 0.758
0.255TyrVal: 0.255 ± 0.224
0.0TyrTrp: 0.0 ± 0.0
1.527TyrTyr: 1.527 ± 0.693
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3930 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski