Amino acid dipeptide frequency for Mona Grita virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
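The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names, the toy sequences, and the choice to normalize by the total number of overlapping residue pairs are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is mapped to "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: frequencies are normalized by the total number of pairs
    counted across all sequences; pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def uniform_expectation(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000 / alphabet_size ** 2


def classify(value, expected=None):
    """Label a per-mille value relative to the uniform expectation."""
    if expected is None:
        expected = uniform_expectation()
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from the proteome described here.
    toy = ["MKLSSLLA", "MSSLEV"]
    for pair, value in sorted(dipeptide_per_mille(toy).items()):
        print(pair, round(value, 3), classify(value))
```

With the default 20-letter alphabet, `uniform_expectation()` gives 2.5‰, so a table value such as 12.418 (SerSer) would be labeled "over" and 0.507 (CysCys) "under".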

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
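As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and into a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


if __name__ == "__main__":
    # 2.5 per mille is the uniform expectation for 400 equally likely dipeptides.
    for value in (2.5, 12.418):
        fraction = per_mille_to_fraction(value)
        percent = per_mille_to_percent(value)
        print(f"{value} per mille = {fraction:.6f} = {percent:.4f}%")
```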

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue; cells: frequency ± error (‰).

     Ala           Cys           Asp           Glu           Phe           Gly           His           Ile           Lys           Leu           Met           Asn           Pro           Gln           Arg           Ser           Thr           Val           Trp           Tyr           Xaa
Ala  4.562±1.291   1.774±0.632   1.267±0.777   2.788±1.483   2.281±2.077   3.294±1.188   1.267±0.857   4.308±0.502   1.267±0.361   4.562±1.246   1.774±0.869   0.507±0.342   1.267±0.364   1.014±0.813   1.774±0.879   4.055±1.161   2.534±0.528   4.055±1.282   0.507±0.608   1.521±0.916   0.0±0.0
Cys  0.507±0.155   0.507±0.342   0.76±0.513    1.774±0.506   1.774±0.506   1.014±0.558   0.76±0.668    1.267±0.484   2.534±0.968   2.788±1.168   0.253±0.171   1.267±0.777   0.76±0.343    1.521±0.841   1.267±0.484   3.548±1.799   1.267±0.484   1.267±0.65    0.0±0.0       1.014±0.31    0.0±0.0
Asp  2.534±0.651   2.027±0.596   3.548±1.329   2.788±0.328   2.027±0.899   3.294±0.544   1.267±0.41    4.308±2.166   3.548±0.753   4.815±1.47    1.521±0.287   2.788±0.426   2.534±0.317   1.774±1.249   2.027±0.596   3.801±0.717   2.027±0.861   2.788±0.934   1.014±0.385   1.267±1.083   0.0±0.0
Glu  4.308±1.309   1.014±0.558   4.815±1.212   5.829±1.822   4.055±0.924   4.562±0.931   0.507±0.342   6.082±0.741   2.534±0.651   5.068±0.933   2.534±1.092   2.281±0.698   2.788±0.555   1.014±0.558   2.788±0.968   5.322±1.059   2.534±1.157   5.829±0.87    0.253±0.171   1.774±0.506   0.0±0.0
Phe  2.788±2.016   1.521±0.998   2.027±1.539   1.774±0.962   1.774±0.879   2.534±1.351   0.507±0.155   3.041±1.185   2.788±1.257   4.815±0.623   1.521±0.309   2.534±0.568   2.027±0.971   2.027±0.596   3.548±1.43    5.068±0.646   2.534±1.677   4.308±0.683   1.014±0.431   0.507±0.608   0.0±0.0
Gly  3.801±0.823   1.267±0.484   2.788±0.465   3.041±0.831   3.548±0.672   5.575±1.197   1.774±0.33    4.562±0.626   5.068±0.824   4.815±1.256   2.027±0.702   2.534±1.126   2.534±0.388   2.027±0.403   2.534±0.428   6.842±2.165   2.027±0.512   5.068±0.873   1.521±1.316   1.521±1.697   0.0±0.0
His  0.507±0.461   0.76±0.239    0.76±0.239    0.76±0.239    1.014±0.431   2.027±0.77    0.253±0.223   1.267±0.546   0.76±0.679    1.267±0.484   0.0±0.0       1.014±1.113   2.027±0.806   1.014±0.31    1.267±0.426   2.788±1.168   0.76±0.343    1.774±0.632   0.0±0.0       1.267±0.546   0.0±0.0
Ile  2.788±0.886   1.521±0.33    2.788±0.384   4.815±1.056   3.294±0.825   3.801±1.24    1.774±0.632   3.294±1.163   3.801±1.386   7.096±1.661   1.774±0.369   3.548±1.526   4.308±1.931   2.281±0.503   3.041±0.868   7.349±1.264   4.055±0.73    3.801±1.449   0.253±0.171   1.774±0.879   0.0±0.0
Lys  3.041±0.617   1.774±0.632   4.562±1.395   3.548±0.498   2.281±1.218   2.281±0.39    1.014±0.385   4.308±0.709   4.308±0.648   6.082±0.831   2.788±1.72    2.534±0.412   4.055±0.806   2.027±0.596   2.534±0.428   5.575±0.697   3.041±0.93    4.055±1.803   1.521±0.477   1.521±0.712   0.0±0.0
Leu  5.068±0.776   2.027±0.77    3.548±0.988   6.082±0.987   5.068±0.933   6.082±1.049   2.534±0.681   4.815±1.409   8.109±1.295   8.363±0.985   2.534±0.448   3.801±1.486   2.281±1.167   3.548±1.846   4.815±1.205   10.644±1.285  4.562±0.654   4.562±0.506   0.253±0.223   2.534±1.058   0.0±0.0
Met  1.014±0.837   0.253±0.223   3.041±1.218   1.521±0.485   0.76±0.513    1.774±0.869   0.76±0.458    2.788±0.678   0.76±0.239    3.548±0.879   2.534±0.653   2.281±1.329   0.253±0.223   0.76±0.239    2.534±0.681   2.534±0.681   2.027±0.721   1.521±0.465   0.0±0.0       0.507±0.342   0.0±0.0
Asn  2.027±1.215   0.76±0.668    1.774±0.554   2.788±0.664   2.281±0.866   2.788±0.384   0.507±0.155   3.041±0.416   4.308±0.775   3.041±0.711   1.014±0.994   2.027±1.245   2.534±0.388   1.014±0.385   2.027±0.843   3.801±0.876   2.027±0.512   2.027±0.512   0.507±0.445   1.521±0.309   0.0±0.0
Pro  0.507±0.342   0.76±0.343    1.521±0.33    5.829±1.098   2.534±0.82    2.788±1.249   0.0±0.0       2.788±0.855   1.521±0.518   3.548±1.259   2.027±0.567   2.788±0.348   0.253±0.171   1.267±0.751   1.521±0.309   4.562±0.456   2.534±1.734   1.521±0.824   1.014±0.684   1.267±0.361   0.0±0.0
Gln  1.521±0.867   1.267±0.777   1.521±0.465   1.774±0.658   1.014±0.361   2.027±0.596   1.267±0.546   2.788±0.831   3.041±0.868   2.281±1.592   0.507±0.342   1.267±0.361   2.027±1.085   0.76±0.239    1.014±0.361   3.294±1.087   2.281±0.488   2.534±0.388   0.0±0.0       1.267±0.484   0.0±0.0
Arg  2.534±0.412   0.507±0.445   3.294±1.03    2.281±0.716   2.027±1.638   3.548±1.738   0.76±0.239    3.548±1.017   3.041±0.252   4.055±0.559   1.521±0.983   1.521±0.465   2.281±0.698   1.521±0.477   2.534±0.538   4.562±1.859   2.534±1.092   3.801±1.127   1.014±0.594   1.267±0.484   0.0±0.0
Ser  4.562±1.246   3.294±2.552   4.815±0.565   8.363±2.415   5.068±2.117   7.096±1.932   2.281±0.782   6.589±0.545   6.082±1.805   11.151±2.608  2.281±0.341   2.534±0.538   4.562±1.137   4.308±1.394   5.068±1.056   12.418±1.326  4.562±1.792   4.308±0.367   1.774±0.33    2.027±0.77    0.0±0.0
Thr  1.014±0.512   1.521±0.436   3.548±0.859   3.801±1.093   2.534±0.82    4.815±1.56    1.014±0.534   2.534±0.428   2.534±0.677   5.575±0.59    1.267±0.426   1.774±0.962   1.774±0.506   2.281±0.341   2.281±0.702   4.562±0.832   5.068±0.939   1.774±0.879   0.76±0.964    1.521±1.4     0.0±0.0
Val  2.281±0.503   2.281±0.782   3.041±1.384   4.308±0.603   2.534±0.568   3.294±1.264   2.534±0.568   3.041±0.493   4.308±1.084   3.801±0.703   2.534±0.681   2.788±1.186   0.76±0.239    1.774±0.658   4.308±0.645   8.616±1.636   2.281±0.465   5.068±1.766   0.76±0.513    2.788±0.328   0.0±0.0
Trp  0.253±0.171   0.0±0.0       0.76±0.513    0.253±0.171   1.014±0.534   0.76±0.498    0.0±0.0       1.014±0.594   0.76±0.498    1.014±0.558   0.507±0.155   0.253±0.171   0.507±0.557   0.507±0.461   0.507±0.155   1.014±0.431   1.267±0.386   1.774±0.315   0.253±0.171   0.253±0.171   0.0±0.0
Tyr  0.76±0.458    1.014±0.558   1.774±0.962   1.774±0.399   1.774±0.59    1.521±0.465   0.507±0.342   1.774±0.506   1.774±0.33    3.548±1.738   0.0±0.0       1.774±0.315   1.014±0.847   1.014±0.385   0.76±0.498    2.788±1.453   2.027±1.048   1.267±0.546   0.253±0.223   1.014±0.385   0.0±0.0
Xaa  0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0       0.0±0.0

Statistics based on 4 proteins (3947 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
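A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the sample standard deviation of the replicate values. The exact procedure used by Proteome-pI may differ, and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        values.append(pair_per_mille(sample, pair))
    return statistics.stdev(values)


if __name__ == "__main__":
    # Hypothetical toy proteome with 4 short "proteins".
    toy = ["MSSLLKSSA", "MKLLSSE", "MASSLV", "MLLKE"]
    estimate = pair_per_mille(toy, "SS")
    error = bootstrap_error(toy, "SS")
    print(f"SS: {estimate:.3f} ± {error:.3f} per mille")
```

A dipeptide that never occurs in any protein yields the same value in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.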

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski