Amino acid dipepetide frequency for Haloarcula hispanica icosahedral virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.916AlaAla: 13.916 ± 1.569
1.07AlaCys: 1.07 ± 0.439
12.524AlaAsp: 12.524 ± 0.988
9.313AlaGlu: 9.313 ± 1.118
2.141AlaPhe: 2.141 ± 0.453
10.062AlaGly: 10.062 ± 1.901
1.285AlaHis: 1.285 ± 0.302
2.355AlaIle: 2.355 ± 0.393
1.177AlaLys: 1.177 ± 0.51
7.814AlaLeu: 7.814 ± 1.027
1.606AlaMet: 1.606 ± 0.499
2.89AlaAsn: 2.89 ± 0.471
3.854AlaPro: 3.854 ± 0.599
2.141AlaGln: 2.141 ± 0.376
6.744AlaArg: 6.744 ± 0.666
5.994AlaSer: 5.994 ± 0.916
7.6AlaThr: 7.6 ± 1.524
11.561AlaVal: 11.561 ± 1.166
1.927AlaTrp: 1.927 ± 0.604
2.034AlaTyr: 2.034 ± 0.504
0.0AlaXaa: 0.0 ± 0.0
Cys
0.428CysAla: 0.428 ± 0.236
0.214CysCys: 0.214 ± 0.186
0.749CysAsp: 0.749 ± 0.279
0.749CysGlu: 0.749 ± 0.353
0.0CysPhe: 0.0 ± 0.0
1.285CysGly: 1.285 ± 0.498
0.428CysHis: 0.428 ± 0.39
0.107CysIle: 0.107 ± 0.112
0.214CysLys: 0.214 ± 0.166
0.428CysLeu: 0.428 ± 0.281
0.0CysMet: 0.0 ± 0.0
0.107CysAsn: 0.107 ± 0.11
1.07CysPro: 1.07 ± 0.548
0.642CysGln: 0.642 ± 0.307
0.642CysArg: 0.642 ± 0.252
0.642CysSer: 0.642 ± 0.33
0.214CysThr: 0.214 ± 0.179
0.107CysVal: 0.107 ± 0.127
0.214CysTrp: 0.214 ± 0.152
0.107CysTyr: 0.107 ± 0.123
0.0CysXaa: 0.0 ± 0.0
Asp
11.775AspAla: 11.775 ± 1.116
0.856AspCys: 0.856 ± 0.404
14.451AspAsp: 14.451 ± 2.565
9.634AspGlu: 9.634 ± 1.127
1.07AspPhe: 1.07 ± 0.374
12.203AspGly: 12.203 ± 1.176
2.569AspHis: 2.569 ± 0.698
2.141AspIle: 2.141 ± 0.524
1.82AspLys: 1.82 ± 0.519
7.707AspLeu: 7.707 ± 1.057
2.034AspMet: 2.034 ± 0.467
2.248AspAsn: 2.248 ± 0.355
7.065AspPro: 7.065 ± 1.131
2.248AspGln: 2.248 ± 0.475
5.245AspArg: 5.245 ± 0.695
5.78AspSer: 5.78 ± 0.721
6.423AspThr: 6.423 ± 0.909
6.744AspVal: 6.744 ± 0.845
1.07AspTrp: 1.07 ± 0.396
3.211AspTyr: 3.211 ± 0.58
0.0AspXaa: 0.0 ± 0.0
Glu
11.561GluAla: 11.561 ± 1.116
0.642GluCys: 0.642 ± 0.298
7.386GluAsp: 7.386 ± 1.251
7.279GluGlu: 7.279 ± 1.157
1.499GluPhe: 1.499 ± 0.34
6.744GluGly: 6.744 ± 1.034
2.997GluHis: 2.997 ± 0.861
2.034GluIle: 2.034 ± 0.651
1.499GluLys: 1.499 ± 0.332
3.211GluLeu: 3.211 ± 0.627
1.499GluMet: 1.499 ± 0.478
2.355GluAsn: 2.355 ± 0.593
4.068GluPro: 4.068 ± 0.68
5.031GluGln: 5.031 ± 0.917
5.673GluArg: 5.673 ± 0.864
5.673GluSer: 5.673 ± 0.732
3.854GluThr: 3.854 ± 0.649
8.778GluVal: 8.778 ± 1.133
1.07GluTrp: 1.07 ± 0.416
3.639GluTyr: 3.639 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
1.82PheAla: 1.82 ± 0.462
0.0PheCys: 0.0 ± 0.0
2.783PheAsp: 2.783 ± 0.584
1.927PheGlu: 1.927 ± 0.438
0.0PhePhe: 0.0 ± 0.0
2.676PheGly: 2.676 ± 0.506
0.642PheHis: 0.642 ± 0.396
0.749PheIle: 0.749 ± 0.216
0.107PheLys: 0.107 ± 0.116
0.963PheLeu: 0.963 ± 0.309
0.535PheMet: 0.535 ± 0.295
0.642PheAsn: 0.642 ± 0.27
0.749PhePro: 0.749 ± 0.222
0.321PheGln: 0.321 ± 0.221
1.285PheArg: 1.285 ± 0.287
1.07PheSer: 1.07 ± 0.311
1.392PheThr: 1.392 ± 0.448
0.963PheVal: 0.963 ± 0.304
0.321PheTrp: 0.321 ± 0.168
0.749PheTyr: 0.749 ± 0.329
0.0PheXaa: 0.0 ± 0.0
Gly
9.313GlyAla: 9.313 ± 2.183
0.749GlyCys: 0.749 ± 0.332
10.169GlyAsp: 10.169 ± 0.91
6.209GlyGlu: 6.209 ± 0.766
1.285GlyPhe: 1.285 ± 0.306
9.741GlyGly: 9.741 ± 0.997
1.82GlyHis: 1.82 ± 0.418
3.425GlyIle: 3.425 ± 0.641
2.248GlyLys: 2.248 ± 0.45
6.101GlyLeu: 6.101 ± 0.94
1.392GlyMet: 1.392 ± 0.323
3.318GlyAsn: 3.318 ± 0.811
3.425GlyPro: 3.425 ± 0.646
4.817GlyGln: 4.817 ± 0.879
7.279GlyArg: 7.279 ± 0.992
5.673GlySer: 5.673 ± 0.96
6.637GlyThr: 6.637 ± 1.305
7.493GlyVal: 7.493 ± 0.865
0.749GlyTrp: 0.749 ± 0.232
2.355GlyTyr: 2.355 ± 0.572
0.0GlyXaa: 0.0 ± 0.0
His
1.606HisAla: 1.606 ± 0.548
0.107HisCys: 0.107 ± 0.123
2.355HisAsp: 2.355 ± 0.634
2.355HisGlu: 2.355 ± 0.508
0.321HisPhe: 0.321 ± 0.167
1.285HisGly: 1.285 ± 0.46
0.107HisHis: 0.107 ± 0.114
0.963HisIle: 0.963 ± 0.319
0.428HisLys: 0.428 ± 0.209
1.392HisLeu: 1.392 ± 0.41
0.321HisMet: 0.321 ± 0.188
0.428HisAsn: 0.428 ± 0.174
1.07HisPro: 1.07 ± 0.276
0.535HisGln: 0.535 ± 0.208
1.285HisArg: 1.285 ± 0.338
0.749HisSer: 0.749 ± 0.227
0.963HisThr: 0.963 ± 0.326
2.034HisVal: 2.034 ± 0.592
0.321HisTrp: 0.321 ± 0.165
0.856HisTyr: 0.856 ± 0.286
0.0HisXaa: 0.0 ± 0.0
Ile
2.997IleAla: 2.997 ± 0.569
0.107IleCys: 0.107 ± 0.116
3.532IleAsp: 3.532 ± 0.508
2.997IleGlu: 2.997 ± 0.548
0.321IlePhe: 0.321 ± 0.188
1.713IleGly: 1.713 ± 0.394
0.642IleHis: 0.642 ± 0.283
0.749IleIle: 0.749 ± 0.219
0.749IleLys: 0.749 ± 0.286
1.07IleLeu: 1.07 ± 0.4
0.214IleMet: 0.214 ± 0.162
2.034IleAsn: 2.034 ± 0.527
1.07IlePro: 1.07 ± 0.405
1.177IleGln: 1.177 ± 0.345
1.177IleArg: 1.177 ± 0.468
2.034IleSer: 2.034 ± 0.745
1.499IleThr: 1.499 ± 0.385
2.034IleVal: 2.034 ± 0.551
0.428IleTrp: 0.428 ± 0.277
0.749IleTyr: 0.749 ± 0.262
0.0IleXaa: 0.0 ± 0.0
Lys
2.248LysAla: 2.248 ± 0.361
0.214LysCys: 0.214 ± 0.16
1.285LysAsp: 1.285 ± 0.385
0.749LysGlu: 0.749 ± 0.307
0.321LysPhe: 0.321 ± 0.151
1.499LysGly: 1.499 ± 0.4
0.642LysHis: 0.642 ± 0.23
0.642LysIle: 0.642 ± 0.209
0.214LysLys: 0.214 ± 0.13
1.606LysLeu: 1.606 ± 0.426
0.642LysMet: 0.642 ± 0.244
0.642LysAsn: 0.642 ± 0.23
0.963LysPro: 0.963 ± 0.368
0.963LysGln: 0.963 ± 0.332
1.285LysArg: 1.285 ± 0.339
0.749LysSer: 0.749 ± 0.295
1.82LysThr: 1.82 ± 0.481
1.927LysVal: 1.927 ± 0.435
0.321LysTrp: 0.321 ± 0.168
0.749LysTyr: 0.749 ± 0.344
0.0LysXaa: 0.0 ± 0.0
Leu
8.778LeuAla: 8.778 ± 1.087
0.321LeuCys: 0.321 ± 0.184
5.245LeuAsp: 5.245 ± 0.704
3.854LeuGlu: 3.854 ± 0.683
1.606LeuPhe: 1.606 ± 0.476
8.135LeuGly: 8.135 ± 0.854
1.285LeuHis: 1.285 ± 0.369
1.285LeuIle: 1.285 ± 0.403
1.07LeuLys: 1.07 ± 0.318
5.352LeuLeu: 5.352 ± 0.97
1.177LeuMet: 1.177 ± 0.369
0.749LeuAsn: 0.749 ± 0.314
3.532LeuPro: 3.532 ± 0.761
1.927LeuGln: 1.927 ± 0.506
5.673LeuArg: 5.673 ± 0.812
5.459LeuSer: 5.459 ± 0.835
5.245LeuThr: 5.245 ± 0.902
4.389LeuVal: 4.389 ± 0.871
0.642LeuTrp: 0.642 ± 0.232
1.606LeuTyr: 1.606 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
2.462MetAla: 2.462 ± 0.604
0.107MetCys: 0.107 ± 0.116
0.963MetAsp: 0.963 ± 0.308
1.07MetGlu: 1.07 ± 0.297
0.749MetPhe: 0.749 ± 0.295
1.606MetGly: 1.606 ± 0.401
0.642MetHis: 0.642 ± 0.236
0.749MetIle: 0.749 ± 0.328
0.321MetLys: 0.321 ± 0.169
1.177MetLeu: 1.177 ± 0.386
0.321MetMet: 0.321 ± 0.192
0.535MetAsn: 0.535 ± 0.258
0.856MetPro: 0.856 ± 0.266
0.214MetGln: 0.214 ± 0.126
0.856MetArg: 0.856 ± 0.344
2.676MetSer: 2.676 ± 0.52
1.606MetThr: 1.606 ± 0.586
0.642MetVal: 0.642 ± 0.254
0.107MetTrp: 0.107 ± 0.115
0.321MetTyr: 0.321 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
1.392AsnAla: 1.392 ± 0.341
0.642AsnCys: 0.642 ± 0.24
2.355AsnAsp: 2.355 ± 0.537
2.355AsnGlu: 2.355 ± 0.346
0.749AsnPhe: 0.749 ± 0.392
4.282AsnGly: 4.282 ± 0.694
0.535AsnHis: 0.535 ± 0.201
1.927AsnIle: 1.927 ± 0.451
0.749AsnLys: 0.749 ± 0.24
1.07AsnLeu: 1.07 ± 0.296
0.535AsnMet: 0.535 ± 0.254
0.535AsnAsn: 0.535 ± 0.267
2.355AsnPro: 2.355 ± 0.453
0.963AsnGln: 0.963 ± 0.412
1.285AsnArg: 1.285 ± 0.396
1.713AsnSer: 1.713 ± 0.617
1.499AsnThr: 1.499 ± 0.433
2.034AsnVal: 2.034 ± 0.486
0.428AsnTrp: 0.428 ± 0.244
1.07AsnTyr: 1.07 ± 0.384
0.0AsnXaa: 0.0 ± 0.0
Pro
5.352ProAla: 5.352 ± 0.775
0.642ProCys: 0.642 ± 0.357
6.423ProAsp: 6.423 ± 0.908
5.887ProGlu: 5.887 ± 1.05
1.285ProPhe: 1.285 ± 0.358
4.282ProGly: 4.282 ± 0.779
0.749ProHis: 0.749 ± 0.244
0.963ProIle: 0.963 ± 0.336
0.642ProLys: 0.642 ± 0.257
3.211ProLeu: 3.211 ± 0.627
0.428ProMet: 0.428 ± 0.192
1.82ProAsn: 1.82 ± 0.443
1.82ProPro: 1.82 ± 0.572
1.285ProGln: 1.285 ± 0.425
1.82ProArg: 1.82 ± 0.516
2.462ProSer: 2.462 ± 0.608
3.639ProThr: 3.639 ± 0.644
4.603ProVal: 4.603 ± 0.895
0.428ProTrp: 0.428 ± 0.186
1.499ProTyr: 1.499 ± 0.45
0.0ProXaa: 0.0 ± 0.0
Gln
3.532GlnAla: 3.532 ± 0.551
0.214GlnCys: 0.214 ± 0.175
2.248GlnAsp: 2.248 ± 0.625
0.856GlnGlu: 0.856 ± 0.302
1.499GlnPhe: 1.499 ± 0.462
1.606GlnGly: 1.606 ± 0.427
1.07GlnHis: 1.07 ± 0.413
1.713GlnIle: 1.713 ± 0.554
0.642GlnLys: 0.642 ± 0.265
2.89GlnLeu: 2.89 ± 0.498
0.535GlnMet: 0.535 ± 0.237
1.07GlnAsn: 1.07 ± 0.292
1.392GlnPro: 1.392 ± 0.523
2.248GlnGln: 2.248 ± 0.548
2.89GlnArg: 2.89 ± 0.603
2.034GlnSer: 2.034 ± 0.529
2.569GlnThr: 2.569 ± 0.566
2.248GlnVal: 2.248 ± 0.498
1.07GlnTrp: 1.07 ± 0.374
1.82GlnTyr: 1.82 ± 0.393
0.0GlnXaa: 0.0 ± 0.0
Arg
5.245ArgAla: 5.245 ± 0.662
0.535ArgCys: 0.535 ± 0.287
4.068ArgAsp: 4.068 ± 0.724
6.209ArgGlu: 6.209 ± 0.969
1.82ArgPhe: 1.82 ± 0.366
3.747ArgGly: 3.747 ± 0.682
0.642ArgHis: 0.642 ± 0.228
1.713ArgIle: 1.713 ± 0.423
1.713ArgLys: 1.713 ± 0.489
4.924ArgLeu: 4.924 ± 0.76
1.499ArgMet: 1.499 ± 0.338
1.713ArgAsn: 1.713 ± 0.399
4.068ArgPro: 4.068 ± 0.665
3.211ArgGln: 3.211 ± 0.574
5.031ArgArg: 5.031 ± 1.247
2.997ArgSer: 2.997 ± 0.45
2.462ArgThr: 2.462 ± 0.585
6.637ArgVal: 6.637 ± 0.871
1.285ArgTrp: 1.285 ± 0.41
2.355ArgTyr: 2.355 ± 0.66
0.0ArgXaa: 0.0 ± 0.0
Ser
6.637SerAla: 6.637 ± 1.167
0.535SerCys: 0.535 ± 0.279
7.814SerAsp: 7.814 ± 1.096
4.71SerGlu: 4.71 ± 0.663
1.285SerPhe: 1.285 ± 0.302
7.172SerGly: 7.172 ± 1.027
0.428SerHis: 0.428 ± 0.185
1.713SerIle: 1.713 ± 0.411
1.392SerLys: 1.392 ± 0.406
4.068SerLeu: 4.068 ± 0.657
1.392SerMet: 1.392 ± 0.368
2.034SerAsn: 2.034 ± 0.506
2.783SerPro: 2.783 ± 0.48
0.963SerGln: 0.963 ± 0.362
3.318SerArg: 3.318 ± 0.715
2.569SerSer: 2.569 ± 0.634
4.603SerThr: 4.603 ± 0.606
4.068SerVal: 4.068 ± 0.893
0.749SerTrp: 0.749 ± 0.219
2.141SerTyr: 2.141 ± 0.631
0.0SerXaa: 0.0 ± 0.0
Thr
6.423ThrAla: 6.423 ± 0.964
0.214ThrCys: 0.214 ± 0.145
7.279ThrAsp: 7.279 ± 0.938
5.245ThrGlu: 5.245 ± 0.744
1.285ThrPhe: 1.285 ± 0.518
4.924ThrGly: 4.924 ± 0.977
0.856ThrHis: 0.856 ± 0.296
2.355ThrIle: 2.355 ± 0.495
0.963ThrLys: 0.963 ± 0.271
5.994ThrLeu: 5.994 ± 0.813
0.428ThrMet: 0.428 ± 0.207
1.82ThrAsn: 1.82 ± 0.351
3.854ThrPro: 3.854 ± 0.586
2.248ThrGln: 2.248 ± 0.532
2.569ThrArg: 2.569 ± 0.469
3.639ThrSer: 3.639 ± 0.583
4.068ThrThr: 4.068 ± 0.968
6.958ThrVal: 6.958 ± 0.756
0.856ThrTrp: 0.856 ± 0.247
1.392ThrTyr: 1.392 ± 0.482
0.0ThrXaa: 0.0 ± 0.0
Val
9.634ValAla: 9.634 ± 1.229
0.642ValCys: 0.642 ± 0.334
9.313ValAsp: 9.313 ± 1.347
11.24ValGlu: 11.24 ± 1.075
1.285ValPhe: 1.285 ± 0.371
7.6ValGly: 7.6 ± 0.786
1.07ValHis: 1.07 ± 0.332
0.856ValIle: 0.856 ± 0.32
2.034ValLys: 2.034 ± 0.494
5.245ValLeu: 5.245 ± 1.014
2.034ValMet: 2.034 ± 0.49
2.248ValAsn: 2.248 ± 0.482
4.068ValPro: 4.068 ± 0.878
1.499ValGln: 1.499 ± 0.356
4.603ValArg: 4.603 ± 0.761
4.71ValSer: 4.71 ± 0.774
4.282ValThr: 4.282 ± 0.644
6.101ValVal: 6.101 ± 0.907
0.963ValTrp: 0.963 ± 0.271
3.639ValTyr: 3.639 ± 0.504
0.0ValXaa: 0.0 ± 0.0
Trp
1.07TrpAla: 1.07 ± 0.481
0.214TrpCys: 0.214 ± 0.226
1.606TrpAsp: 1.606 ± 0.495
0.749TrpGlu: 0.749 ± 0.272
0.749TrpPhe: 0.749 ± 0.265
1.07TrpGly: 1.07 ± 0.442
0.428TrpHis: 0.428 ± 0.216
0.214TrpIle: 0.214 ± 0.13
0.321TrpLys: 0.321 ± 0.198
1.07TrpLeu: 1.07 ± 0.413
0.428TrpMet: 0.428 ± 0.193
0.321TrpAsn: 0.321 ± 0.239
0.214TrpPro: 0.214 ± 0.164
0.428TrpGln: 0.428 ± 0.189
1.07TrpArg: 1.07 ± 0.406
1.499TrpSer: 1.499 ± 0.377
0.749TrpThr: 0.749 ± 0.292
0.856TrpVal: 0.856 ± 0.273
0.0TrpTrp: 0.0 ± 0.0
0.214TrpTyr: 0.214 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.927TyrAla: 1.927 ± 0.392
0.321TyrCys: 0.321 ± 0.248
3.747TyrAsp: 3.747 ± 0.787
3.104TyrGlu: 3.104 ± 0.674
0.428TyrPhe: 0.428 ± 0.226
2.569TyrGly: 2.569 ± 0.587
0.642TyrHis: 0.642 ± 0.222
0.642TyrIle: 0.642 ± 0.243
1.177TyrLys: 1.177 ± 0.348
2.034TyrLeu: 2.034 ± 0.537
0.856TyrMet: 0.856 ± 0.276
0.963TyrAsn: 0.963 ± 0.485
0.963TyrPro: 0.963 ± 0.397
1.285TyrGln: 1.285 ± 0.425
2.141TyrArg: 2.141 ± 0.449
2.248TyrSer: 2.248 ± 0.355
2.248TyrThr: 2.248 ± 0.484
2.89TyrVal: 2.89 ± 0.465
0.321TyrTrp: 0.321 ± 0.153
1.07TyrTyr: 1.07 ± 0.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (9343 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski