Amino acid dipepetide frequency for Clostridium virus phiC2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.334AlaAla: 2.334 ± 0.727
0.549AlaCys: 0.549 ± 0.212
2.196AlaAsp: 2.196 ± 0.4
3.844AlaGlu: 3.844 ± 0.578
1.991AlaPhe: 1.991 ± 0.432
3.089AlaGly: 3.089 ± 0.646
0.48AlaHis: 0.48 ± 0.146
5.011AlaIle: 5.011 ± 0.545
4.667AlaLys: 4.667 ± 0.664
3.981AlaLeu: 3.981 ± 0.533
1.579AlaMet: 1.579 ± 0.319
2.471AlaAsn: 2.471 ± 0.387
1.098AlaPro: 1.098 ± 0.319
1.716AlaGln: 1.716 ± 0.378
1.785AlaArg: 1.785 ± 0.412
3.638AlaSer: 3.638 ± 0.531
3.569AlaThr: 3.569 ± 0.537
2.334AlaVal: 2.334 ± 0.4
0.618AlaTrp: 0.618 ± 0.199
1.716AlaTyr: 1.716 ± 0.424
0.0AlaXaa: 0.0 ± 0.0
Cys
0.206CysAla: 0.206 ± 0.107
0.412CysCys: 0.412 ± 0.165
1.098CysAsp: 1.098 ± 0.284
1.236CysGlu: 1.236 ± 0.257
0.48CysPhe: 0.48 ± 0.193
0.48CysGly: 0.48 ± 0.307
0.069CysHis: 0.069 ± 0.08
0.961CysIle: 0.961 ± 0.26
1.441CysLys: 1.441 ± 0.311
1.236CysLeu: 1.236 ± 0.306
0.412CysMet: 0.412 ± 0.169
0.618CysAsn: 0.618 ± 0.21
0.412CysPro: 0.412 ± 0.148
0.275CysGln: 0.275 ± 0.12
0.824CysArg: 0.824 ± 0.263
0.755CysSer: 0.755 ± 0.269
0.412CysThr: 0.412 ± 0.159
0.549CysVal: 0.549 ± 0.169
0.206CysTrp: 0.206 ± 0.133
0.48CysTyr: 0.48 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
3.089AspAla: 3.089 ± 0.416
1.03AspCys: 1.03 ± 0.257
4.187AspAsp: 4.187 ± 0.591
5.148AspGlu: 5.148 ± 0.61
3.295AspPhe: 3.295 ± 0.513
3.501AspGly: 3.501 ± 0.418
0.069AspHis: 0.069 ± 0.058
7.138AspIle: 7.138 ± 0.559
7.688AspLys: 7.688 ± 0.77
4.805AspLeu: 4.805 ± 0.547
1.03AspMet: 1.03 ± 0.235
4.599AspAsn: 4.599 ± 0.493
0.686AspPro: 0.686 ± 0.21
0.549AspGln: 0.549 ± 0.167
2.059AspArg: 2.059 ± 0.429
3.432AspSer: 3.432 ± 0.468
3.295AspThr: 3.295 ± 0.407
3.02AspVal: 3.02 ± 0.456
0.755AspTrp: 0.755 ± 0.202
3.157AspTyr: 3.157 ± 0.396
0.0AspXaa: 0.0 ± 0.0
Glu
3.432GluAla: 3.432 ± 0.463
0.961GluCys: 0.961 ± 0.278
5.354GluAsp: 5.354 ± 0.508
7.276GluGlu: 7.276 ± 0.871
3.569GluPhe: 3.569 ± 0.418
4.118GluGly: 4.118 ± 0.519
0.48GluHis: 0.48 ± 0.153
7.55GluIle: 7.55 ± 0.912
9.747GluLys: 9.747 ± 0.86
9.609GluLeu: 9.609 ± 0.944
2.471GluMet: 2.471 ± 0.37
6.315GluAsn: 6.315 ± 0.783
1.304GluPro: 1.304 ± 0.358
2.951GluGln: 2.951 ± 0.384
2.951GluArg: 2.951 ± 0.439
3.912GluSer: 3.912 ± 0.399
3.775GluThr: 3.775 ± 0.638
4.393GluVal: 4.393 ± 0.639
0.686GluTrp: 0.686 ± 0.242
5.011GluTyr: 5.011 ± 0.658
0.0GluXaa: 0.0 ± 0.0
Phe
1.373PheAla: 1.373 ± 0.41
0.48PheCys: 0.48 ± 0.213
3.432PheAsp: 3.432 ± 0.397
4.667PheGlu: 4.667 ± 0.51
1.441PhePhe: 1.441 ± 0.301
2.883PheGly: 2.883 ± 0.315
0.412PheHis: 0.412 ± 0.187
3.638PheIle: 3.638 ± 0.453
3.363PheLys: 3.363 ± 0.452
2.814PheLeu: 2.814 ± 0.367
1.236PheMet: 1.236 ± 0.269
3.295PheAsn: 3.295 ± 0.522
0.892PhePro: 0.892 ± 0.273
1.167PheGln: 1.167 ± 0.261
1.716PheArg: 1.716 ± 0.323
2.196PheSer: 2.196 ± 0.369
2.402PheThr: 2.402 ± 0.325
1.785PheVal: 1.785 ± 0.318
0.275PheTrp: 0.275 ± 0.136
1.304PheTyr: 1.304 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
2.402GlyAla: 2.402 ± 0.464
1.098GlyCys: 1.098 ± 0.241
2.608GlyAsp: 2.608 ± 0.547
4.53GlyGlu: 4.53 ± 0.578
2.814GlyPhe: 2.814 ± 0.428
3.089GlyGly: 3.089 ± 0.56
1.167GlyHis: 1.167 ± 0.334
4.118GlyIle: 4.118 ± 0.502
5.285GlyLys: 5.285 ± 0.627
3.432GlyLeu: 3.432 ± 0.452
1.304GlyMet: 1.304 ± 0.36
4.599GlyAsn: 4.599 ± 0.597
0.48GlyPro: 0.48 ± 0.223
1.441GlyGln: 1.441 ± 0.337
2.059GlyArg: 2.059 ± 0.381
3.363GlySer: 3.363 ± 0.445
2.334GlyThr: 2.334 ± 0.362
4.118GlyVal: 4.118 ± 0.538
0.824GlyTrp: 0.824 ± 0.265
2.814GlyTyr: 2.814 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
0.206HisAla: 0.206 ± 0.114
0.412HisCys: 0.412 ± 0.156
0.412HisAsp: 0.412 ± 0.153
1.03HisGlu: 1.03 ± 0.249
0.755HisPhe: 0.755 ± 0.198
0.48HisGly: 0.48 ± 0.171
0.275HisHis: 0.275 ± 0.12
0.755HisIle: 0.755 ± 0.247
1.304HisLys: 1.304 ± 0.281
0.686HisLeu: 0.686 ± 0.196
0.48HisMet: 0.48 ± 0.156
0.686HisAsn: 0.686 ± 0.277
0.549HisPro: 0.549 ± 0.218
0.206HisGln: 0.206 ± 0.124
0.275HisArg: 0.275 ± 0.142
0.824HisSer: 0.824 ± 0.19
0.618HisThr: 0.618 ± 0.21
0.549HisVal: 0.549 ± 0.184
0.206HisTrp: 0.206 ± 0.106
0.686HisTyr: 0.686 ± 0.206
0.0HisXaa: 0.0 ± 0.0
Ile
5.422IleAla: 5.422 ± 0.761
1.03IleCys: 1.03 ± 0.306
7.07IleAsp: 7.07 ± 0.761
8.443IleGlu: 8.443 ± 0.845
3.432IlePhe: 3.432 ± 0.452
3.981IleGly: 3.981 ± 0.569
1.03IleHis: 1.03 ± 0.261
7.825IleIle: 7.825 ± 0.795
10.09IleLys: 10.09 ± 0.919
7.344IleLeu: 7.344 ± 0.647
1.167IleMet: 1.167 ± 0.303
6.933IleAsn: 6.933 ± 0.739
2.196IlePro: 2.196 ± 0.466
2.746IleGln: 2.746 ± 0.416
3.295IleArg: 3.295 ± 0.505
5.079IleSer: 5.079 ± 0.631
4.53IleThr: 4.53 ± 0.562
5.079IleVal: 5.079 ± 0.735
0.686IleTrp: 0.686 ± 0.243
3.226IleTyr: 3.226 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
4.805LysAla: 4.805 ± 0.556
1.236LysCys: 1.236 ± 0.264
6.658LysAsp: 6.658 ± 0.649
10.296LysGlu: 10.296 ± 0.95
3.157LysPhe: 3.157 ± 0.453
4.873LysGly: 4.873 ± 0.53
1.716LysHis: 1.716 ± 0.384
9.541LysIle: 9.541 ± 0.83
10.845LysLys: 10.845 ± 0.897
9.198LysLeu: 9.198 ± 0.792
2.814LysMet: 2.814 ± 0.405
8.237LysAsn: 8.237 ± 0.648
1.647LysPro: 1.647 ± 0.346
3.707LysGln: 3.707 ± 0.481
4.256LysArg: 4.256 ± 0.487
5.56LysSer: 5.56 ± 0.573
4.805LysThr: 4.805 ± 0.56
7.688LysVal: 7.688 ± 0.788
1.167LysTrp: 1.167 ± 0.265
5.628LysTyr: 5.628 ± 0.607
0.0LysXaa: 0.0 ± 0.0
Leu
4.118LeuAla: 4.118 ± 0.551
0.755LeuCys: 0.755 ± 0.193
5.697LeuAsp: 5.697 ± 0.451
8.511LeuGlu: 8.511 ± 1.081
2.677LeuPhe: 2.677 ± 0.475
5.079LeuGly: 5.079 ± 0.609
1.441LeuHis: 1.441 ± 0.285
7.482LeuIle: 7.482 ± 0.813
9.472LeuLys: 9.472 ± 0.775
5.697LeuLeu: 5.697 ± 0.659
1.236LeuMet: 1.236 ± 0.335
6.315LeuAsn: 6.315 ± 0.751
1.441LeuPro: 1.441 ± 0.374
2.677LeuGln: 2.677 ± 0.404
3.638LeuArg: 3.638 ± 0.473
4.805LeuSer: 4.805 ± 0.589
4.667LeuThr: 4.667 ± 0.532
4.942LeuVal: 4.942 ± 0.435
0.755LeuTrp: 0.755 ± 0.189
2.883LeuTyr: 2.883 ± 0.395
0.0LeuXaa: 0.0 ± 0.0
Met
1.51MetAla: 1.51 ± 0.42
0.0MetCys: 0.0 ± 0.0
1.853MetAsp: 1.853 ± 0.332
1.785MetGlu: 1.785 ± 0.305
0.412MetPhe: 0.412 ± 0.177
0.755MetGly: 0.755 ± 0.278
0.069MetHis: 0.069 ± 0.065
1.373MetIle: 1.373 ± 0.321
2.196MetLys: 2.196 ± 0.377
2.128MetLeu: 2.128 ± 0.359
0.343MetMet: 0.343 ± 0.191
1.785MetAsn: 1.785 ± 0.356
0.48MetPro: 0.48 ± 0.199
0.343MetGln: 0.343 ± 0.174
0.961MetArg: 0.961 ± 0.248
1.441MetSer: 1.441 ± 0.296
1.647MetThr: 1.647 ± 0.244
1.373MetVal: 1.373 ± 0.264
0.069MetTrp: 0.069 ± 0.058
0.755MetTyr: 0.755 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
4.256AsnAla: 4.256 ± 0.619
0.549AsnCys: 0.549 ± 0.208
3.775AsnAsp: 3.775 ± 0.569
4.873AsnGlu: 4.873 ± 0.627
2.746AsnPhe: 2.746 ± 0.421
3.638AsnGly: 3.638 ± 0.464
0.755AsnHis: 0.755 ± 0.228
7.207AsnIle: 7.207 ± 0.77
8.717AsnLys: 8.717 ± 0.743
5.285AsnLeu: 5.285 ± 0.534
1.236AsnMet: 1.236 ± 0.244
6.452AsnAsn: 6.452 ± 0.739
2.059AsnPro: 2.059 ± 0.417
2.128AsnGln: 2.128 ± 0.375
2.814AsnArg: 2.814 ± 0.408
4.873AsnSer: 4.873 ± 0.51
4.256AsnThr: 4.256 ± 0.396
3.707AsnVal: 3.707 ± 0.468
0.686AsnTrp: 0.686 ± 0.181
2.677AsnTyr: 2.677 ± 0.421
0.0AsnXaa: 0.0 ± 0.0
Pro
0.824ProAla: 0.824 ± 0.209
0.48ProCys: 0.48 ± 0.168
1.51ProAsp: 1.51 ± 0.352
1.304ProGlu: 1.304 ± 0.284
0.686ProPhe: 0.686 ± 0.216
0.755ProGly: 0.755 ± 0.229
0.137ProHis: 0.137 ± 0.09
2.059ProIle: 2.059 ± 0.427
2.196ProLys: 2.196 ± 0.453
1.441ProLeu: 1.441 ± 0.314
0.206ProMet: 0.206 ± 0.12
1.373ProAsn: 1.373 ± 0.253
0.412ProPro: 0.412 ± 0.234
0.412ProGln: 0.412 ± 0.146
0.686ProArg: 0.686 ± 0.239
1.167ProSer: 1.167 ± 0.285
1.373ProThr: 1.373 ± 0.336
1.785ProVal: 1.785 ± 0.365
0.275ProTrp: 0.275 ± 0.163
0.824ProTyr: 0.824 ± 0.222
0.0ProXaa: 0.0 ± 0.0
Gln
1.304GlnAla: 1.304 ± 0.254
0.137GlnCys: 0.137 ± 0.096
1.853GlnAsp: 1.853 ± 0.354
2.677GlnGlu: 2.677 ± 0.479
0.824GlnPhe: 0.824 ± 0.23
1.579GlnGly: 1.579 ± 0.301
0.206GlnHis: 0.206 ± 0.121
2.951GlnIle: 2.951 ± 0.41
2.196GlnLys: 2.196 ± 0.378
2.746GlnLeu: 2.746 ± 0.452
0.686GlnMet: 0.686 ± 0.221
2.128GlnAsn: 2.128 ± 0.342
0.618GlnPro: 0.618 ± 0.277
1.167GlnGln: 1.167 ± 0.358
1.098GlnArg: 1.098 ± 0.316
2.059GlnSer: 2.059 ± 0.365
2.059GlnThr: 2.059 ± 0.359
1.373GlnVal: 1.373 ± 0.277
0.206GlnTrp: 0.206 ± 0.116
1.236GlnTyr: 1.236 ± 0.272
0.0GlnXaa: 0.0 ± 0.0
Arg
1.51ArgAla: 1.51 ± 0.454
0.824ArgCys: 0.824 ± 0.262
2.059ArgAsp: 2.059 ± 0.347
3.638ArgGlu: 3.638 ± 0.505
1.922ArgPhe: 1.922 ± 0.363
2.265ArgGly: 2.265 ± 0.342
0.343ArgHis: 0.343 ± 0.147
3.363ArgIle: 3.363 ± 0.439
4.05ArgLys: 4.05 ± 0.597
3.226ArgLeu: 3.226 ± 0.416
1.167ArgMet: 1.167 ± 0.307
1.922ArgAsn: 1.922 ± 0.312
0.618ArgPro: 0.618 ± 0.226
1.098ArgGln: 1.098 ± 0.22
1.03ArgArg: 1.03 ± 0.26
2.196ArgSer: 2.196 ± 0.422
1.716ArgThr: 1.716 ± 0.324
2.54ArgVal: 2.54 ± 0.356
0.343ArgTrp: 0.343 ± 0.158
1.304ArgTyr: 1.304 ± 0.23
0.0ArgXaa: 0.0 ± 0.0
Ser
3.089SerAla: 3.089 ± 0.645
0.618SerCys: 0.618 ± 0.177
2.608SerAsp: 2.608 ± 0.338
3.981SerGlu: 3.981 ± 0.461
3.295SerPhe: 3.295 ± 0.467
3.226SerGly: 3.226 ± 0.409
0.343SerHis: 0.343 ± 0.134
5.697SerIle: 5.697 ± 0.61
7.55SerLys: 7.55 ± 0.78
5.217SerLeu: 5.217 ± 0.508
1.236SerMet: 1.236 ± 0.239
3.912SerAsn: 3.912 ± 0.545
0.824SerPro: 0.824 ± 0.222
1.853SerGln: 1.853 ± 0.266
1.922SerArg: 1.922 ± 0.293
3.157SerSer: 3.157 ± 0.592
3.226SerThr: 3.226 ± 0.51
3.02SerVal: 3.02 ± 0.407
0.48SerTrp: 0.48 ± 0.186
2.402SerTyr: 2.402 ± 0.306
0.0SerXaa: 0.0 ± 0.0
Thr
2.814ThrAla: 2.814 ± 0.459
0.755ThrCys: 0.755 ± 0.247
2.746ThrAsp: 2.746 ± 0.459
3.775ThrGlu: 3.775 ± 0.521
2.608ThrPhe: 2.608 ± 0.398
3.569ThrGly: 3.569 ± 0.483
1.167ThrHis: 1.167 ± 0.263
5.285ThrIle: 5.285 ± 0.605
5.011ThrLys: 5.011 ± 0.558
4.462ThrLeu: 4.462 ± 0.494
0.961ThrMet: 0.961 ± 0.205
3.089ThrAsn: 3.089 ± 0.396
1.922ThrPro: 1.922 ± 0.384
1.922ThrGln: 1.922 ± 0.291
1.647ThrArg: 1.647 ± 0.321
2.677ThrSer: 2.677 ± 0.419
3.02ThrThr: 3.02 ± 0.597
2.951ThrVal: 2.951 ± 0.494
0.48ThrTrp: 0.48 ± 0.18
2.471ThrTyr: 2.471 ± 0.439
0.0ThrXaa: 0.0 ± 0.0
Val
3.775ValAla: 3.775 ± 0.613
0.48ValCys: 0.48 ± 0.157
4.187ValAsp: 4.187 ± 0.434
4.667ValGlu: 4.667 ± 0.525
2.059ValPhe: 2.059 ± 0.318
3.981ValGly: 3.981 ± 0.441
0.686ValHis: 0.686 ± 0.231
3.707ValIle: 3.707 ± 0.557
5.56ValLys: 5.56 ± 0.666
5.903ValLeu: 5.903 ± 0.672
1.03ValMet: 1.03 ± 0.279
4.667ValAsn: 4.667 ± 0.547
1.441ValPro: 1.441 ± 0.288
1.236ValGln: 1.236 ± 0.306
2.265ValArg: 2.265 ± 0.366
3.501ValSer: 3.501 ± 0.563
2.402ValThr: 2.402 ± 0.441
3.501ValVal: 3.501 ± 0.612
0.275ValTrp: 0.275 ± 0.123
2.334ValTyr: 2.334 ± 0.34
0.0ValXaa: 0.0 ± 0.0
Trp
0.412TrpAla: 0.412 ± 0.147
0.206TrpCys: 0.206 ± 0.117
0.618TrpAsp: 0.618 ± 0.187
1.098TrpGlu: 1.098 ± 0.295
0.343TrpPhe: 0.343 ± 0.127
0.618TrpGly: 0.618 ± 0.173
0.069TrpHis: 0.069 ± 0.066
0.686TrpIle: 0.686 ± 0.206
1.167TrpLys: 1.167 ± 0.284
0.961TrpLeu: 0.961 ± 0.272
0.275TrpMet: 0.275 ± 0.14
0.412TrpAsn: 0.412 ± 0.155
0.0TrpPro: 0.0 ± 0.0
0.343TrpGln: 0.343 ± 0.132
0.206TrpArg: 0.206 ± 0.12
0.412TrpSer: 0.412 ± 0.182
0.343TrpThr: 0.343 ± 0.183
0.618TrpVal: 0.618 ± 0.182
0.0TrpTrp: 0.0 ± 0.0
0.48TrpTyr: 0.48 ± 0.251
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.853TyrAla: 1.853 ± 0.315
0.549TyrCys: 0.549 ± 0.175
2.608TyrAsp: 2.608 ± 0.489
3.089TyrGlu: 3.089 ± 0.493
2.265TyrPhe: 2.265 ± 0.367
1.922TyrGly: 1.922 ± 0.324
0.48TyrHis: 0.48 ± 0.147
4.256TyrIle: 4.256 ± 0.558
5.285TyrLys: 5.285 ± 0.617
4.118TyrLeu: 4.118 ± 0.424
0.343TyrMet: 0.343 ± 0.152
2.883TyrAsn: 2.883 ± 0.424
0.824TyrPro: 0.824 ± 0.253
1.167TyrGln: 1.167 ± 0.253
1.647TyrArg: 1.647 ± 0.337
2.608TyrSer: 2.608 ± 0.332
2.883TyrThr: 2.883 ± 0.498
2.334TyrVal: 2.334 ± 0.363
0.275TyrTrp: 0.275 ± 0.126
2.196TyrTyr: 2.196 ± 0.549
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (14570 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski