Amino acid dipepetide frequency for Acinetobacter phage Ab11510-phi

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.762AlaAla: 6.762 ± 1.237
0.52AlaCys: 0.52 ± 0.203
4.421AlaAsp: 4.421 ± 0.528
5.851AlaGlu: 5.851 ± 0.875
2.991AlaPhe: 2.991 ± 0.483
3.901AlaGly: 3.901 ± 0.633
1.69AlaHis: 1.69 ± 0.454
5.136AlaIle: 5.136 ± 0.647
6.437AlaLys: 6.437 ± 0.585
7.087AlaLeu: 7.087 ± 0.694
2.406AlaMet: 2.406 ± 0.343
4.681AlaAsn: 4.681 ± 0.556
1.755AlaPro: 1.755 ± 0.389
3.771AlaGln: 3.771 ± 0.876
4.031AlaArg: 4.031 ± 0.518
4.941AlaSer: 4.941 ± 0.724
5.006AlaThr: 5.006 ± 0.806
4.681AlaVal: 4.681 ± 0.477
0.91AlaTrp: 0.91 ± 0.359
2.601AlaTyr: 2.601 ± 0.391
0.0AlaXaa: 0.0 ± 0.0
Cys
0.715CysAla: 0.715 ± 0.223
0.0CysCys: 0.0 ± 0.0
0.325CysAsp: 0.325 ± 0.137
1.04CysGlu: 1.04 ± 0.248
0.39CysPhe: 0.39 ± 0.148
0.715CysGly: 0.715 ± 0.27
0.26CysHis: 0.26 ± 0.161
0.455CysIle: 0.455 ± 0.144
0.585CysLys: 0.585 ± 0.199
0.585CysLeu: 0.585 ± 0.203
0.195CysMet: 0.195 ± 0.106
0.325CysAsn: 0.325 ± 0.151
0.195CysPro: 0.195 ± 0.103
0.52CysGln: 0.52 ± 0.193
0.52CysArg: 0.52 ± 0.161
0.455CysSer: 0.455 ± 0.204
0.39CysThr: 0.39 ± 0.172
0.455CysVal: 0.455 ± 0.148
0.0CysTrp: 0.0 ± 0.0
0.845CysTyr: 0.845 ± 0.239
0.0CysXaa: 0.0 ± 0.0
Asp
5.006AspAla: 5.006 ± 0.534
0.195AspCys: 0.195 ± 0.107
3.966AspAsp: 3.966 ± 0.637
4.161AspGlu: 4.161 ± 0.543
2.796AspPhe: 2.796 ± 0.447
4.161AspGly: 4.161 ± 0.698
0.65AspHis: 0.65 ± 0.229
3.771AspIle: 3.771 ± 0.543
4.616AspLys: 4.616 ± 0.599
5.916AspLeu: 5.916 ± 0.613
1.43AspMet: 1.43 ± 0.306
2.471AspAsn: 2.471 ± 0.259
1.95AspPro: 1.95 ± 0.418
2.536AspGln: 2.536 ± 0.4
2.276AspArg: 2.276 ± 0.382
2.926AspSer: 2.926 ± 0.49
2.406AspThr: 2.406 ± 0.377
3.706AspVal: 3.706 ± 0.577
0.65AspTrp: 0.65 ± 0.191
2.926AspTyr: 2.926 ± 0.423
0.0AspXaa: 0.0 ± 0.0
Glu
6.111GluAla: 6.111 ± 0.71
0.65GluCys: 0.65 ± 0.221
3.121GluAsp: 3.121 ± 0.502
5.266GluGlu: 5.266 ± 0.672
3.641GluPhe: 3.641 ± 0.363
3.511GluGly: 3.511 ± 0.379
1.365GluHis: 1.365 ± 0.321
5.396GluIle: 5.396 ± 0.385
6.111GluLys: 6.111 ± 0.687
7.217GluLeu: 7.217 ± 0.699
2.211GluMet: 2.211 ± 0.46
3.771GluAsn: 3.771 ± 0.519
1.625GluPro: 1.625 ± 0.326
3.901GluGln: 3.901 ± 0.582
3.251GluArg: 3.251 ± 0.476
4.161GluSer: 4.161 ± 0.575
3.186GluThr: 3.186 ± 0.406
5.396GluVal: 5.396 ± 0.548
1.56GluTrp: 1.56 ± 0.447
2.991GluTyr: 2.991 ± 0.434
0.0GluXaa: 0.0 ± 0.0
Phe
3.446PheAla: 3.446 ± 0.386
0.52PheCys: 0.52 ± 0.203
2.471PheAsp: 2.471 ± 0.406
3.771PheGlu: 3.771 ± 0.486
1.755PhePhe: 1.755 ± 0.408
2.406PheGly: 2.406 ± 0.43
0.455PheHis: 0.455 ± 0.153
3.121PheIle: 3.121 ± 0.425
3.056PheLys: 3.056 ± 0.473
3.251PheLeu: 3.251 ± 0.647
0.52PheMet: 0.52 ± 0.191
2.926PheAsn: 2.926 ± 0.412
1.43PhePro: 1.43 ± 0.289
1.69PheGln: 1.69 ± 0.313
2.08PheArg: 2.08 ± 0.268
1.95PheSer: 1.95 ± 0.38
1.69PheThr: 1.69 ± 0.292
2.276PheVal: 2.276 ± 0.423
0.455PheTrp: 0.455 ± 0.162
1.3PheTyr: 1.3 ± 0.214
0.0PheXaa: 0.0 ± 0.0
Gly
4.811GlyAla: 4.811 ± 0.922
0.78GlyCys: 0.78 ± 0.228
2.991GlyAsp: 2.991 ± 0.455
4.161GlyGlu: 4.161 ± 0.469
3.446GlyPhe: 3.446 ± 0.666
5.786GlyGly: 5.786 ± 0.772
1.235GlyHis: 1.235 ± 0.323
3.836GlyIle: 3.836 ± 0.49
4.356GlyLys: 4.356 ± 0.475
5.526GlyLeu: 5.526 ± 0.612
2.08GlyMet: 2.08 ± 0.404
2.341GlyAsn: 2.341 ± 0.353
1.04GlyPro: 1.04 ± 0.294
2.471GlyGln: 2.471 ± 0.392
2.926GlyArg: 2.926 ± 0.559
3.511GlySer: 3.511 ± 0.521
3.706GlyThr: 3.706 ± 0.635
3.381GlyVal: 3.381 ± 0.411
1.3GlyTrp: 1.3 ± 0.272
2.146GlyTyr: 2.146 ± 0.388
0.0GlyXaa: 0.0 ± 0.0
His
1.365HisAla: 1.365 ± 0.301
0.26HisCys: 0.26 ± 0.127
0.91HisAsp: 0.91 ± 0.225
1.625HisGlu: 1.625 ± 0.393
1.04HisPhe: 1.04 ± 0.263
0.845HisGly: 0.845 ± 0.219
0.325HisHis: 0.325 ± 0.144
1.17HisIle: 1.17 ± 0.301
1.105HisLys: 1.105 ± 0.255
2.471HisLeu: 2.471 ± 0.565
0.455HisMet: 0.455 ± 0.154
0.585HisAsn: 0.585 ± 0.2
0.715HisPro: 0.715 ± 0.22
0.585HisGln: 0.585 ± 0.178
0.65HisArg: 0.65 ± 0.223
0.845HisSer: 0.845 ± 0.246
0.52HisThr: 0.52 ± 0.208
1.235HisVal: 1.235 ± 0.301
0.065HisTrp: 0.065 ± 0.069
0.845HisTyr: 0.845 ± 0.251
0.0HisXaa: 0.0 ± 0.0
Ile
5.396IleAla: 5.396 ± 0.738
0.52IleCys: 0.52 ± 0.164
4.226IleAsp: 4.226 ± 0.569
5.006IleGlu: 5.006 ± 0.525
1.235IlePhe: 1.235 ± 0.259
4.096IleGly: 4.096 ± 0.526
0.715IleHis: 0.715 ± 0.203
3.446IleIle: 3.446 ± 0.562
4.746IleLys: 4.746 ± 0.607
4.486IleLeu: 4.486 ± 0.566
1.105IleMet: 1.105 ± 0.235
3.186IleAsn: 3.186 ± 0.488
2.471IlePro: 2.471 ± 0.331
3.056IleGln: 3.056 ± 0.524
2.471IleArg: 2.471 ± 0.481
4.226IleSer: 4.226 ± 0.486
3.251IleThr: 3.251 ± 0.386
4.226IleVal: 4.226 ± 0.452
0.78IleTrp: 0.78 ± 0.25
3.251IleTyr: 3.251 ± 0.588
0.0IleXaa: 0.0 ± 0.0
Lys
7.217LysAla: 7.217 ± 1.08
0.585LysCys: 0.585 ± 0.26
4.031LysAsp: 4.031 ± 0.505
6.241LysGlu: 6.241 ± 0.73
2.406LysPhe: 2.406 ± 0.347
4.226LysGly: 4.226 ± 0.481
1.3LysHis: 1.3 ± 0.284
4.486LysIle: 4.486 ± 0.518
6.762LysLys: 6.762 ± 1.013
7.997LysLeu: 7.997 ± 0.756
1.95LysMet: 1.95 ± 0.354
3.576LysAsn: 3.576 ± 0.396
2.796LysPro: 2.796 ± 0.535
4.096LysGln: 4.096 ± 0.847
4.356LysArg: 4.356 ± 0.519
4.421LysSer: 4.421 ± 0.489
3.836LysThr: 3.836 ± 0.411
4.811LysVal: 4.811 ± 0.509
0.975LysTrp: 0.975 ± 0.272
2.796LysTyr: 2.796 ± 0.387
0.0LysXaa: 0.0 ± 0.0
Leu
5.981LeuAla: 5.981 ± 0.701
1.365LeuCys: 1.365 ± 0.287
5.591LeuAsp: 5.591 ± 0.625
7.152LeuGlu: 7.152 ± 0.857
3.316LeuPhe: 3.316 ± 0.517
5.396LeuGly: 5.396 ± 0.641
1.235LeuHis: 1.235 ± 0.328
4.746LeuIle: 4.746 ± 0.603
7.932LeuLys: 7.932 ± 0.846
5.851LeuLeu: 5.851 ± 0.692
1.885LeuMet: 1.885 ± 0.327
6.371LeuAsn: 6.371 ± 0.898
2.991LeuPro: 2.991 ± 0.548
3.771LeuGln: 3.771 ± 0.426
3.771LeuArg: 3.771 ± 0.521
7.152LeuSer: 7.152 ± 0.626
5.136LeuThr: 5.136 ± 0.698
5.331LeuVal: 5.331 ± 0.487
0.585LeuTrp: 0.585 ± 0.185
2.601LeuTyr: 2.601 ± 0.491
0.0LeuXaa: 0.0 ± 0.0
Met
2.015MetAla: 2.015 ± 0.324
0.13MetCys: 0.13 ± 0.089
1.495MetAsp: 1.495 ± 0.257
1.43MetGlu: 1.43 ± 0.309
0.975MetPhe: 0.975 ± 0.223
1.235MetGly: 1.235 ± 0.223
0.195MetHis: 0.195 ± 0.104
1.3MetIle: 1.3 ± 0.239
2.08MetLys: 2.08 ± 0.395
2.666MetLeu: 2.666 ± 0.524
0.585MetMet: 0.585 ± 0.202
2.08MetAsn: 2.08 ± 0.439
1.43MetPro: 1.43 ± 0.29
1.235MetGln: 1.235 ± 0.238
0.715MetArg: 0.715 ± 0.242
2.08MetSer: 2.08 ± 0.4
1.625MetThr: 1.625 ± 0.293
1.3MetVal: 1.3 ± 0.311
0.195MetTrp: 0.195 ± 0.12
0.65MetTyr: 0.65 ± 0.231
0.0MetXaa: 0.0 ± 0.0
Asn
4.291AsnAla: 4.291 ± 0.595
0.65AsnCys: 0.65 ± 0.244
3.771AsnAsp: 3.771 ± 0.614
3.836AsnGlu: 3.836 ± 0.425
2.211AsnPhe: 2.211 ± 0.331
4.291AsnGly: 4.291 ± 0.538
1.105AsnHis: 1.105 ± 0.324
2.601AsnIle: 2.601 ± 0.403
3.771AsnLys: 3.771 ± 0.425
5.266AsnLeu: 5.266 ± 0.699
1.56AsnMet: 1.56 ± 0.361
2.991AsnAsn: 2.991 ± 0.571
2.666AsnPro: 2.666 ± 0.433
2.796AsnGln: 2.796 ± 0.476
1.885AsnArg: 1.885 ± 0.418
3.966AsnSer: 3.966 ± 0.476
2.666AsnThr: 2.666 ± 0.364
3.056AsnVal: 3.056 ± 0.464
1.235AsnTrp: 1.235 ± 0.266
1.82AsnTyr: 1.82 ± 0.388
0.0AsnXaa: 0.0 ± 0.0
Pro
2.926ProAla: 2.926 ± 0.441
0.195ProCys: 0.195 ± 0.108
2.406ProAsp: 2.406 ± 0.519
2.731ProGlu: 2.731 ± 0.433
1.17ProPhe: 1.17 ± 0.321
1.235ProGly: 1.235 ± 0.452
0.65ProHis: 0.65 ± 0.198
2.406ProIle: 2.406 ± 0.411
2.406ProLys: 2.406 ± 0.417
2.991ProLeu: 2.991 ± 0.435
1.04ProMet: 1.04 ± 0.256
2.211ProAsn: 2.211 ± 0.463
1.3ProPro: 1.3 ± 0.323
1.235ProGln: 1.235 ± 0.315
0.975ProArg: 0.975 ± 0.225
1.82ProSer: 1.82 ± 0.362
2.276ProThr: 2.276 ± 0.487
2.406ProVal: 2.406 ± 0.389
0.065ProTrp: 0.065 ± 0.058
1.105ProTyr: 1.105 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
4.356GlnAla: 4.356 ± 0.79
0.26GlnCys: 0.26 ± 0.134
2.991GlnAsp: 2.991 ± 0.65
3.316GlnGlu: 3.316 ± 0.56
1.43GlnPhe: 1.43 ± 0.271
2.926GlnGly: 2.926 ± 0.445
1.235GlnHis: 1.235 ± 0.251
2.406GlnIle: 2.406 ± 0.39
3.576GlnLys: 3.576 ± 0.494
3.446GlnLeu: 3.446 ± 0.413
1.235GlnMet: 1.235 ± 0.309
2.601GlnAsn: 2.601 ± 0.504
0.91GlnPro: 0.91 ± 0.244
1.885GlnGln: 1.885 ± 0.447
1.755GlnArg: 1.755 ± 0.266
2.146GlnSer: 2.146 ± 0.616
2.666GlnThr: 2.666 ± 0.452
3.121GlnVal: 3.121 ± 0.549
0.715GlnTrp: 0.715 ± 0.259
1.885GlnTyr: 1.885 ± 0.363
0.0GlnXaa: 0.0 ± 0.0
Arg
3.186ArgAla: 3.186 ± 0.37
0.585ArgCys: 0.585 ± 0.199
2.341ArgAsp: 2.341 ± 0.362
2.991ArgGlu: 2.991 ± 0.363
1.495ArgPhe: 1.495 ± 0.295
2.08ArgGly: 2.08 ± 0.414
0.91ArgHis: 0.91 ± 0.209
3.056ArgIle: 3.056 ± 0.41
3.446ArgLys: 3.446 ± 0.369
4.681ArgLeu: 4.681 ± 0.652
1.56ArgMet: 1.56 ± 0.259
2.536ArgAsn: 2.536 ± 0.479
1.885ArgPro: 1.885 ± 0.373
1.95ArgGln: 1.95 ± 0.394
2.276ArgArg: 2.276 ± 0.363
2.796ArgSer: 2.796 ± 0.444
2.08ArgThr: 2.08 ± 0.463
2.991ArgVal: 2.991 ± 0.365
0.26ArgTrp: 0.26 ± 0.123
1.755ArgTyr: 1.755 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
3.641SerAla: 3.641 ± 0.842
0.195SerCys: 0.195 ± 0.11
3.641SerAsp: 3.641 ± 0.479
4.746SerGlu: 4.746 ± 0.508
3.251SerPhe: 3.251 ± 0.539
4.616SerGly: 4.616 ± 0.497
1.365SerHis: 1.365 ± 0.307
3.771SerIle: 3.771 ± 0.539
5.526SerLys: 5.526 ± 0.708
4.616SerLeu: 4.616 ± 0.698
1.365SerMet: 1.365 ± 0.317
3.511SerAsn: 3.511 ± 0.508
2.146SerPro: 2.146 ± 0.421
2.471SerGln: 2.471 ± 0.454
2.731SerArg: 2.731 ± 0.318
4.746SerSer: 4.746 ± 0.588
2.796SerThr: 2.796 ± 0.451
4.226SerVal: 4.226 ± 0.584
0.52SerTrp: 0.52 ± 0.168
1.495SerTyr: 1.495 ± 0.346
0.0SerXaa: 0.0 ± 0.0
Thr
3.966ThrAla: 3.966 ± 0.501
0.39ThrCys: 0.39 ± 0.148
2.926ThrAsp: 2.926 ± 0.513
3.446ThrGlu: 3.446 ± 0.32
2.276ThrPhe: 2.276 ± 0.412
3.576ThrGly: 3.576 ± 0.379
0.585ThrHis: 0.585 ± 0.169
3.901ThrIle: 3.901 ± 0.482
4.291ThrLys: 4.291 ± 0.55
4.096ThrLeu: 4.096 ± 0.444
0.585ThrMet: 0.585 ± 0.155
3.186ThrAsn: 3.186 ± 0.728
2.731ThrPro: 2.731 ± 0.489
2.146ThrGln: 2.146 ± 0.577
2.276ThrArg: 2.276 ± 0.386
2.536ThrSer: 2.536 ± 0.373
3.706ThrThr: 3.706 ± 0.756
3.251ThrVal: 3.251 ± 0.426
0.195ThrTrp: 0.195 ± 0.108
2.211ThrTyr: 2.211 ± 0.371
0.0ThrXaa: 0.0 ± 0.0
Val
5.656ValAla: 5.656 ± 0.587
0.52ValCys: 0.52 ± 0.173
4.031ValAsp: 4.031 ± 0.59
4.421ValGlu: 4.421 ± 0.483
2.406ValPhe: 2.406 ± 0.424
3.966ValGly: 3.966 ± 0.414
0.845ValHis: 0.845 ± 0.289
3.446ValIle: 3.446 ± 0.55
3.966ValLys: 3.966 ± 0.512
5.266ValLeu: 5.266 ± 0.618
2.015ValMet: 2.015 ± 0.347
4.421ValAsn: 4.421 ± 0.63
2.015ValPro: 2.015 ± 0.348
2.536ValGln: 2.536 ± 0.557
3.251ValArg: 3.251 ± 0.481
3.966ValSer: 3.966 ± 0.413
3.186ValThr: 3.186 ± 0.456
4.161ValVal: 4.161 ± 0.652
0.585ValTrp: 0.585 ± 0.197
1.495ValTyr: 1.495 ± 0.307
0.0ValXaa: 0.0 ± 0.0
Trp
0.845TrpAla: 0.845 ± 0.281
0.39TrpCys: 0.39 ± 0.173
0.585TrpAsp: 0.585 ± 0.15
0.78TrpGlu: 0.78 ± 0.225
0.65TrpPhe: 0.65 ± 0.206
0.715TrpGly: 0.715 ± 0.247
0.52TrpHis: 0.52 ± 0.241
1.235TrpIle: 1.235 ± 0.292
0.65TrpLys: 0.65 ± 0.23
1.3TrpLeu: 1.3 ± 0.257
0.52TrpMet: 0.52 ± 0.185
0.715TrpAsn: 0.715 ± 0.232
0.39TrpPro: 0.39 ± 0.161
0.91TrpGln: 0.91 ± 0.253
0.325TrpArg: 0.325 ± 0.148
0.26TrpSer: 0.26 ± 0.118
0.26TrpThr: 0.26 ± 0.115
0.325TrpVal: 0.325 ± 0.132
0.195TrpTrp: 0.195 ± 0.112
0.325TrpTyr: 0.325 ± 0.129
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.885TyrAla: 1.885 ± 0.3
0.26TyrCys: 0.26 ± 0.131
2.276TyrAsp: 2.276 ± 0.354
2.471TyrGlu: 2.471 ± 0.391
1.69TyrPhe: 1.69 ± 0.397
2.146TyrGly: 2.146 ± 0.493
0.975TyrHis: 0.975 ± 0.274
2.211TyrIle: 2.211 ± 0.41
3.446TyrLys: 3.446 ± 0.385
3.446TyrLeu: 3.446 ± 0.564
0.65TyrMet: 0.65 ± 0.216
1.95TyrAsn: 1.95 ± 0.342
1.105TyrPro: 1.105 ± 0.265
1.3TyrGln: 1.3 ± 0.273
2.341TyrArg: 2.341 ± 0.514
2.536TyrSer: 2.536 ± 0.309
1.885TyrThr: 1.885 ± 0.281
1.82TyrVal: 1.82 ± 0.37
0.65TyrTrp: 0.65 ± 0.232
1.69TyrTyr: 1.69 ± 0.389
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (15382 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski