Amino acid dipeptide frequency for Agrobacterium phage Atu_ph02

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with every amino acid equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
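The counting behind such a table can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names, the toy sequences, and the choice to count overlapping pairs within each protein and normalise by the total number of pairs are all assumptions made here.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted within each protein only and
    normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the Atu_ph02 proteome.
toy_proteins = ["MAAAGK", "MAXGA"]
freqs = dipeptide_per_mille(toy_proteins)
print(len(freqs), classify(freqs["AA"]), classify(freqs["WW"]))
```

Applied to values from the table, classify(13.64) labels AlaAla as overrepresented and classify(0.071) labels TrpTrp as underrepresented, which corresponds to the red and blue marking described above.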

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
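As a quick worked example of the unit conversion, the snippet below converts the AlaAla value from the table into a fraction and a percentage. The helper names are hypothetical and exist only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 13.64  # AlaAla value from the table, in per mille
print(f"fraction: {per_mille_to_fraction(ala_ala):.5f}")  # fraction: 0.01364
print(f"percent:  {per_mille_to_percent(ala_ala):.3f}%")  # percent:  1.364%
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.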

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.64 ± 2.389
AlaCys: 0.714 ± 0.262
AlaAsp: 5.642 ± 0.635
AlaGlu: 6.784 ± 0.684
AlaPhe: 3.856 ± 0.613
AlaGly: 8.641 ± 0.965
AlaHis: 0.857 ± 0.227
AlaIle: 4.213 ± 0.587
AlaLys: 5.999 ± 0.755
AlaLeu: 7.356 ± 0.704
AlaMet: 2.785 ± 0.424
AlaAsn: 4.713 ± 0.686
AlaPro: 4.071 ± 0.666
AlaGln: 4.928 ± 0.675
AlaArg: 5.784 ± 0.77
AlaSer: 6.427 ± 1.127
AlaThr: 5.642 ± 0.748
AlaVal: 7.57 ± 0.772
AlaTrp: 1.5 ± 0.246
AlaTyr: 3.428 ± 0.559
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.214 ± 0.124
CysCys: 0.143 ± 0.1
CysAsp: 0.214 ± 0.123
CysGlu: 0.357 ± 0.193
CysPhe: 0.071 ± 0.092
CysGly: 0.5 ± 0.203
CysHis: 0.143 ± 0.113
CysIle: 0.357 ± 0.162
CysLys: 0.643 ± 0.239
CysLeu: 0.428 ± 0.186
CysMet: 0.286 ± 0.145
CysAsn: 0.214 ± 0.125
CysPro: 0.214 ± 0.161
CysGln: 0.214 ± 0.142
CysArg: 0.428 ± 0.197
CysSer: 0.428 ± 0.196
CysThr: 0.571 ± 0.197
CysVal: 0.571 ± 0.298
CysTrp: 0.071 ± 0.08
CysTyr: 0.143 ± 0.113
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.284 ± 0.665
AspCys: 0.428 ± 0.227
AspAsp: 4.356 ± 0.549
AspGlu: 3.642 ± 0.452
AspPhe: 2.499 ± 0.433
AspGly: 5.57 ± 0.69
AspHis: 1.285 ± 0.3
AspIle: 3.214 ± 0.364
AspLys: 3.571 ± 0.506
AspLeu: 6.641 ± 0.669
AspMet: 1.571 ± 0.31
AspAsn: 2.214 ± 0.39
AspPro: 3.713 ± 0.615
AspGln: 2.214 ± 0.306
AspArg: 3.856 ± 0.556
AspSer: 1.714 ± 0.373
AspThr: 3.428 ± 0.538
AspVal: 5.142 ± 0.628
AspTrp: 1.357 ± 0.304
AspTyr: 2.357 ± 0.424
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.356 ± 0.716
GluCys: 0.357 ± 0.151
GluAsp: 4.785 ± 0.599
GluGlu: 2.928 ± 0.605
GluPhe: 2.928 ± 0.471
GluGly: 3.285 ± 0.475
GluHis: 0.857 ± 0.286
GluIle: 2.499 ± 0.388
GluLys: 2.714 ± 0.523
GluLeu: 5.427 ± 0.693
GluMet: 1.5 ± 0.378
GluAsn: 1.714 ± 0.347
GluPro: 2.071 ± 0.361
GluGln: 2.571 ± 0.515
GluArg: 4.213 ± 0.527
GluSer: 3.214 ± 0.466
GluThr: 3.356 ± 0.521
GluVal: 3.928 ± 0.731
GluTrp: 1.214 ± 0.315
GluTyr: 2.357 ± 0.39
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.428 ± 0.329
PheCys: 0.357 ± 0.154
PheAsp: 2.857 ± 0.413
PheGlu: 2.571 ± 0.361
PhePhe: 0.857 ± 0.268
PheGly: 2.071 ± 0.333
PheHis: 1.285 ± 0.355
PheIle: 2.428 ± 0.319
PheLys: 1.643 ± 0.389
PheLeu: 2.714 ± 0.303
PheMet: 0.928 ± 0.209
PheAsn: 1.714 ± 0.328
PhePro: 0.857 ± 0.225
PheGln: 1.5 ± 0.267
PheArg: 2.0 ± 0.595
PheSer: 1.714 ± 0.353
PheThr: 1.643 ± 0.336
PheVal: 2.285 ± 0.357
PheTrp: 0.571 ± 0.179
PheTyr: 1.285 ± 0.247
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.999 ± 0.929
GlyCys: 0.357 ± 0.157
GlyAsp: 4.856 ± 0.539
GlyGlu: 3.785 ± 0.56
GlyPhe: 2.357 ± 0.372
GlyGly: 5.57 ± 0.898
GlyHis: 1.143 ± 0.261
GlyIle: 3.856 ± 0.458
GlyLys: 5.856 ± 0.749
GlyLeu: 6.641 ± 0.691
GlyMet: 2.642 ± 0.412
GlyAsn: 3.285 ± 0.508
GlyPro: 2.285 ± 0.343
GlyGln: 3.071 ± 0.488
GlyArg: 5.427 ± 0.659
GlySer: 4.928 ± 0.652
GlyThr: 4.999 ± 0.554
GlyVal: 5.356 ± 0.556
GlyTrp: 1.214 ± 0.269
GlyTyr: 2.428 ± 0.447
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.071 ± 0.481
HisCys: 0.214 ± 0.137
HisAsp: 1.643 ± 0.364
HisGlu: 1.0 ± 0.323
HisPhe: 0.571 ± 0.182
HisGly: 1.714 ± 0.387
HisHis: 0.643 ± 0.299
HisIle: 1.357 ± 0.34
HisLys: 0.786 ± 0.214
HisLeu: 1.571 ± 0.369
HisMet: 0.571 ± 0.253
HisAsn: 0.928 ± 0.23
HisPro: 0.857 ± 0.33
HisGln: 0.571 ± 0.263
HisArg: 0.5 ± 0.194
HisSer: 0.928 ± 0.274
HisThr: 1.285 ± 0.295
HisVal: 1.214 ± 0.31
HisTrp: 0.714 ± 0.232
HisTyr: 0.357 ± 0.119
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.499 ± 0.548
IleCys: 0.286 ± 0.182
IleAsp: 4.928 ± 0.611
IleGlu: 3.642 ± 0.599
IlePhe: 1.143 ± 0.268
IleGly: 3.928 ± 0.432
IleHis: 1.143 ± 0.268
IleIle: 2.428 ± 0.447
IleLys: 3.713 ± 0.591
IleLeu: 2.071 ± 0.327
IleMet: 1.428 ± 0.296
IleAsn: 2.071 ± 0.375
IlePro: 2.214 ± 0.398
IleGln: 1.928 ± 0.374
IleArg: 2.714 ± 0.35
IleSer: 2.214 ± 0.308
IleThr: 4.356 ± 0.678
IleVal: 3.642 ± 0.522
IleTrp: 0.714 ± 0.193
IleTyr: 1.571 ± 0.33
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.07 ± 0.783
LysCys: 0.286 ± 0.186
LysAsp: 4.213 ± 0.603
LysGlu: 4.428 ± 0.543
LysPhe: 1.857 ± 0.261
LysGly: 4.213 ± 0.545
LysHis: 1.357 ± 0.324
LysIle: 2.499 ± 0.403
LysLys: 2.714 ± 0.485
LysLeu: 4.356 ± 0.576
LysMet: 1.285 ± 0.287
LysAsn: 2.0 ± 0.391
LysPro: 2.928 ± 0.425
LysGln: 2.428 ± 0.582
LysArg: 4.071 ± 0.581
LysSer: 2.357 ± 0.329
LysThr: 3.071 ± 0.453
LysVal: 3.642 ± 0.439
LysTrp: 1.071 ± 0.22
LysTyr: 1.357 ± 0.328
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.284 ± 0.939
LeuCys: 0.5 ± 0.207
LeuAsp: 4.999 ± 0.456
LeuGlu: 4.356 ± 0.606
LeuPhe: 1.571 ± 0.314
LeuGly: 5.642 ± 0.653
LeuHis: 1.571 ± 0.335
LeuIle: 4.213 ± 0.624
LeuLys: 4.213 ± 0.742
LeuLeu: 5.499 ± 0.707
LeuMet: 2.214 ± 0.476
LeuAsn: 3.499 ± 0.432
LeuPro: 4.356 ± 0.471
LeuGln: 3.285 ± 0.555
LeuArg: 4.428 ± 0.478
LeuSer: 5.07 ± 0.482
LeuThr: 4.57 ± 0.546
LeuVal: 5.356 ± 0.588
LeuTrp: 1.071 ± 0.271
LeuTyr: 2.714 ± 0.483
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.499 ± 0.473
MetCys: 0.428 ± 0.228
MetAsp: 1.357 ± 0.304
MetGlu: 1.571 ± 0.413
MetPhe: 1.0 ± 0.269
MetGly: 2.071 ± 0.313
MetHis: 0.643 ± 0.258
MetIle: 1.5 ± 0.361
MetLys: 0.857 ± 0.254
MetLeu: 2.499 ± 0.481
MetMet: 0.643 ± 0.222
MetAsn: 1.143 ± 0.26
MetPro: 0.786 ± 0.25
MetGln: 1.857 ± 0.422
MetArg: 2.285 ± 0.537
MetSer: 1.357 ± 0.347
MetThr: 1.857 ± 0.341
MetVal: 1.428 ± 0.378
MetTrp: 0.214 ± 0.107
MetTyr: 0.714 ± 0.213
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.999 ± 0.606
AsnCys: 0.286 ± 0.13
AsnAsp: 2.071 ± 0.414
AsnGlu: 1.571 ± 0.412
AsnPhe: 1.071 ± 0.248
AsnGly: 3.499 ± 0.533
AsnHis: 1.071 ± 0.354
AsnIle: 2.071 ± 0.446
AsnLys: 2.285 ± 0.415
AsnLeu: 2.714 ± 0.383
AsnMet: 1.214 ± 0.277
AsnAsn: 1.785 ± 0.385
AsnPro: 2.785 ± 0.373
AsnGln: 1.428 ± 0.344
AsnArg: 2.785 ± 0.553
AsnSer: 1.928 ± 0.333
AsnThr: 2.857 ± 0.472
AsnVal: 2.928 ± 0.413
AsnTrp: 0.643 ± 0.194
AsnTyr: 1.571 ± 0.391
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.642 ± 0.777
ProCys: 0.214 ± 0.12
ProAsp: 3.785 ± 0.443
ProGlu: 3.142 ± 0.621
ProPhe: 1.785 ± 0.473
ProGly: 3.142 ± 0.472
ProHis: 0.643 ± 0.201
ProIle: 1.928 ± 0.371
ProLys: 2.214 ± 0.515
ProLeu: 2.642 ± 0.401
ProMet: 1.143 ± 0.388
ProAsn: 1.785 ± 0.34
ProPro: 2.428 ± 0.51
ProGln: 2.642 ± 0.382
ProArg: 1.571 ± 0.312
ProSer: 2.285 ± 0.413
ProThr: 3.214 ± 0.436
ProVal: 3.428 ± 0.569
ProTrp: 0.5 ± 0.232
ProTyr: 1.143 ± 0.293
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.856 ± 0.831
GlnCys: 0.143 ± 0.09
GlnAsp: 2.428 ± 0.395
GlnGlu: 2.142 ± 0.333
GlnPhe: 1.643 ± 0.286
GlnGly: 3.999 ± 0.561
GlnHis: 1.428 ± 0.305
GlnIle: 1.785 ± 0.278
GlnLys: 2.857 ± 0.473
GlnLeu: 3.642 ± 0.519
GlnMet: 1.428 ± 0.285
GlnAsn: 1.214 ± 0.314
GlnPro: 1.643 ± 0.389
GlnGln: 2.142 ± 0.564
GlnArg: 3.499 ± 0.469
GlnSer: 2.642 ± 0.484
GlnThr: 2.071 ± 0.529
GlnVal: 2.642 ± 0.438
GlnTrp: 1.143 ± 0.251
GlnTyr: 1.5 ± 0.259
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.285 ± 0.609
ArgCys: 0.143 ± 0.106
ArgAsp: 3.785 ± 0.517
ArgGlu: 3.356 ± 0.421
ArgPhe: 2.571 ± 0.461
ArgGly: 4.428 ± 0.52
ArgHis: 1.071 ± 0.32
ArgIle: 3.571 ± 0.55
ArgLys: 3.713 ± 0.583
ArgLeu: 4.785 ± 0.619
ArgMet: 2.071 ± 0.297
ArgAsn: 2.499 ± 0.444
ArgPro: 2.285 ± 0.524
ArgGln: 3.499 ± 0.509
ArgArg: 3.928 ± 0.6
ArgSer: 2.642 ± 0.345
ArgThr: 3.142 ± 0.394
ArgVal: 4.142 ± 0.615
ArgTrp: 1.214 ± 0.267
ArgTyr: 2.499 ± 0.358
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.284 ± 1.071
SerCys: 0.357 ± 0.181
SerAsp: 2.357 ± 0.261
SerGlu: 2.142 ± 0.367
SerPhe: 1.714 ± 0.387
SerGly: 4.713 ± 0.744
SerHis: 0.714 ± 0.193
SerIle: 3.214 ± 0.654
SerLys: 2.857 ± 0.32
SerLeu: 4.856 ± 0.823
SerMet: 0.857 ± 0.216
SerAsn: 3.285 ± 0.561
SerPro: 2.428 ± 0.429
SerGln: 2.142 ± 0.284
SerArg: 2.499 ± 0.52
SerSer: 3.285 ± 0.583
SerThr: 2.999 ± 0.505
SerVal: 3.571 ± 0.463
SerTrp: 1.071 ± 0.267
SerTyr: 1.643 ± 0.313
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.713 ± 0.794
ThrCys: 0.286 ± 0.192
ThrAsp: 4.213 ± 0.664
ThrGlu: 3.285 ± 0.506
ThrPhe: 2.071 ± 0.436
ThrGly: 5.713 ± 0.682
ThrHis: 1.071 ± 0.298
ThrIle: 3.499 ± 0.456
ThrLys: 3.499 ± 0.739
ThrLeu: 3.428 ± 0.577
ThrMet: 2.071 ± 0.386
ThrAsn: 2.714 ± 0.609
ThrPro: 2.714 ± 0.39
ThrGln: 2.642 ± 0.481
ThrArg: 2.642 ± 0.365
ThrSer: 3.285 ± 0.546
ThrThr: 3.428 ± 0.555
ThrVal: 4.142 ± 0.684
ThrTrp: 0.928 ± 0.266
ThrTyr: 1.714 ± 0.343
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.356 ± 0.639
ValCys: 0.428 ± 0.207
ValAsp: 4.785 ± 0.508
ValGlu: 4.57 ± 0.944
ValPhe: 1.928 ± 0.377
ValGly: 4.713 ± 0.638
ValHis: 1.5 ± 0.332
ValIle: 3.571 ± 0.615
ValLys: 3.285 ± 0.835
ValLeu: 6.213 ± 0.615
ValMet: 1.785 ± 0.349
ValAsn: 2.857 ± 0.446
ValPro: 3.571 ± 0.409
ValGln: 3.356 ± 0.61
ValArg: 4.785 ± 0.517
ValSer: 3.642 ± 0.49
ValThr: 3.499 ± 0.427
ValVal: 5.213 ± 0.683
ValTrp: 1.428 ± 0.438
ValTyr: 1.928 ± 0.315
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.5 ± 0.395
TrpCys: 0.071 ± 0.08
TrpAsp: 0.643 ± 0.193
TrpGlu: 1.143 ± 0.193
TrpPhe: 1.357 ± 0.284
TrpGly: 0.786 ± 0.159
TrpHis: 0.5 ± 0.18
TrpIle: 0.5 ± 0.153
TrpLys: 1.0 ± 0.226
TrpLeu: 2.071 ± 0.348
TrpMet: 0.5 ± 0.224
TrpAsn: 0.571 ± 0.27
TrpPro: 0.5 ± 0.236
TrpGln: 1.143 ± 0.283
TrpArg: 1.143 ± 0.283
TrpSer: 0.857 ± 0.252
TrpThr: 0.928 ± 0.27
TrpVal: 1.357 ± 0.381
TrpTrp: 0.071 ± 0.067
TrpTyr: 0.857 ± 0.245
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.571 ± 0.419
TyrCys: 0.214 ± 0.139
TyrAsp: 2.142 ± 0.429
TyrGlu: 1.857 ± 0.373
TyrPhe: 0.857 ± 0.221
TyrGly: 2.857 ± 0.457
TyrHis: 0.571 ± 0.233
TyrIle: 1.928 ± 0.376
TyrLys: 1.785 ± 0.282
TyrLeu: 1.857 ± 0.464
TyrMet: 0.714 ± 0.221
TyrAsn: 1.357 ± 0.369
TyrPro: 1.571 ± 0.372
TyrGln: 1.285 ± 0.277
TyrArg: 2.142 ± 0.338
TyrSer: 2.071 ± 0.391
TyrThr: 2.142 ± 0.443
TyrVal: 1.857 ± 0.308
TyrTrp: 0.786 ± 0.24
TyrTyr: 0.857 ± 0.223
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (14004 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
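A protein-level bootstrap of this kind can be sketched as below. The page does not describe the exact procedure, so this is an assumption-laden illustration: each of the 100 replicates redraws whole proteins with replacement, the dipeptide frequency is recomputed per replicate, and the reported error is taken to be the standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (frequency, error) in per mille for one dipeptide.

    Assumption: each replicate draws len(proteins) proteins with replacement,
    and the error is the standard deviation across the replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]

    def freq(sample):
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        return 1000.0 * hits / total if total else 0.0

    replicates = [
        freq(rng.choices(per_protein, k=len(per_protein)))
        for _ in range(n_boot)
    ]
    return freq(per_protein), statistics.stdev(replicates)


# Hypothetical toy sequences, not taken from the Atu_ph02 proteome.
toy_proteins = ["MAAAGK", "MKAAL", "MGGSW"]
point, error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the spread reflects variation between proteins. A dipeptide that never occurs yields 0.0 ± 0.0, as in the Xaa rows of the table.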

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski