Amino acid dipeptide frequency for Lactococcus phage TP901-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
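As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is not the code behind this table: the function name `dipeptide_permille`, the toy sequences, the use of one-letter codes, and the choice to normalise by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted in overlapping windows of
    length 2 and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptide windows.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not TP901-1 data): "MKKL" gives MK, KK, KL; "AKK" gives AK, KK.
toy_proteins = ["MKKL", "AKK"]
toy_freqs = dipeptide_permille(toy_proteins)
```

With these toy sequences there are five pairs in total, two of which are KK, so `toy_freqs["KK"]` is 400 ‰.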

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
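The comparison against the uniform expectation can be sketched as follows. The three values are copied from the table on this page; the function name `classify` and the simple greater-than/less-than rule are assumptions, since the page does not state the exact threshold used for the red and blue marking.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
STANDARD_PAIRS = 20 * 20
EXPECTED_PERMILLE = 1000.0 / STANDARD_PAIRS  # 2.5 per mille, i.e. 0.25 %


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values taken from the table (per mille).
examples = {"GluLeu": 8.722, "CysCys": 0.0, "AlaGly": 2.591}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under this rule GluLeu and AlaGly come out as over-represented and CysCys as under-represented. A stricter rule could also take the reported error into account, but that would be a further assumption.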

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.577 ± 1.261
AlaCys: 0.345 ± 0.146
AlaAsp: 4.231 ± 0.65
AlaGlu: 3.713 ± 0.707
AlaPhe: 3.022 ± 0.4
AlaGly: 2.591 ± 0.49
AlaHis: 0.604 ± 0.267
AlaIle: 4.577 ± 0.813
AlaLys: 6.304 ± 0.73
AlaLeu: 5.181 ± 0.675
AlaMet: 2.245 ± 0.423
AlaAsn: 4.75 ± 0.717
AlaPro: 2.159 ± 0.484
AlaGln: 2.936 ± 0.507
AlaArg: 1.641 ± 0.39
AlaSer: 4.836 ± 0.774
AlaThr: 3.886 ± 0.739
AlaVal: 2.936 ± 0.532
AlaTrp: 1.468 ± 0.405
AlaTyr: 2.418 ± 0.462
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.259 ± 0.13
CysCys: 0.0 ± 0.0
CysAsp: 0.345 ± 0.184
CysGlu: 0.604 ± 0.211
CysPhe: 0.173 ± 0.112
CysGly: 0.777 ± 0.326
CysHis: 0.086 ± 0.097
CysIle: 0.086 ± 0.094
CysLys: 0.086 ± 0.073
CysLeu: 0.432 ± 0.202
CysMet: 0.0 ± 0.0
CysAsn: 0.086 ± 0.079
CysPro: 0.345 ± 0.201
CysGln: 0.259 ± 0.156
CysArg: 0.173 ± 0.106
CysSer: 0.604 ± 0.201
CysThr: 0.086 ± 0.093
CysVal: 0.432 ± 0.163
CysTrp: 0.0 ± 0.0
CysTyr: 0.345 ± 0.164
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.936 ± 0.541
AspCys: 0.432 ± 0.229
AspAsp: 3.972 ± 0.685
AspGlu: 5.959 ± 1.181
AspPhe: 2.85 ± 0.512
AspGly: 4.145 ± 0.538
AspHis: 0.518 ± 0.22
AspIle: 4.922 ± 0.578
AspLys: 6.218 ± 0.656
AspLeu: 5.699 ± 0.712
AspMet: 1.554 ± 0.337
AspAsn: 4.404 ± 0.518
AspPro: 1.123 ± 0.348
AspGln: 1.209 ± 0.37
AspArg: 2.332 ± 0.412
AspSer: 4.231 ± 0.537
AspThr: 3.8 ± 0.652
AspVal: 3.627 ± 0.524
AspTrp: 1.209 ± 0.278
AspTyr: 2.504 ± 0.46
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.059 ± 0.624
GluCys: 0.518 ± 0.267
GluAsp: 2.763 ± 0.622
GluGlu: 5.44 ± 0.946
GluPhe: 3.713 ± 0.484
GluGly: 2.85 ± 0.501
GluHis: 0.864 ± 0.283
GluIle: 5.959 ± 0.969
GluLys: 6.39 ± 1.233
GluLeu: 8.722 ± 0.987
GluMet: 2.073 ± 0.397
GluAsn: 4.491 ± 0.769
GluPro: 1.986 ± 0.463
GluGln: 3.713 ± 0.545
GluArg: 2.245 ± 0.567
GluSer: 4.577 ± 0.584
GluThr: 3.282 ± 0.672
GluVal: 5.527 ± 0.703
GluTrp: 0.777 ± 0.231
GluTyr: 2.677 ± 0.529
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.591 ± 0.548
PheCys: 0.345 ± 0.169
PheAsp: 3.454 ± 0.417
PheGlu: 3.368 ± 0.504
PhePhe: 1.295 ± 0.345
PheGly: 3.109 ± 0.439
PheHis: 0.432 ± 0.184
PheIle: 2.763 ± 0.55
PheLys: 4.922 ± 0.666
PheLeu: 2.245 ± 0.438
PheMet: 1.468 ± 0.375
PheAsn: 2.936 ± 0.458
PhePro: 1.123 ± 0.33
PheGln: 1.727 ± 0.386
PheArg: 1.295 ± 0.286
PheSer: 3.454 ± 0.469
PheThr: 2.936 ± 0.496
PheVal: 2.418 ± 0.496
PheTrp: 0.259 ± 0.156
PheTyr: 1.209 ± 0.356
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.8 ± 0.695
GlyCys: 0.086 ± 0.072
GlyAsp: 3.886 ± 0.693
GlyGlu: 2.85 ± 0.558
GlyPhe: 3.109 ± 0.596
GlyGly: 4.663 ± 0.964
GlyHis: 0.95 ± 0.298
GlyIle: 4.922 ± 0.729
GlyLys: 5.613 ± 0.659
GlyLeu: 4.836 ± 1.217
GlyMet: 1.986 ± 0.482
GlyAsn: 3.109 ± 0.671
GlyPro: 1.209 ± 0.371
GlyGln: 2.763 ± 0.664
GlyArg: 2.245 ± 0.545
GlySer: 3.282 ± 0.528
GlyThr: 4.145 ± 0.644
GlyVal: 3.454 ± 0.589
GlyTrp: 0.604 ± 0.21
GlyTyr: 3.972 ± 0.568
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.432 ± 0.193
HisCys: 0.259 ± 0.164
HisAsp: 0.777 ± 0.293
HisGlu: 1.123 ± 0.332
HisPhe: 0.95 ± 0.297
HisGly: 0.432 ± 0.174
HisHis: 0.345 ± 0.146
HisIle: 0.604 ± 0.237
HisLys: 1.123 ± 0.35
HisLeu: 0.604 ± 0.228
HisMet: 0.432 ± 0.199
HisAsn: 0.259 ± 0.137
HisPro: 0.432 ± 0.191
HisGln: 0.432 ± 0.195
HisArg: 0.345 ± 0.141
HisSer: 0.691 ± 0.242
HisThr: 0.432 ± 0.185
HisVal: 0.691 ± 0.212
HisTrp: 0.086 ± 0.079
HisTyr: 0.864 ± 0.241
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.145 ± 0.593
IleCys: 0.259 ± 0.136
IleAsp: 4.836 ± 0.673
IleGlu: 6.045 ± 0.729
IlePhe: 2.245 ± 0.435
IleGly: 5.009 ± 0.921
IleHis: 0.345 ± 0.154
IleIle: 5.181 ± 0.735
IleLys: 6.736 ± 0.551
IleLeu: 3.972 ± 0.531
IleMet: 1.9 ± 0.401
IleAsn: 4.145 ± 0.593
IlePro: 2.936 ± 0.472
IleGln: 2.591 ± 0.415
IleArg: 2.677 ± 0.523
IleSer: 6.218 ± 0.776
IleThr: 4.404 ± 0.613
IleVal: 3.627 ± 0.664
IleTrp: 0.604 ± 0.23
IleTyr: 2.504 ± 0.482
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.427 ± 0.992
LysCys: 0.345 ± 0.171
LysAsp: 5.613 ± 0.858
LysGlu: 6.304 ± 0.692
LysPhe: 3.886 ± 0.602
LysGly: 4.663 ± 0.614
LysHis: 1.468 ± 0.35
LysIle: 5.959 ± 0.902
LysLys: 8.549 ± 1.227
LysLeu: 7.945 ± 1.11
LysMet: 2.85 ± 0.459
LysAsn: 5.699 ± 0.626
LysPro: 2.677 ± 0.416
LysGln: 4.059 ± 0.555
LysArg: 3.713 ± 0.728
LysSer: 5.181 ± 0.766
LysThr: 5.527 ± 0.746
LysVal: 4.318 ± 0.621
LysTrp: 1.209 ± 0.365
LysTyr: 3.368 ± 0.545
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.613 ± 0.72
LeuCys: 0.518 ± 0.219
LeuAsp: 5.44 ± 0.498
LeuGlu: 5.527 ± 0.755
LeuPhe: 3.195 ± 0.613
LeuGly: 5.354 ± 0.464
LeuHis: 0.345 ± 0.161
LeuIle: 5.613 ± 0.73
LeuLys: 8.377 ± 1.031
LeuLeu: 6.045 ± 0.805
LeuMet: 1.641 ± 0.404
LeuAsn: 5.527 ± 0.509
LeuPro: 3.022 ± 0.461
LeuGln: 3.454 ± 0.57
LeuArg: 2.591 ± 0.447
LeuSer: 7.599 ± 0.846
LeuThr: 5.009 ± 0.473
LeuVal: 4.318 ± 0.479
LeuTrp: 1.468 ± 0.485
LeuTyr: 2.332 ± 0.343
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.813 ± 0.339
MetCys: 0.086 ± 0.079
MetAsp: 1.727 ± 0.304
MetGlu: 1.554 ± 0.379
MetPhe: 0.777 ± 0.25
MetGly: 1.641 ± 0.427
MetHis: 0.518 ± 0.189
MetIle: 1.295 ± 0.318
MetLys: 2.245 ± 0.493
MetLeu: 1.468 ± 0.3
MetMet: 0.864 ± 0.236
MetAsn: 1.727 ± 0.357
MetPro: 0.95 ± 0.321
MetGln: 0.95 ± 0.243
MetArg: 1.036 ± 0.336
MetSer: 1.813 ± 0.315
MetThr: 3.109 ± 0.506
MetVal: 1.123 ± 0.415
MetTrp: 0.345 ± 0.132
MetTyr: 0.777 ± 0.258
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.663 ± 0.933
AsnCys: 0.345 ± 0.172
AsnAsp: 3.713 ± 0.658
AsnGlu: 3.886 ± 0.63
AsnPhe: 2.332 ± 0.424
AsnGly: 5.44 ± 0.791
AsnHis: 0.432 ± 0.203
AsnIle: 4.491 ± 0.577
AsnLys: 5.268 ± 0.775
AsnLeu: 5.613 ± 0.623
AsnMet: 1.468 ± 0.378
AsnAsn: 3.8 ± 0.602
AsnPro: 2.677 ± 0.465
AsnGln: 2.677 ± 0.455
AsnArg: 2.504 ± 0.503
AsnSer: 4.318 ± 0.535
AsnThr: 2.936 ± 0.527
AsnVal: 3.454 ± 0.531
AsnTrp: 0.864 ± 0.314
AsnTyr: 1.727 ± 0.536
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.295 ± 0.31
ProCys: 0.0 ± 0.0
ProAsp: 2.418 ± 0.469
ProGlu: 2.763 ± 0.408
ProPhe: 1.641 ± 0.405
ProGly: 0.691 ± 0.196
ProHis: 0.691 ± 0.192
ProIle: 2.504 ± 0.425
ProLys: 2.85 ± 0.418
ProLeu: 2.418 ± 0.405
ProMet: 0.518 ± 0.191
ProAsn: 1.813 ± 0.546
ProPro: 0.691 ± 0.203
ProGln: 1.468 ± 0.398
ProArg: 1.036 ± 0.324
ProSer: 1.554 ± 0.398
ProThr: 1.9 ± 0.419
ProVal: 2.504 ± 0.398
ProTrp: 0.173 ± 0.127
ProTyr: 0.95 ± 0.27
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.095 ± 0.699
GlnCys: 0.086 ± 0.109
GlnAsp: 1.123 ± 0.281
GlnGlu: 4.059 ± 0.471
GlnPhe: 1.554 ± 0.401
GlnGly: 2.677 ± 0.544
GlnHis: 0.259 ± 0.146
GlnIle: 3.368 ± 0.492
GlnLys: 3.109 ± 0.523
GlnLeu: 3.109 ± 0.452
GlnMet: 0.777 ± 0.271
GlnAsn: 1.986 ± 0.432
GlnPro: 1.468 ± 0.392
GlnGln: 2.504 ± 0.473
GlnArg: 0.95 ± 0.232
GlnSer: 2.504 ± 0.552
GlnThr: 2.677 ± 0.438
GlnVal: 2.245 ± 0.374
GlnTrp: 0.691 ± 0.282
GlnTyr: 1.295 ± 0.301
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.641 ± 0.27
ArgCys: 0.518 ± 0.217
ArgAsp: 2.418 ± 0.501
ArgGlu: 2.418 ± 0.538
ArgPhe: 2.245 ± 0.53
ArgGly: 1.295 ± 0.348
ArgHis: 0.432 ± 0.205
ArgIle: 2.418 ± 0.486
ArgLys: 3.368 ± 0.517
ArgLeu: 3.713 ± 0.625
ArgMet: 0.95 ± 0.314
ArgAsn: 2.245 ± 0.347
ArgPro: 0.691 ± 0.328
ArgGln: 1.209 ± 0.265
ArgArg: 0.691 ± 0.208
ArgSer: 2.332 ± 0.337
ArgThr: 1.986 ± 0.381
ArgVal: 1.9 ± 0.331
ArgTrp: 0.259 ± 0.142
ArgTyr: 1.295 ± 0.392
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.231 ± 0.975
SerCys: 0.086 ± 0.084
SerAsp: 5.009 ± 0.58
SerGlu: 5.095 ± 0.674
SerPhe: 3.713 ± 0.508
SerGly: 5.872 ± 0.77
SerHis: 0.864 ± 0.232
SerIle: 4.145 ± 0.539
SerLys: 4.145 ± 0.722
SerLeu: 6.218 ± 0.777
SerMet: 1.382 ± 0.308
SerAsn: 5.268 ± 0.718
SerPro: 1.209 ± 0.286
SerGln: 2.85 ± 0.434
SerArg: 2.245 ± 0.382
SerSer: 4.059 ± 0.781
SerThr: 3.886 ± 0.722
SerVal: 5.095 ± 0.58
SerTrp: 0.95 ± 0.289
SerTyr: 2.504 ± 0.547
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.059 ± 0.568
ThrCys: 0.086 ± 0.087
ThrAsp: 4.059 ± 0.707
ThrGlu: 4.231 ± 0.661
ThrPhe: 2.677 ± 0.472
ThrGly: 4.231 ± 0.619
ThrHis: 0.259 ± 0.146
ThrIle: 4.404 ± 0.726
ThrLys: 5.44 ± 0.806
ThrLeu: 4.836 ± 0.563
ThrMet: 1.123 ± 0.404
ThrAsn: 2.936 ± 0.438
ThrPro: 1.727 ± 0.528
ThrGln: 1.468 ± 0.306
ThrArg: 2.159 ± 0.359
ThrSer: 3.454 ± 0.391
ThrThr: 4.318 ± 0.739
ThrVal: 5.527 ± 0.811
ThrTrp: 0.95 ± 0.324
ThrTyr: 2.763 ± 0.605
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.282 ± 0.597
ValCys: 0.259 ± 0.163
ValAsp: 3.8 ± 0.621
ValGlu: 4.836 ± 0.722
ValPhe: 2.332 ± 0.437
ValGly: 2.677 ± 0.477
ValHis: 0.864 ± 0.316
ValIle: 3.713 ± 0.509
ValLys: 6.045 ± 0.728
ValLeu: 5.44 ± 0.779
ValMet: 1.295 ± 0.363
ValAsn: 4.231 ± 0.504
ValPro: 1.813 ± 0.357
ValGln: 2.504 ± 0.658
ValArg: 1.727 ± 0.381
ValSer: 4.663 ± 0.748
ValThr: 3.713 ± 0.603
ValVal: 3.454 ± 0.58
ValTrp: 0.691 ± 0.219
ValTyr: 1.9 ± 0.44
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.036 ± 0.299
TrpCys: 0.0 ± 0.0
TrpAsp: 1.468 ± 0.366
TrpGlu: 0.691 ± 0.264
TrpPhe: 0.432 ± 0.186
TrpGly: 0.777 ± 0.325
TrpHis: 0.173 ± 0.107
TrpIle: 0.864 ± 0.256
TrpLys: 1.036 ± 0.322
TrpLeu: 0.864 ± 0.285
TrpMet: 0.086 ± 0.072
TrpAsn: 1.036 ± 0.395
TrpPro: 0.259 ± 0.182
TrpGln: 1.036 ± 0.329
TrpArg: 0.518 ± 0.204
TrpSer: 0.777 ± 0.261
TrpThr: 0.777 ± 0.256
TrpVal: 0.691 ± 0.249
TrpTrp: 0.173 ± 0.109
TrpTyr: 0.691 ± 0.283
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.468 ± 0.402
TyrCys: 0.604 ± 0.266
TyrAsp: 2.591 ± 0.402
TyrGlu: 2.159 ± 0.409
TyrPhe: 1.295 ± 0.429
TyrGly: 2.591 ± 0.52
TyrHis: 0.864 ± 0.296
TyrIle: 2.591 ± 0.375
TyrLys: 2.936 ± 0.483
TyrLeu: 3.627 ± 0.721
TyrMet: 1.123 ± 0.295
TyrAsn: 2.159 ± 0.408
TyrPro: 1.382 ± 0.363
TyrGln: 1.727 ± 0.327
TyrArg: 1.9 ± 0.451
TyrSer: 2.677 ± 0.569
TyrThr: 1.9 ± 0.478
TyrVal: 1.9 ± 0.337
TyrTrp: 0.604 ± 0.217
TyrTyr: 1.727 ± 0.466
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (11581 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
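A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. This is only an illustration under stated assumptions, not the database's own procedure: the names `pair_permille` and `bootstrap_error`, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all hypothetical choices.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping windows."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the standard deviation of the resampled
    estimates; the page only says bootstrapping was done at protein level.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return stdev(estimates)


# Toy sequences (not TP901-1 data).
toy_set = ["AAAA", "KKKK", "AKAK"]
toy_error = bootstrap_error(toy_set, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the set.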

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski