Amino acid dipeptide frequency for Listeria phage PSU-VKH-LP041

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
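As an illustration, dipeptide frequencies could be computed along the following lines. This is a minimal sketch, not the code behind the table: the function name `dipeptide_frequencies` and the toy sequences are hypothetical, sequences use one-letter codes (the table uses three-letter codes), and counting overlapping pairs within each protein, without pairing residues across protein boundaries, is an assumption about the counting convention.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein; no pairs are
        # formed across protein boundaries (assumed convention).
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome, for illustration only.
toy_proteome = ["MKKLA", "MAKK"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The returned frequencies always sum to 1000‰, which is a quick sanity check for any real input.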

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
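The arithmetic behind the expected value, and the over/under comparison, can be sketched as follows. The helper names are hypothetical, and comparing the observed value directly with the uniform expectation (ignoring the error estimate) is an assumption about how the colouring is decided.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be shown in red
    if observed_permille < expected:
        return "under"   # would be shown in blue
    return "as expected"


print(uniform_expectation_permille(20))            # 2.5   (1/400 = 0.25%)
print(round(uniform_expectation_permille(21), 3))  # 2.268 (1/441, with Xaa)
print(classify(5.533))  # AlaAla from the table -> over
print(classify(0.395))  # AlaCys from the table -> under
```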

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
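For example, the conversion from per mille to a plain fraction or to percent is a single scaling step. The function names below are hypothetical; the value 5.533 is the AlaAla entry of the table.

```python
def permille_to_fraction(value):
    """Per mille -> fraction of all dipeptides (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0


print(f"{permille_to_fraction(5.533):.6f}")  # 0.005533
print(f"{permille_to_percent(5.533):.4f}")   # 0.5533
```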

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.533 ± 0.703
AlaCys: 0.395 ± 0.16
AlaAsp: 3.557 ± 0.45
AlaGlu: 5.335 ± 0.53
AlaPhe: 2.305 ± 0.379
AlaGly: 4.874 ± 0.819
AlaHis: 0.856 ± 0.235
AlaIle: 5.203 ± 0.63
AlaLys: 7.245 ± 0.962
AlaLeu: 6.059 ± 0.7
AlaMet: 1.91 ± 0.356
AlaAsn: 5.335 ± 0.595
AlaPro: 1.12 ± 0.258
AlaGln: 1.976 ± 0.46
AlaArg: 3.03 ± 0.404
AlaSer: 5.006 ± 0.828
AlaThr: 3.952 ± 0.458
AlaVal: 5.006 ± 0.664
AlaTrp: 0.527 ± 0.183
AlaTyr: 1.712 ± 0.331
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.263 ± 0.154
CysCys: 0.132 ± 0.085
CysAsp: 0.198 ± 0.112
CysGlu: 1.054 ± 0.292
CysPhe: 0.263 ± 0.139
CysGly: 0.724 ± 0.222
CysHis: 0.0 ± 0.0
CysIle: 0.395 ± 0.197
CysLys: 0.856 ± 0.283
CysLeu: 0.263 ± 0.138
CysMet: 0.066 ± 0.062
CysAsn: 0.461 ± 0.198
CysPro: 0.461 ± 0.179
CysGln: 0.263 ± 0.153
CysArg: 0.395 ± 0.168
CysSer: 0.527 ± 0.171
CysThr: 0.395 ± 0.145
CysVal: 0.263 ± 0.119
CysTrp: 0.0 ± 0.0
CysTyr: 0.593 ± 0.207
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.688 ± 0.713
AspCys: 0.79 ± 0.288
AspAsp: 3.688 ± 0.69
AspGlu: 5.401 ± 0.613
AspPhe: 2.7 ± 0.43
AspGly: 2.7 ± 0.391
AspHis: 0.593 ± 0.229
AspIle: 4.742 ± 0.615
AspLys: 5.006 ± 0.654
AspLeu: 5.137 ± 0.615
AspMet: 1.581 ± 0.285
AspAsn: 3.491 ± 0.473
AspPro: 1.449 ± 0.33
AspGln: 1.186 ± 0.267
AspArg: 2.042 ± 0.337
AspSer: 4.94 ± 0.746
AspThr: 2.832 ± 0.501
AspVal: 3.096 ± 0.489
AspTrp: 0.922 ± 0.257
AspTyr: 2.437 ± 0.44
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.401 ± 0.596
GluCys: 0.593 ± 0.2
GluAsp: 3.096 ± 0.522
GluGlu: 8.299 ± 0.924
GluPhe: 3.425 ± 0.458
GluGly: 4.018 ± 0.475
GluHis: 1.647 ± 0.333
GluIle: 7.245 ± 0.538
GluLys: 8.694 ± 0.844
GluLeu: 8.957 ± 1.104
GluMet: 1.91 ± 0.392
GluAsn: 4.347 ± 0.402
GluPro: 1.581 ± 0.374
GluGln: 3.886 ± 0.578
GluArg: 4.347 ± 0.557
GluSer: 3.952 ± 0.473
GluThr: 3.952 ± 0.523
GluVal: 5.73 ± 0.67
GluTrp: 0.79 ± 0.228
GluTyr: 3.227 ± 0.445
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.173 ± 0.404
PheCys: 0.329 ± 0.16
PheAsp: 3.096 ± 0.569
PheGlu: 3.886 ± 0.528
PhePhe: 1.317 ± 0.276
PheGly: 2.766 ± 0.497
PheHis: 0.593 ± 0.243
PheIle: 2.569 ± 0.431
PheLys: 3.952 ± 0.56
PheLeu: 3.096 ± 0.535
PheMet: 0.922 ± 0.239
PheAsn: 2.635 ± 0.486
PhePro: 1.581 ± 0.346
PheGln: 0.79 ± 0.251
PheArg: 1.251 ± 0.202
PheSer: 2.766 ± 0.411
PheThr: 2.042 ± 0.325
PheVal: 1.778 ± 0.391
PheTrp: 0.198 ± 0.125
PheTyr: 1.186 ± 0.29
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.688 ± 0.886
GlyCys: 0.461 ± 0.195
GlyAsp: 2.503 ± 0.365
GlyGlu: 4.413 ± 0.478
GlyPhe: 2.239 ± 0.375
GlyGly: 4.281 ± 1.047
GlyHis: 0.922 ± 0.233
GlyIle: 4.94 ± 0.586
GlyLys: 6.059 ± 0.74
GlyLeu: 4.215 ± 0.466
GlyMet: 1.12 ± 0.402
GlyAsn: 3.227 ± 0.586
GlyPro: 1.186 ± 0.504
GlyGln: 1.844 ± 0.331
GlyArg: 2.108 ± 0.363
GlySer: 3.359 ± 0.598
GlyThr: 3.82 ± 0.599
GlyVal: 3.82 ± 0.634
GlyTrp: 0.922 ± 0.241
GlyTyr: 2.239 ± 0.408
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.79 ± 0.2
HisCys: 0.132 ± 0.087
HisAsp: 0.856 ± 0.214
HisGlu: 1.515 ± 0.473
HisPhe: 0.461 ± 0.141
HisGly: 0.724 ± 0.183
HisHis: 0.395 ± 0.146
HisIle: 1.12 ± 0.293
HisLys: 1.186 ± 0.242
HisLeu: 0.988 ± 0.263
HisMet: 0.132 ± 0.102
HisAsn: 0.461 ± 0.175
HisPro: 0.659 ± 0.237
HisGln: 0.659 ± 0.237
HisArg: 0.593 ± 0.199
HisSer: 1.581 ± 0.308
HisThr: 0.988 ± 0.226
HisVal: 1.186 ± 0.242
HisTrp: 0.066 ± 0.073
HisTyr: 0.263 ± 0.121
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.467 ± 0.554
IleCys: 0.461 ± 0.192
IleAsp: 6.389 ± 0.563
IleGlu: 6.652 ± 0.829
IlePhe: 2.569 ± 0.419
IleGly: 3.754 ± 0.644
IleHis: 1.647 ± 0.296
IleIle: 4.94 ± 0.812
IleLys: 6.784 ± 0.682
IleLeu: 5.006 ± 0.524
IleMet: 1.383 ± 0.254
IleAsn: 5.994 ± 0.651
IlePro: 3.03 ± 0.511
IleGln: 2.832 ± 0.464
IleArg: 2.239 ± 0.498
IleSer: 3.82 ± 0.607
IleThr: 4.413 ± 0.514
IleVal: 4.413 ± 0.613
IleTrp: 0.593 ± 0.205
IleTyr: 2.7 ± 0.559
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.586 ± 0.8
LysCys: 0.659 ± 0.237
LysAsp: 4.742 ± 0.513
LysGlu: 7.772 ± 0.692
LysPhe: 2.635 ± 0.399
LysGly: 5.203 ± 0.745
LysHis: 1.449 ± 0.283
LysIle: 6.981 ± 0.66
LysLys: 8.299 ± 0.729
LysLeu: 8.167 ± 0.784
LysMet: 2.7 ± 0.477
LysAsn: 6.257 ± 0.652
LysPro: 1.976 ± 0.348
LysGln: 4.018 ± 0.541
LysArg: 3.557 ± 0.376
LysSer: 6.323 ± 0.574
LysThr: 6.191 ± 0.632
LysVal: 5.598 ± 0.57
LysTrp: 1.91 ± 0.366
LysTyr: 3.425 ± 0.523
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.598 ± 0.551
LeuCys: 0.922 ± 0.247
LeuAsp: 5.401 ± 0.607
LeuGlu: 7.443 ± 1.026
LeuPhe: 3.359 ± 0.561
LeuGly: 4.874 ± 0.791
LeuHis: 0.79 ± 0.243
LeuIle: 5.401 ± 0.74
LeuLys: 6.389 ± 0.604
LeuLeu: 5.994 ± 0.831
LeuMet: 1.778 ± 0.382
LeuAsn: 6.059 ± 0.691
LeuPro: 2.305 ± 0.399
LeuGln: 2.503 ± 0.43
LeuArg: 3.359 ± 0.549
LeuSer: 5.401 ± 0.539
LeuThr: 5.137 ± 0.669
LeuVal: 4.149 ± 0.632
LeuTrp: 0.395 ± 0.179
LeuTyr: 2.173 ± 0.379
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.647 ± 0.328
MetCys: 0.329 ± 0.185
MetAsp: 0.724 ± 0.218
MetGlu: 1.251 ± 0.248
MetPhe: 0.659 ± 0.188
MetGly: 0.988 ± 0.24
MetHis: 0.329 ± 0.133
MetIle: 1.976 ± 0.362
MetLys: 2.832 ± 0.389
MetLeu: 1.515 ± 0.271
MetMet: 0.461 ± 0.151
MetAsn: 1.186 ± 0.314
MetPro: 0.856 ± 0.229
MetGln: 0.988 ± 0.25
MetArg: 1.186 ± 0.288
MetSer: 1.383 ± 0.411
MetThr: 1.91 ± 0.387
MetVal: 1.383 ± 0.296
MetTrp: 0.132 ± 0.085
MetTyr: 1.251 ± 0.288
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.742 ± 0.614
AsnCys: 0.461 ± 0.199
AsnAsp: 3.754 ± 0.606
AsnGlu: 5.335 ± 0.609
AsnPhe: 2.108 ± 0.396
AsnGly: 3.886 ± 0.397
AsnHis: 0.79 ± 0.209
AsnIle: 3.952 ± 0.449
AsnLys: 5.73 ± 0.657
AsnLeu: 5.664 ± 0.478
AsnMet: 0.988 ± 0.301
AsnAsn: 4.61 ± 0.902
AsnPro: 2.108 ± 0.415
AsnGln: 1.778 ± 0.255
AsnArg: 2.766 ± 0.518
AsnSer: 5.071 ± 0.775
AsnThr: 3.359 ± 0.551
AsnVal: 3.161 ± 0.462
AsnTrp: 0.659 ± 0.208
AsnTyr: 1.976 ± 0.44
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.503 ± 0.401
ProCys: 0.066 ± 0.069
ProAsp: 1.778 ± 0.359
ProGlu: 2.371 ± 0.35
ProPhe: 1.449 ± 0.379
ProGly: 2.305 ± 0.46
ProHis: 0.593 ± 0.166
ProIle: 1.449 ± 0.308
ProLys: 2.239 ± 0.406
ProLeu: 2.371 ± 0.327
ProMet: 0.329 ± 0.138
ProAsn: 1.317 ± 0.326
ProPro: 1.317 ± 0.354
ProGln: 0.922 ± 0.33
ProArg: 1.186 ± 0.255
ProSer: 1.712 ± 0.324
ProThr: 1.186 ± 0.278
ProVal: 1.647 ± 0.425
ProTrp: 0.329 ± 0.17
ProTyr: 0.593 ± 0.211
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.898 ± 0.39
GlnCys: 0.066 ± 0.073
GlnAsp: 1.647 ± 0.34
GlnGlu: 3.425 ± 0.562
GlnPhe: 1.515 ± 0.243
GlnGly: 1.054 ± 0.24
GlnHis: 0.724 ± 0.218
GlnIle: 2.503 ± 0.435
GlnLys: 3.622 ± 0.433
GlnLeu: 3.557 ± 0.578
GlnMet: 0.724 ± 0.215
GlnAsn: 1.976 ± 0.395
GlnPro: 0.395 ± 0.192
GlnGln: 1.515 ± 0.365
GlnArg: 1.515 ± 0.233
GlnSer: 1.647 ± 0.311
GlnThr: 1.844 ± 0.319
GlnVal: 2.305 ± 0.343
GlnTrp: 0.263 ± 0.14
GlnTyr: 0.724 ± 0.24
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.173 ± 0.312
ArgCys: 0.198 ± 0.149
ArgAsp: 2.437 ± 0.418
ArgGlu: 3.688 ± 0.462
ArgPhe: 1.712 ± 0.364
ArgGly: 1.976 ± 0.404
ArgHis: 0.461 ± 0.181
ArgIle: 3.425 ± 0.502
ArgLys: 3.952 ± 0.416
ArgLeu: 2.898 ± 0.54
ArgMet: 1.449 ± 0.306
ArgAsn: 2.042 ± 0.3
ArgPro: 1.12 ± 0.272
ArgGln: 1.186 ± 0.298
ArgArg: 2.173 ± 0.343
ArgSer: 2.964 ± 0.388
ArgThr: 1.647 ± 0.389
ArgVal: 2.766 ± 0.369
ArgTrp: 0.724 ± 0.188
ArgTyr: 2.173 ± 0.397
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.137 ± 0.932
SerCys: 0.593 ± 0.223
SerAsp: 4.479 ± 0.543
SerGlu: 5.071 ± 0.569
SerPhe: 3.293 ± 0.501
SerGly: 4.94 ± 0.603
SerHis: 0.724 ± 0.256
SerIle: 5.467 ± 0.678
SerLys: 5.269 ± 0.584
SerLeu: 4.149 ± 0.397
SerMet: 1.581 ± 0.28
SerAsn: 3.952 ± 0.502
SerPro: 1.515 ± 0.268
SerGln: 1.647 ± 0.306
SerArg: 1.976 ± 0.354
SerSer: 4.215 ± 0.513
SerThr: 4.347 ± 0.559
SerVal: 4.742 ± 0.6
SerTrp: 0.659 ± 0.189
SerTyr: 2.239 ± 0.379
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.401 ± 0.707
ThrCys: 0.461 ± 0.168
ThrAsp: 3.293 ± 0.462
ThrGlu: 4.215 ± 0.461
ThrPhe: 2.635 ± 0.364
ThrGly: 3.359 ± 0.598
ThrHis: 0.659 ± 0.179
ThrIle: 5.335 ± 0.573
ThrLys: 5.137 ± 0.528
ThrLeu: 3.425 ± 0.453
ThrMet: 0.724 ± 0.218
ThrAsn: 2.832 ± 0.409
ThrPro: 1.647 ± 0.324
ThrGln: 2.173 ± 0.39
ThrArg: 2.305 ± 0.337
ThrSer: 3.557 ± 0.506
ThrThr: 4.084 ± 0.695
ThrVal: 3.491 ± 0.502
ThrTrp: 0.659 ± 0.207
ThrTyr: 1.91 ± 0.351
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.149 ± 0.574
ValCys: 0.066 ± 0.059
ValAsp: 4.215 ± 0.612
ValGlu: 4.149 ± 0.598
ValPhe: 2.437 ± 0.44
ValGly: 2.569 ± 0.473
ValHis: 0.988 ± 0.259
ValIle: 4.084 ± 0.594
ValLys: 5.796 ± 0.715
ValLeu: 4.61 ± 0.555
ValMet: 1.976 ± 0.332
ValAsn: 3.754 ± 0.503
ValPro: 2.108 ± 0.426
ValGln: 2.239 ± 0.388
ValArg: 3.425 ± 0.518
ValSer: 4.347 ± 0.55
ValThr: 3.491 ± 0.592
ValVal: 3.952 ± 0.676
ValTrp: 0.395 ± 0.138
ValTyr: 1.976 ± 0.421
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.593 ± 0.205
TrpCys: 0.066 ± 0.06
TrpAsp: 0.79 ± 0.179
TrpGlu: 1.054 ± 0.204
TrpPhe: 0.461 ± 0.158
TrpGly: 0.395 ± 0.174
TrpHis: 0.263 ± 0.139
TrpIle: 0.724 ± 0.195
TrpLys: 1.449 ± 0.264
TrpLeu: 0.988 ± 0.246
TrpMet: 0.461 ± 0.209
TrpAsn: 0.724 ± 0.218
TrpPro: 0.198 ± 0.113
TrpGln: 0.066 ± 0.075
TrpArg: 0.395 ± 0.153
TrpSer: 0.79 ± 0.181
TrpThr: 0.263 ± 0.133
TrpVal: 0.395 ± 0.153
TrpTrp: 0.066 ± 0.068
TrpTyr: 0.461 ± 0.356
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.503 ± 0.386
TyrCys: 0.395 ± 0.197
TyrAsp: 1.778 ± 0.356
TyrGlu: 2.766 ± 0.505
TyrPhe: 1.383 ± 0.303
TyrGly: 1.976 ± 0.305
TyrHis: 0.329 ± 0.15
TyrIle: 2.7 ± 0.489
TyrLys: 3.952 ± 0.599
TyrLeu: 2.371 ± 0.44
TyrMet: 0.724 ± 0.204
TyrAsn: 2.239 ± 0.388
TyrPro: 1.054 ± 0.338
TyrGln: 1.449 ± 0.313
TyrArg: 1.317 ± 0.342
TyrSer: 2.635 ± 0.534
TyrThr: 1.449 ± 0.225
TyrVal: 1.844 ± 0.299
TyrTrp: 0.395 ± 0.157
TyrTyr: 1.449 ± 0.321
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 81 proteins (15184 amino acids)
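
The table entries above can also be read programmatically. The following is a minimal sketch under stated assumptions: each cell line has the shape `AlaAla: 5.533 ± 0.703`, optionally preceded by a repeated copy of the value, and the names `CELL` and `parse_cells` are hypothetical helpers, not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, a value and an error (both in per mille).
# A leading repeated copy of the value, if present, is skipped.
CELL = re.compile(
    r"^\s*(?:\d+(?:\.\d+)?)?"
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*"
    r"(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)\s*$"
)


def parse_cells(lines):
    """Map (first residue, second residue) -> (value, error)."""
    table = {}
    for line in lines:
        match = CELL.match(line)
        if match:  # row labels such as "Ala" do not match and are skipped
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table


sample = ["Ala", "5.533AlaAla: 5.533 ± 0.703", "AlaCys: 0.395 ± 0.16"]
cells = parse_cells(sample)
print(cells[("Ala", "Ala")])  # (5.533, 0.703)
print(cells[("Ala", "Cys")])  # (0.395, 0.16)
```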

Note: The error has been estimated by bootstrapping (×100) at the protein level.
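
A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under assumptions, not the database's implementation: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 resamples, and the sample standard deviation of those estimates is reported as the error. The function names, the toy sequences (one-letter codes) and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, same proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(counts[pair] for counts in sample)
        total = sum(sum(counts.values()) for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy proteome, for illustration only.
toy_proteome = ["MKKLA", "MAKK", "AAKKA"]
print(bootstrap_error(toy_proteome, "KK"))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.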

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski