Amino acid dipepetide frequency for Tsukamurella phage TIN3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.898AlaAla: 10.898 ± 2.037
0.718AlaCys: 0.718 ± 0.204
5.829AlaAsp: 5.829 ± 0.51
6.59AlaGlu: 6.59 ± 0.656
3.633AlaPhe: 3.633 ± 0.382
6.421AlaGly: 6.421 ± 0.693
1.394AlaHis: 1.394 ± 0.232
5.154AlaIle: 5.154 ± 0.613
5.027AlaLys: 5.027 ± 0.514
7.139AlaLeu: 7.139 ± 0.787
2.408AlaMet: 2.408 ± 0.606
4.182AlaAsn: 4.182 ± 0.442
4.942AlaPro: 4.942 ± 0.473
2.915AlaGln: 2.915 ± 0.586
5.534AlaArg: 5.534 ± 0.605
5.745AlaSer: 5.745 ± 0.621
5.365AlaThr: 5.365 ± 0.5
5.829AlaVal: 5.829 ± 0.602
1.563AlaTrp: 1.563 ± 0.328
2.661AlaTyr: 2.661 ± 0.348
0.0AlaXaa: 0.0 ± 0.0
Cys
0.845CysAla: 0.845 ± 0.224
0.127CysCys: 0.127 ± 0.091
0.972CysAsp: 0.972 ± 0.272
0.845CysGlu: 0.845 ± 0.22
0.211CysPhe: 0.211 ± 0.104
1.056CysGly: 1.056 ± 0.288
0.296CysHis: 0.296 ± 0.116
0.507CysIle: 0.507 ± 0.166
0.253CysLys: 0.253 ± 0.112
0.549CysLeu: 0.549 ± 0.151
0.296CysMet: 0.296 ± 0.127
0.338CysAsn: 0.338 ± 0.139
0.634CysPro: 0.634 ± 0.175
0.422CysGln: 0.422 ± 0.125
0.76CysArg: 0.76 ± 0.191
1.056CysSer: 1.056 ± 0.247
0.803CysThr: 0.803 ± 0.21
0.845CysVal: 0.845 ± 0.226
0.084CysTrp: 0.084 ± 0.072
0.253CysTyr: 0.253 ± 0.099
0.0CysXaa: 0.0 ± 0.0
Asp
6.294AspAla: 6.294 ± 0.514
0.803AspCys: 0.803 ± 0.178
3.548AspAsp: 3.548 ± 0.547
4.773AspGlu: 4.773 ± 0.737
2.197AspPhe: 2.197 ± 0.322
4.773AspGly: 4.773 ± 0.497
1.183AspHis: 1.183 ± 0.253
2.746AspIle: 2.746 ± 0.308
3.21AspLys: 3.21 ± 0.391
5.829AspLeu: 5.829 ± 0.571
1.563AspMet: 1.563 ± 0.258
2.746AspAsn: 2.746 ± 0.331
4.055AspPro: 4.055 ± 0.621
2.281AspGln: 2.281 ± 0.3
2.83AspArg: 2.83 ± 0.332
3.464AspSer: 3.464 ± 0.399
3.591AspThr: 3.591 ± 0.398
3.633AspVal: 3.633 ± 0.429
0.929AspTrp: 0.929 ± 0.234
1.901AspTyr: 1.901 ± 0.346
0.0AspXaa: 0.0 ± 0.0
Glu
6.252GluAla: 6.252 ± 0.638
0.76GluCys: 0.76 ± 0.206
4.182GluAsp: 4.182 ± 0.555
5.703GluGlu: 5.703 ± 0.698
2.619GluPhe: 2.619 ± 0.459
4.478GluGly: 4.478 ± 0.314
1.352GluHis: 1.352 ± 0.324
3.929GluIle: 3.929 ± 0.504
3.337GluLys: 3.337 ± 0.377
5.745GluLeu: 5.745 ± 0.415
2.197GluMet: 2.197 ± 0.316
2.366GluAsn: 2.366 ± 0.292
2.872GluPro: 2.872 ± 0.424
2.788GluGln: 2.788 ± 0.361
4.52GluArg: 4.52 ± 0.583
3.76GluSer: 3.76 ± 0.373
3.168GluThr: 3.168 ± 0.355
5.196GluVal: 5.196 ± 0.602
1.225GluTrp: 1.225 ± 0.203
2.323GluTyr: 2.323 ± 0.383
0.0GluXaa: 0.0 ± 0.0
Phe
2.788PheAla: 2.788 ± 0.42
0.422PheCys: 0.422 ± 0.152
2.535PheAsp: 2.535 ± 0.346
2.366PheGlu: 2.366 ± 0.354
1.098PhePhe: 1.098 ± 0.189
3.591PheGly: 3.591 ± 0.488
0.591PheHis: 0.591 ± 0.175
1.69PheIle: 1.69 ± 0.27
2.323PheLys: 2.323 ± 0.297
1.563PheLeu: 1.563 ± 0.245
1.141PheMet: 1.141 ± 0.196
1.69PheAsn: 1.69 ± 0.285
1.436PhePro: 1.436 ± 0.295
1.014PheGln: 1.014 ± 0.263
1.943PheArg: 1.943 ± 0.334
2.492PheSer: 2.492 ± 0.308
2.028PheThr: 2.028 ± 0.298
2.45PheVal: 2.45 ± 0.303
0.296PheTrp: 0.296 ± 0.115
0.887PheTyr: 0.887 ± 0.204
0.0PheXaa: 0.0 ± 0.0
Gly
6.167GlyAla: 6.167 ± 0.572
0.718GlyCys: 0.718 ± 0.252
5.323GlyAsp: 5.323 ± 0.527
4.52GlyGlu: 4.52 ± 0.473
2.535GlyPhe: 2.535 ± 0.45
8.406GlyGly: 8.406 ± 1.509
1.563GlyHis: 1.563 ± 0.269
3.802GlyIle: 3.802 ± 0.469
5.449GlyLys: 5.449 ± 0.555
5.914GlyLeu: 5.914 ± 0.729
1.943GlyMet: 1.943 ± 0.489
3.21GlyAsn: 3.21 ± 0.428
3.591GlyPro: 3.591 ± 0.4
2.366GlyGln: 2.366 ± 0.421
3.886GlyArg: 3.886 ± 0.472
5.914GlySer: 5.914 ± 0.632
4.9GlyThr: 4.9 ± 0.56
5.872GlyVal: 5.872 ± 0.623
1.436GlyTrp: 1.436 ± 0.271
2.661GlyTyr: 2.661 ± 0.424
0.0GlyXaa: 0.0 ± 0.0
His
0.803HisAla: 0.803 ± 0.149
0.253HisCys: 0.253 ± 0.113
0.845HisAsp: 0.845 ± 0.188
1.014HisGlu: 1.014 ± 0.223
0.465HisPhe: 0.465 ± 0.142
1.521HisGly: 1.521 ± 0.338
0.338HisHis: 0.338 ± 0.159
0.929HisIle: 0.929 ± 0.197
1.056HisLys: 1.056 ± 0.225
2.07HisLeu: 2.07 ± 0.314
0.338HisMet: 0.338 ± 0.139
0.718HisAsn: 0.718 ± 0.165
0.845HisPro: 0.845 ± 0.199
0.338HisGln: 0.338 ± 0.126
1.394HisArg: 1.394 ± 0.332
0.845HisSer: 0.845 ± 0.18
1.521HisThr: 1.521 ± 0.266
1.141HisVal: 1.141 ± 0.306
0.169HisTrp: 0.169 ± 0.064
0.507HisTyr: 0.507 ± 0.176
0.0HisXaa: 0.0 ± 0.0
Ile
5.196IleAla: 5.196 ± 0.509
0.465IleCys: 0.465 ± 0.135
3.717IleAsp: 3.717 ± 0.441
4.097IleGlu: 4.097 ± 0.398
1.943IlePhe: 1.943 ± 0.247
4.266IleGly: 4.266 ± 0.538
0.845IleHis: 0.845 ± 0.208
2.323IleIle: 2.323 ± 0.399
3.21IleLys: 3.21 ± 0.367
3.379IleLeu: 3.379 ± 0.433
1.141IleMet: 1.141 ± 0.262
2.45IleAsn: 2.45 ± 0.3
2.281IlePro: 2.281 ± 0.349
2.492IleGln: 2.492 ± 0.298
2.746IleArg: 2.746 ± 0.359
3.253IleSer: 3.253 ± 0.398
3.253IleThr: 3.253 ± 0.57
3.929IleVal: 3.929 ± 0.443
0.887IleTrp: 0.887 ± 0.224
0.887IleTyr: 0.887 ± 0.185
0.0IleXaa: 0.0 ± 0.0
Lys
6.379LysAla: 6.379 ± 0.766
0.634LysCys: 0.634 ± 0.188
3.126LysAsp: 3.126 ± 0.334
2.535LysGlu: 2.535 ± 0.347
2.197LysPhe: 2.197 ± 0.33
3.379LysGly: 3.379 ± 0.483
0.718LysHis: 0.718 ± 0.166
3.422LysIle: 3.422 ± 0.335
3.379LysLys: 3.379 ± 0.523
3.675LysLeu: 3.675 ± 0.48
2.112LysMet: 2.112 ± 0.254
2.957LysAsn: 2.957 ± 0.324
2.577LysPro: 2.577 ± 0.458
2.281LysGln: 2.281 ± 0.282
4.097LysArg: 4.097 ± 0.605
3.21LysSer: 3.21 ± 0.408
2.83LysThr: 2.83 ± 0.279
3.675LysVal: 3.675 ± 0.389
0.718LysTrp: 0.718 ± 0.158
1.394LysTyr: 1.394 ± 0.263
0.0LysXaa: 0.0 ± 0.0
Leu
6.336LeuAla: 6.336 ± 0.694
1.014LeuCys: 1.014 ± 0.248
5.238LeuAsp: 5.238 ± 0.591
5.703LeuGlu: 5.703 ± 0.61
2.239LeuPhe: 2.239 ± 0.306
5.829LeuGly: 5.829 ± 0.666
1.141LeuHis: 1.141 ± 0.231
4.562LeuIle: 4.562 ± 0.393
3.971LeuLys: 3.971 ± 0.58
5.323LeuLeu: 5.323 ± 0.589
1.985LeuMet: 1.985 ± 0.318
3.422LeuAsn: 3.422 ± 0.374
3.886LeuPro: 3.886 ± 0.411
2.788LeuGln: 2.788 ± 0.547
4.097LeuArg: 4.097 ± 0.434
3.971LeuSer: 3.971 ± 0.406
4.9LeuThr: 4.9 ± 0.476
4.478LeuVal: 4.478 ± 0.455
1.056LeuTrp: 1.056 ± 0.174
1.859LeuTyr: 1.859 ± 0.403
0.0LeuXaa: 0.0 ± 0.0
Met
2.239MetAla: 2.239 ± 0.335
0.338MetCys: 0.338 ± 0.139
1.394MetAsp: 1.394 ± 0.246
1.141MetGlu: 1.141 ± 0.182
0.929MetPhe: 0.929 ± 0.222
1.31MetGly: 1.31 ± 0.29
0.634MetHis: 0.634 ± 0.178
1.521MetIle: 1.521 ± 0.294
1.901MetLys: 1.901 ± 0.271
2.281MetLeu: 2.281 ± 0.343
0.887MetMet: 0.887 ± 0.234
1.225MetAsn: 1.225 ± 0.261
1.943MetPro: 1.943 ± 0.308
1.056MetGln: 1.056 ± 0.285
1.394MetArg: 1.394 ± 0.267
2.366MetSer: 2.366 ± 0.316
1.901MetThr: 1.901 ± 0.221
1.521MetVal: 1.521 ± 0.329
0.296MetTrp: 0.296 ± 0.108
0.634MetTyr: 0.634 ± 0.159
0.0MetXaa: 0.0 ± 0.0
Asn
4.478AsnAla: 4.478 ± 0.85
0.507AsnCys: 0.507 ± 0.186
2.788AsnAsp: 2.788 ± 0.393
2.957AsnGlu: 2.957 ± 0.32
1.436AsnPhe: 1.436 ± 0.285
4.435AsnGly: 4.435 ± 0.529
0.718AsnHis: 0.718 ± 0.209
1.943AsnIle: 1.943 ± 0.246
2.661AsnLys: 2.661 ± 0.352
2.661AsnLeu: 2.661 ± 0.329
0.845AsnMet: 0.845 ± 0.198
2.154AsnAsn: 2.154 ± 0.269
2.577AsnPro: 2.577 ± 0.355
1.605AsnGln: 1.605 ± 0.26
2.197AsnArg: 2.197 ± 0.298
2.366AsnSer: 2.366 ± 0.338
2.957AsnThr: 2.957 ± 0.356
2.239AsnVal: 2.239 ± 0.318
0.845AsnTrp: 0.845 ± 0.175
1.394AsnTyr: 1.394 ± 0.3
0.0AsnXaa: 0.0 ± 0.0
Pro
5.407ProAla: 5.407 ± 0.517
0.38ProCys: 0.38 ± 0.137
3.464ProAsp: 3.464 ± 0.557
4.266ProGlu: 4.266 ± 0.645
1.478ProPhe: 1.478 ± 0.221
3.675ProGly: 3.675 ± 0.43
0.972ProHis: 0.972 ± 0.237
2.154ProIle: 2.154 ± 0.314
2.408ProLys: 2.408 ± 0.387
3.379ProLeu: 3.379 ± 0.355
1.141ProMet: 1.141 ± 0.226
2.07ProAsn: 2.07 ± 0.304
2.281ProPro: 2.281 ± 0.499
1.774ProGln: 1.774 ± 0.268
2.07ProArg: 2.07 ± 0.377
3.591ProSer: 3.591 ± 0.538
2.45ProThr: 2.45 ± 0.376
3.971ProVal: 3.971 ± 0.406
1.394ProTrp: 1.394 ± 0.251
1.69ProTyr: 1.69 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
3.76GlnAla: 3.76 ± 0.599
0.296GlnCys: 0.296 ± 0.133
1.943GlnAsp: 1.943 ± 0.263
2.07GlnGlu: 2.07 ± 0.328
1.859GlnPhe: 1.859 ± 0.278
2.45GlnGly: 2.45 ± 0.314
0.634GlnHis: 0.634 ± 0.171
2.535GlnIle: 2.535 ± 0.34
1.69GlnLys: 1.69 ± 0.264
3.041GlnLeu: 3.041 ± 0.334
0.718GlnMet: 0.718 ± 0.189
1.31GlnAsn: 1.31 ± 0.212
1.478GlnPro: 1.478 ± 0.279
1.69GlnGln: 1.69 ± 0.296
2.07GlnArg: 2.07 ± 0.295
2.112GlnSer: 2.112 ± 0.237
2.239GlnThr: 2.239 ± 0.408
2.661GlnVal: 2.661 ± 0.34
0.591GlnTrp: 0.591 ± 0.175
1.225GlnTyr: 1.225 ± 0.207
0.0GlnXaa: 0.0 ± 0.0
Arg
5.323ArgAla: 5.323 ± 0.477
0.929ArgCys: 0.929 ± 0.253
3.084ArgAsp: 3.084 ± 0.395
4.731ArgGlu: 4.731 ± 0.587
1.816ArgPhe: 1.816 ± 0.26
4.097ArgGly: 4.097 ± 0.599
1.014ArgHis: 1.014 ± 0.203
2.999ArgIle: 2.999 ± 0.453
2.999ArgLys: 2.999 ± 0.417
4.478ArgLeu: 4.478 ± 0.387
1.774ArgMet: 1.774 ± 0.308
2.45ArgAsn: 2.45 ± 0.35
3.21ArgPro: 3.21 ± 0.468
1.901ArgGln: 1.901 ± 0.307
3.633ArgArg: 3.633 ± 0.429
2.746ArgSer: 2.746 ± 0.328
2.492ArgThr: 2.492 ± 0.286
4.055ArgVal: 4.055 ± 0.482
0.718ArgTrp: 0.718 ± 0.157
2.408ArgTyr: 2.408 ± 0.366
0.0ArgXaa: 0.0 ± 0.0
Ser
4.985SerAla: 4.985 ± 0.469
0.591SerCys: 0.591 ± 0.17
3.379SerAsp: 3.379 ± 0.402
3.591SerGlu: 3.591 ± 0.478
1.69SerPhe: 1.69 ± 0.318
6.759SerGly: 6.759 ± 0.609
0.803SerHis: 0.803 ± 0.207
3.253SerIle: 3.253 ± 0.33
3.168SerLys: 3.168 ± 0.347
4.858SerLeu: 4.858 ± 0.603
2.028SerMet: 2.028 ± 0.385
2.45SerAsn: 2.45 ± 0.324
3.084SerPro: 3.084 ± 0.293
2.197SerGln: 2.197 ± 0.292
3.295SerArg: 3.295 ± 0.383
3.76SerSer: 3.76 ± 0.396
4.055SerThr: 4.055 ± 0.376
4.309SerVal: 4.309 ± 0.411
1.056SerTrp: 1.056 ± 0.203
2.323SerTyr: 2.323 ± 0.293
0.0SerXaa: 0.0 ± 0.0
Thr
5.196ThrAla: 5.196 ± 0.637
0.887ThrCys: 0.887 ± 0.225
3.506ThrAsp: 3.506 ± 0.451
3.717ThrGlu: 3.717 ± 0.391
2.028ThrPhe: 2.028 ± 0.261
5.576ThrGly: 5.576 ± 0.495
0.845ThrHis: 0.845 ± 0.195
2.788ThrIle: 2.788 ± 0.344
3.084ThrLys: 3.084 ± 0.379
4.055ThrLeu: 4.055 ± 0.426
1.225ThrMet: 1.225 ± 0.285
2.619ThrAsn: 2.619 ± 0.325
3.295ThrPro: 3.295 ± 0.762
2.239ThrGln: 2.239 ± 0.296
2.957ThrArg: 2.957 ± 0.363
4.013ThrSer: 4.013 ± 0.607
3.548ThrThr: 3.548 ± 0.529
5.407ThrVal: 5.407 ± 0.615
1.225ThrTrp: 1.225 ± 0.208
1.141ThrTyr: 1.141 ± 0.226
0.0ThrXaa: 0.0 ± 0.0
Val
6.294ValAla: 6.294 ± 0.661
0.507ValCys: 0.507 ± 0.135
4.604ValAsp: 4.604 ± 0.691
4.858ValGlu: 4.858 ± 0.479
2.492ValPhe: 2.492 ± 0.268
4.647ValGly: 4.647 ± 0.43
1.225ValHis: 1.225 ± 0.34
4.14ValIle: 4.14 ± 0.476
4.266ValLys: 4.266 ± 0.416
4.478ValLeu: 4.478 ± 0.394
1.774ValMet: 1.774 ± 0.232
3.21ValAsn: 3.21 ± 0.334
3.548ValPro: 3.548 ± 0.351
2.45ValGln: 2.45 ± 0.327
4.224ValArg: 4.224 ± 0.461
4.816ValSer: 4.816 ± 0.552
4.224ValThr: 4.224 ± 0.437
4.647ValVal: 4.647 ± 0.512
0.929ValTrp: 0.929 ± 0.169
1.985ValTyr: 1.985 ± 0.308
0.0ValXaa: 0.0 ± 0.0
Trp
1.521TrpAla: 1.521 ± 0.348
0.296TrpCys: 0.296 ± 0.102
1.225TrpAsp: 1.225 ± 0.261
0.972TrpGlu: 0.972 ± 0.17
0.676TrpPhe: 0.676 ± 0.235
1.014TrpGly: 1.014 ± 0.203
0.296TrpHis: 0.296 ± 0.126
0.803TrpIle: 0.803 ± 0.156
0.634TrpLys: 0.634 ± 0.166
1.267TrpLeu: 1.267 ± 0.213
0.465TrpMet: 0.465 ± 0.156
0.76TrpAsn: 0.76 ± 0.155
0.465TrpPro: 0.465 ± 0.177
0.591TrpGln: 0.591 ± 0.145
1.352TrpArg: 1.352 ± 0.222
1.014TrpSer: 1.014 ± 0.173
1.183TrpThr: 1.183 ± 0.25
0.845TrpVal: 0.845 ± 0.145
0.253TrpTrp: 0.253 ± 0.094
0.591TrpTyr: 0.591 ± 0.178
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.788TyrAla: 2.788 ± 0.348
0.507TyrCys: 0.507 ± 0.154
1.647TyrAsp: 1.647 ± 0.293
2.323TyrGlu: 2.323 ± 0.394
0.803TyrPhe: 0.803 ± 0.2
2.492TyrGly: 2.492 ± 0.327
0.549TyrHis: 0.549 ± 0.163
1.563TyrIle: 1.563 ± 0.304
1.267TyrLys: 1.267 ± 0.271
2.112TyrLeu: 2.112 ± 0.316
0.887TyrMet: 0.887 ± 0.194
1.436TyrAsn: 1.436 ± 0.241
1.014TyrPro: 1.014 ± 0.224
1.183TyrGln: 1.183 ± 0.249
1.901TyrArg: 1.901 ± 0.331
1.141TyrSer: 1.141 ± 0.212
1.943TyrThr: 1.943 ± 0.375
2.619TyrVal: 2.619 ± 0.404
0.549TyrTrp: 0.549 ± 0.2
1.141TyrTyr: 1.141 ± 0.226
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 109 proteins (23674 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski