Amino acid dipeptide frequency for Vibrio phage CKB-S1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
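As an illustration of the calculation described above, here is a minimal Python sketch that counts dipeptides and reports them in per mille. It assumes overlapping residue pairs counted within each protein (never across protein boundaries), one-letter amino acid codes, and normalization by the total number of pairs; the page does not state the exact procedure, and the function name `dipeptide_permille` is hypothetical. The assumption is at least consistent with the table: 15186 residues in 58 proteins give 15128 within-protein pairs, and 1000/15128 ≈ 0.066‰, the smallest non-zero value listed.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption (not confirmed by the source): overlapping pairs are counted
    inside each protein and divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Pairs at positions (0,1), (1,2), ...; no pair spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage data: pairs are MA, AA, AL and AA, AG.
freqs = dipeptide_permille(["MAAL", "AAG"])
overrepresented = {p for p, f in freqs.items() if f > EXPECTED_PERMILLE}
```

On a real proteome, comparing each value with `EXPECTED_PERMILLE` reproduces the over/under split that the text describes; with a five-pair toy input every observed dipeptide trivially exceeds 2.5‰.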

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 13.5 ± 1.362 | 0.79 ± 0.198 | 5.861 ± 0.695 | 5.663 ± 0.596 | 3.424 ± 0.395 | 9.351 ± 1.068 | 1.251 ± 0.318 | 5.927 ± 0.677 | 3.359 ± 0.677 | 8.824 ± 0.815 | 2.634 ± 0.384 | 4.478 ± 0.526 | 4.742 ± 0.505 | 4.742 ± 0.514 | 5.795 ± 0.565 | 5.663 ± 0.561 | 7.705 ± 0.815 | 8.627 ± 0.732 | 1.844 ± 0.412 | 2.239 ± 0.435 | 0.0 ± 0.0 |
| Cys | 0.461 ± 0.238 | 0.066 ± 0.058 | 0.461 ± 0.184 | 0.329 ± 0.154 | 0.198 ± 0.117 | 0.79 ± 0.235 | 0.198 ± 0.13 | 0.0 ± 0.0 | 0.395 ± 0.188 | 0.527 ± 0.218 | 0.132 ± 0.098 | 0.263 ± 0.124 | 0.527 ± 0.202 | 0.198 ± 0.108 | 0.724 ± 0.237 | 0.527 ± 0.203 | 0.395 ± 0.139 | 0.461 ± 0.197 | 0.066 ± 0.07 | 0.395 ± 0.151 | 0.0 ± 0.0 |
| Asp | 5.993 ± 0.7 | 0.461 ± 0.185 | 4.215 ± 0.45 | 3.82 ± 0.463 | 1.844 ± 0.289 | 6.059 ± 0.572 | 1.251 ± 0.306 | 2.568 ± 0.418 | 2.305 ± 0.421 | 5.795 ± 0.783 | 1.646 ± 0.263 | 1.91 ± 0.349 | 3.095 ± 0.487 | 3.49 ± 0.601 | 3.556 ± 0.558 | 2.568 ± 0.413 | 3.951 ± 0.517 | 5.334 ± 0.584 | 0.527 ± 0.18 | 2.107 ± 0.376 | 0.0 ± 0.0 |
| Glu | 7.244 ± 0.638 | 0.132 ± 0.092 | 3.095 ± 0.384 | 3.029 ± 0.616 | 2.898 ± 0.395 | 4.215 ± 0.587 | 1.449 ± 0.391 | 2.239 ± 0.339 | 1.646 ± 0.297 | 5.532 ± 0.729 | 1.712 ± 0.295 | 2.371 ± 0.337 | 2.502 ± 0.388 | 3.754 ± 0.514 | 3.688 ± 0.568 | 2.305 ± 0.391 | 3.82 ± 0.501 | 4.478 ± 0.499 | 1.12 ± 0.314 | 1.778 ± 0.353 | 0.0 ± 0.0 |
| Phe | 2.634 ± 0.434 | 0.198 ± 0.11 | 2.107 ± 0.431 | 3.227 ± 0.475 | 1.449 ± 0.297 | 3.754 ± 0.34 | 0.527 ± 0.274 | 1.646 ± 0.391 | 1.251 ± 0.376 | 2.832 ± 0.472 | 0.856 ± 0.255 | 1.844 ± 0.431 | 1.383 ± 0.299 | 1.12 ± 0.242 | 2.437 ± 0.37 | 1.449 ± 0.304 | 2.898 ± 0.639 | 2.305 ± 0.491 | 0.132 ± 0.08 | 0.593 ± 0.21 | 0.0 ± 0.0 |
| Gly | 7.968 ± 0.804 | 0.724 ± 0.324 | 5.466 ± 0.571 | 4.742 ± 0.43 | 2.634 ± 0.416 | 7.112 ± 0.829 | 1.778 ± 0.293 | 3.095 ± 0.432 | 3.095 ± 0.623 | 7.178 ± 0.626 | 1.449 ± 0.392 | 3.49 ± 0.747 | 2.305 ± 0.365 | 5.071 ± 0.52 | 5.203 ± 0.549 | 4.083 ± 0.422 | 5.663 ± 0.645 | 7.178 ± 0.755 | 1.251 ± 0.3 | 2.502 ± 0.342 | 0.0 ± 0.0 |
| His | 1.449 ± 0.34 | 0.263 ± 0.15 | 1.185 ± 0.312 | 0.922 ± 0.36 | 0.659 ± 0.229 | 1.449 ± 0.323 | 0.461 ± 0.195 | 1.317 ± 0.379 | 0.395 ± 0.186 | 1.317 ± 0.285 | 0.659 ± 0.208 | 0.79 ± 0.279 | 1.449 ± 0.339 | 0.329 ± 0.137 | 1.251 ± 0.393 | 0.79 ± 0.27 | 0.988 ± 0.227 | 0.988 ± 0.225 | 0.527 ± 0.228 | 0.724 ± 0.233 | 0.0 ± 0.0 |
| Ile | 5.268 ± 0.602 | 0.263 ± 0.128 | 4.083 ± 0.452 | 3.622 ± 0.538 | 1.054 ± 0.252 | 3.161 ± 0.51 | 0.461 ± 0.167 | 1.712 ± 0.406 | 1.91 ± 0.414 | 3.227 ± 0.562 | 1.251 ± 0.328 | 2.568 ± 0.396 | 1.449 ± 0.264 | 1.778 ± 0.419 | 2.832 ± 0.407 | 2.371 ± 0.426 | 4.083 ± 0.452 | 4.149 ± 0.529 | 0.329 ± 0.128 | 1.251 ± 0.267 | 0.0 ± 0.0 |
| Lys | 4.544 ± 0.776 | 0.132 ± 0.081 | 1.91 ± 0.48 | 1.646 ± 0.327 | 1.185 ± 0.348 | 3.161 ± 0.577 | 1.12 ± 0.319 | 1.844 ± 0.322 | 1.449 ± 0.422 | 3.293 ± 0.587 | 0.659 ± 0.175 | 1.581 ± 0.361 | 1.976 ± 0.455 | 2.173 ± 0.52 | 2.568 ± 0.419 | 1.185 ± 0.328 | 1.976 ± 0.406 | 2.568 ± 0.455 | 0.79 ± 0.219 | 0.988 ± 0.217 | 0.0 ± 0.0 |
| Leu | 10.207 ± 0.833 | 0.856 ± 0.239 | 5.137 ± 0.567 | 5.071 ± 0.467 | 2.041 ± 0.349 | 5.795 ± 0.5 | 1.185 ± 0.283 | 4.017 ± 0.538 | 3.82 ± 0.724 | 5.993 ± 0.665 | 2.173 ± 0.384 | 3.424 ± 0.542 | 4.478 ± 0.585 | 3.293 ± 0.489 | 5.334 ± 0.435 | 4.149 ± 0.458 | 5.4 ± 0.702 | 6.322 ± 0.628 | 0.856 ± 0.23 | 2.107 ± 0.404 | 0.0 ± 0.0 |
| Met | 3.161 ± 0.368 | 0.066 ± 0.056 | 1.449 ± 0.297 | 1.317 ± 0.352 | 1.251 ± 0.267 | 1.449 ± 0.407 | 0.527 ± 0.195 | 1.054 ± 0.297 | 0.988 ± 0.262 | 1.251 ± 0.316 | 0.593 ± 0.18 | 0.593 ± 0.181 | 1.383 ± 0.23 | 1.778 ± 0.762 | 1.515 ± 0.371 | 1.515 ± 0.276 | 2.634 ± 0.374 | 1.449 ± 0.338 | 0.329 ± 0.141 | 0.659 ± 0.234 | 0.0 ± 0.0 |
| Asn | 3.885 ± 0.393 | 0.329 ± 0.147 | 3.029 ± 0.482 | 2.371 ± 0.403 | 1.449 ± 0.239 | 4.346 ± 0.591 | 0.527 ± 0.179 | 2.766 ± 0.412 | 1.317 ± 0.323 | 3.293 ± 0.519 | 1.054 ± 0.216 | 1.778 ± 0.425 | 2.832 ± 0.43 | 1.976 ± 0.354 | 2.107 ± 0.42 | 2.239 ± 0.356 | 3.161 ± 0.507 | 3.754 ± 0.529 | 0.461 ± 0.153 | 0.527 ± 0.168 | 0.0 ± 0.0 |
| Pro | 5.268 ± 0.725 | 0.263 ± 0.11 | 3.095 ± 0.45 | 2.832 ± 0.434 | 2.041 ± 0.314 | 4.61 ± 0.567 | 0.724 ± 0.29 | 1.778 ± 0.402 | 1.581 ± 0.314 | 3.161 ± 0.477 | 1.91 ± 0.7 | 2.173 ± 0.396 | 1.646 ± 0.396 | 2.173 ± 0.517 | 2.041 ± 0.353 | 3.095 ± 0.455 | 3.754 ± 0.524 | 3.161 ± 0.432 | 0.856 ± 0.23 | 1.317 ± 0.287 | 0.0 ± 0.0 |
| Gln | 6.585 ± 0.684 | 0.593 ± 0.232 | 1.91 ± 0.335 | 2.437 ± 0.395 | 1.515 ± 0.245 | 2.963 ± 0.416 | 1.317 ± 0.302 | 1.712 ± 0.352 | 1.383 ± 0.341 | 4.478 ± 0.537 | 1.581 ± 0.357 | 2.568 ± 0.341 | 3.293 ± 0.854 | 5.795 ± 3.093 | 2.7 ± 0.426 | 1.581 ± 0.316 | 3.359 ± 0.293 | 3.161 ± 0.587 | 0.922 ± 0.195 | 1.383 ± 0.295 | 0.0 ± 0.0 |
| Arg | 5.334 ± 0.637 | 0.724 ± 0.246 | 3.754 ± 0.577 | 4.083 ± 0.613 | 2.107 ± 0.315 | 4.017 ± 0.598 | 0.79 ± 0.199 | 3.359 ± 0.574 | 3.293 ± 0.545 | 5.532 ± 0.603 | 1.185 ± 0.318 | 3.161 ± 0.432 | 2.7 ± 0.391 | 2.7 ± 0.461 | 4.742 ± 0.698 | 3.029 ± 0.48 | 3.688 ± 0.487 | 4.017 ± 0.571 | 0.922 ± 0.257 | 1.778 ± 0.352 | 0.0 ± 0.0 |
| Ser | 5.532 ± 0.574 | 0.198 ± 0.142 | 3.161 ± 0.414 | 3.095 ± 0.316 | 2.305 ± 0.478 | 4.149 ± 0.472 | 1.12 ± 0.258 | 2.437 ± 0.418 | 1.646 ± 0.414 | 3.556 ± 0.497 | 0.988 ± 0.2 | 2.7 ± 0.284 | 2.173 ± 0.348 | 1.976 ± 0.348 | 2.634 ± 0.38 | 2.502 ± 0.496 | 3.227 ± 0.51 | 4.017 ± 0.499 | 0.527 ± 0.196 | 1.054 ± 0.341 | 0.0 ± 0.0 |
| Thr | 7.771 ± 0.763 | 0.198 ± 0.113 | 4.215 ± 0.522 | 2.898 ± 0.406 | 2.239 ± 0.467 | 6.322 ± 0.664 | 0.922 ± 0.236 | 3.82 ± 0.537 | 2.568 ± 0.482 | 5.729 ± 0.63 | 1.383 ± 0.272 | 2.502 ± 0.4 | 3.227 ± 0.454 | 3.49 ± 0.498 | 3.885 ± 0.726 | 4.149 ± 0.597 | 4.149 ± 0.628 | 6.585 ± 0.798 | 1.185 ± 0.273 | 2.107 ± 0.372 | 0.0 ± 0.0 |
| Val | 5.795 ± 0.711 | 0.527 ± 0.178 | 6.059 ± 0.55 | 5.268 ± 0.774 | 2.766 ± 0.552 | 5.334 ± 0.501 | 1.383 ± 0.292 | 4.149 ± 0.522 | 2.832 ± 0.505 | 6.52 ± 0.532 | 1.91 ± 0.381 | 3.688 ± 0.426 | 3.951 ± 0.633 | 3.951 ± 0.732 | 5.071 ± 0.652 | 3.359 ± 0.421 | 5.598 ± 0.895 | 5.4 ± 0.611 | 1.185 ± 0.254 | 2.239 ± 0.376 | 0.0 ± 0.0 |
| Trp | 0.988 ± 0.254 | 0.066 ± 0.064 | 0.79 ± 0.23 | 0.79 ± 0.234 | 0.461 ± 0.196 | 1.251 ± 0.283 | 0.329 ± 0.144 | 0.659 ± 0.21 | 0.329 ± 0.18 | 1.778 ± 0.302 | 0.263 ± 0.13 | 0.593 ± 0.213 | 0.856 ± 0.258 | 0.395 ± 0.196 | 1.12 ± 0.271 | 0.79 ± 0.18 | 1.251 ± 0.253 | 1.251 ± 0.317 | 0.329 ± 0.156 | 0.395 ± 0.277 | 0.0 ± 0.0 |
| Tyr | 2.832 ± 0.451 | 0.263 ± 0.137 | 1.515 ± 0.297 | 1.844 ± 0.359 | 1.12 ± 0.293 | 2.766 ± 0.439 | 0.593 ± 0.182 | 0.724 ± 0.209 | 1.185 ± 0.263 | 1.844 ± 0.283 | 0.856 ± 0.182 | 0.724 ± 0.242 | 1.515 ± 0.288 | 1.054 ± 0.244 | 1.712 ± 0.46 | 1.646 ± 0.475 | 1.646 ± 0.345 | 1.778 ± 0.347 | 0.527 ± 0.27 | 1.251 ± 0.318 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 58 proteins (15186 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
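A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The page does not say which spread statistic is reported, and the names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille (overlapping, within-protein pairs)."""
    hits = sum(
        1 for seq in proteins for i in range(len(seq) - 1) if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency under resampling of whole proteins.

    Assumption: the error is the sample standard deviation over replicates.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage data.
toy = ["MAAL", "AAG", "MKKL"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so a single repeat-rich protein shows up as a large error for the affected dipeptide instead of being averaged away.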

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski