Amino acid dipepetide frequency for Streptococcus phage Javan630

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.24AlaAla: 5.24 ± 0.869
0.62AlaCys: 0.62 ± 0.206
4.964AlaAsp: 4.964 ± 0.602
6.205AlaGlu: 6.205 ± 0.669
2.551AlaPhe: 2.551 ± 0.356
4.619AlaGly: 4.619 ± 0.586
0.689AlaHis: 0.689 ± 0.183
5.86AlaIle: 5.86 ± 0.765
4.619AlaLys: 4.619 ± 0.627
7.17AlaLeu: 7.17 ± 1.153
1.861AlaMet: 1.861 ± 0.359
3.171AlaAsn: 3.171 ± 0.376
1.586AlaPro: 1.586 ± 0.353
1.448AlaGln: 1.448 ± 0.296
2.689AlaArg: 2.689 ± 0.437
3.309AlaSer: 3.309 ± 0.462
4.068AlaThr: 4.068 ± 0.641
4.895AlaVal: 4.895 ± 0.626
0.896AlaTrp: 0.896 ± 0.262
3.447AlaTyr: 3.447 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
0.483CysAla: 0.483 ± 0.176
0.345CysCys: 0.345 ± 0.141
0.483CysAsp: 0.483 ± 0.203
1.172CysGlu: 1.172 ± 0.346
0.276CysPhe: 0.276 ± 0.13
0.965CysGly: 0.965 ± 0.294
0.276CysHis: 0.276 ± 0.141
0.896CysIle: 0.896 ± 0.252
0.552CysLys: 0.552 ± 0.168
0.827CysLeu: 0.827 ± 0.232
0.138CysMet: 0.138 ± 0.089
0.345CysAsn: 0.345 ± 0.119
0.345CysPro: 0.345 ± 0.149
0.276CysGln: 0.276 ± 0.125
0.62CysArg: 0.62 ± 0.227
1.034CysSer: 1.034 ± 0.299
0.552CysThr: 0.552 ± 0.221
0.62CysVal: 0.62 ± 0.185
0.138CysTrp: 0.138 ± 0.097
0.689CysTyr: 0.689 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
4.412AspAla: 4.412 ± 0.708
1.103AspCys: 1.103 ± 0.283
4.688AspAsp: 4.688 ± 0.505
5.653AspGlu: 5.653 ± 0.727
2.896AspPhe: 2.896 ± 0.405
5.171AspGly: 5.171 ± 0.583
1.103AspHis: 1.103 ± 0.242
5.033AspIle: 5.033 ± 0.563
5.929AspLys: 5.929 ± 0.693
5.446AspLeu: 5.446 ± 0.532
2.413AspMet: 2.413 ± 0.431
3.033AspAsn: 3.033 ± 0.361
1.586AspPro: 1.586 ± 0.386
0.62AspGln: 0.62 ± 0.216
2.068AspArg: 2.068 ± 0.368
3.723AspSer: 3.723 ± 0.505
2.964AspThr: 2.964 ± 0.489
4.274AspVal: 4.274 ± 0.511
1.31AspTrp: 1.31 ± 0.296
3.171AspTyr: 3.171 ± 0.437
0.0AspXaa: 0.0 ± 0.0
Glu
3.723GluAla: 3.723 ± 0.6
1.103GluCys: 1.103 ± 0.283
4.412GluAsp: 4.412 ± 0.501
7.17GluGlu: 7.17 ± 0.879
2.827GluPhe: 2.827 ± 0.527
4.757GluGly: 4.757 ± 0.391
1.241GluHis: 1.241 ± 0.245
6.343GluIle: 6.343 ± 0.741
6.894GluLys: 6.894 ± 0.722
6.756GluLeu: 6.756 ± 0.684
3.309GluMet: 3.309 ± 0.404
4.137GluAsn: 4.137 ± 0.538
2.206GluPro: 2.206 ± 0.373
3.102GluGln: 3.102 ± 0.537
4.068GluArg: 4.068 ± 0.493
3.93GluSer: 3.93 ± 0.508
3.516GluThr: 3.516 ± 0.456
4.964GluVal: 4.964 ± 0.566
0.827GluTrp: 0.827 ± 0.204
2.758GluTyr: 2.758 ± 0.49
0.0GluXaa: 0.0 ± 0.0
Phe
2.689PheAla: 2.689 ± 0.428
0.689PheCys: 0.689 ± 0.193
2.413PheAsp: 2.413 ± 0.349
2.137PheGlu: 2.137 ± 0.432
1.448PhePhe: 1.448 ± 0.282
2.344PheGly: 2.344 ± 0.367
0.689PheHis: 0.689 ± 0.213
3.171PheIle: 3.171 ± 0.385
2.344PheLys: 2.344 ± 0.424
2.068PheLeu: 2.068 ± 0.364
0.689PheMet: 0.689 ± 0.227
2.068PheAsn: 2.068 ± 0.343
1.379PhePro: 1.379 ± 0.304
0.965PheGln: 0.965 ± 0.203
1.517PheArg: 1.517 ± 0.323
3.102PheSer: 3.102 ± 0.484
2.413PheThr: 2.413 ± 0.32
2.137PheVal: 2.137 ± 0.353
0.62PheTrp: 0.62 ± 0.178
1.172PheTyr: 1.172 ± 0.297
0.0PheXaa: 0.0 ± 0.0
Gly
3.585GlyAla: 3.585 ± 0.527
0.483GlyCys: 0.483 ± 0.182
4.757GlyAsp: 4.757 ± 0.525
4.137GlyGlu: 4.137 ± 0.522
3.585GlyPhe: 3.585 ± 0.404
4.826GlyGly: 4.826 ± 0.609
1.31GlyHis: 1.31 ± 0.329
4.964GlyIle: 4.964 ± 0.766
5.653GlyLys: 5.653 ± 0.547
5.377GlyLeu: 5.377 ± 0.628
1.517GlyMet: 1.517 ± 0.333
3.309GlyAsn: 3.309 ± 0.383
1.103GlyPro: 1.103 ± 0.22
1.655GlyGln: 1.655 ± 0.24
3.378GlyArg: 3.378 ± 0.453
3.585GlySer: 3.585 ± 0.483
4.757GlyThr: 4.757 ± 0.645
4.205GlyVal: 4.205 ± 0.476
1.103GlyTrp: 1.103 ± 0.22
2.758GlyTyr: 2.758 ± 0.413
0.0GlyXaa: 0.0 ± 0.0
His
0.62HisAla: 0.62 ± 0.144
0.207HisCys: 0.207 ± 0.122
1.034HisAsp: 1.034 ± 0.263
0.62HisGlu: 0.62 ± 0.183
1.103HisPhe: 1.103 ± 0.325
1.448HisGly: 1.448 ± 0.328
0.138HisHis: 0.138 ± 0.086
0.896HisIle: 0.896 ± 0.197
1.241HisLys: 1.241 ± 0.284
1.792HisLeu: 1.792 ± 0.33
0.345HisMet: 0.345 ± 0.139
0.483HisAsn: 0.483 ± 0.196
0.965HisPro: 0.965 ± 0.288
0.552HisGln: 0.552 ± 0.197
0.62HisArg: 0.62 ± 0.215
0.62HisSer: 0.62 ± 0.2
0.758HisThr: 0.758 ± 0.253
1.241HisVal: 1.241 ± 0.243
0.207HisTrp: 0.207 ± 0.105
0.758HisTyr: 0.758 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
4.895IleAla: 4.895 ± 0.596
0.758IleCys: 0.758 ± 0.247
6.205IleAsp: 6.205 ± 0.669
5.309IleGlu: 5.309 ± 0.538
1.861IlePhe: 1.861 ± 0.335
3.792IleGly: 3.792 ± 0.688
1.172IleHis: 1.172 ± 0.292
4.55IleIle: 4.55 ± 0.556
4.688IleLys: 4.688 ± 0.692
5.377IleLeu: 5.377 ± 0.48
1.724IleMet: 1.724 ± 0.299
3.93IleAsn: 3.93 ± 0.552
3.171IlePro: 3.171 ± 0.547
2.896IleGln: 2.896 ± 0.613
2.964IleArg: 2.964 ± 0.401
6.205IleSer: 6.205 ± 0.541
3.654IleThr: 3.654 ± 0.484
4.688IleVal: 4.688 ± 0.804
1.103IleTrp: 1.103 ± 0.308
2.964IleTyr: 2.964 ± 0.47
0.0IleXaa: 0.0 ± 0.0
Lys
6.274LysAla: 6.274 ± 0.615
0.758LysCys: 0.758 ± 0.207
5.309LysAsp: 5.309 ± 0.592
7.17LysGlu: 7.17 ± 0.789
1.999LysPhe: 1.999 ± 0.327
4.343LysGly: 4.343 ± 0.401
1.241LysHis: 1.241 ± 0.275
3.654LysIle: 3.654 ± 0.406
5.584LysLys: 5.584 ± 0.864
6.274LysLeu: 6.274 ± 0.662
1.586LysMet: 1.586 ± 0.37
4.137LysAsn: 4.137 ± 0.492
3.309LysPro: 3.309 ± 0.566
2.137LysGln: 2.137 ± 0.354
4.343LysArg: 4.343 ± 0.597
4.619LysSer: 4.619 ± 0.596
3.033LysThr: 3.033 ± 0.555
4.068LysVal: 4.068 ± 0.404
0.758LysTrp: 0.758 ± 0.225
3.516LysTyr: 3.516 ± 0.558
0.0LysXaa: 0.0 ± 0.0
Leu
5.998LeuAla: 5.998 ± 0.554
0.758LeuCys: 0.758 ± 0.233
5.584LeuAsp: 5.584 ± 0.441
6.205LeuGlu: 6.205 ± 0.56
2.827LeuPhe: 2.827 ± 0.438
5.24LeuGly: 5.24 ± 0.55
1.379LeuHis: 1.379 ± 0.353
5.722LeuIle: 5.722 ± 0.735
5.653LeuLys: 5.653 ± 0.611
7.377LeuLeu: 7.377 ± 0.725
2.689LeuMet: 2.689 ± 0.445
3.516LeuAsn: 3.516 ± 0.476
4.343LeuPro: 4.343 ± 0.649
2.551LeuGln: 2.551 ± 0.512
3.999LeuArg: 3.999 ± 0.539
7.308LeuSer: 7.308 ± 0.601
4.481LeuThr: 4.481 ± 0.397
5.309LeuVal: 5.309 ± 0.759
0.965LeuTrp: 0.965 ± 0.234
3.24LeuTyr: 3.24 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
2.689MetAla: 2.689 ± 0.414
0.138MetCys: 0.138 ± 0.077
1.93MetAsp: 1.93 ± 0.392
2.62MetGlu: 2.62 ± 0.393
0.483MetPhe: 0.483 ± 0.222
2.413MetGly: 2.413 ± 0.414
0.069MetHis: 0.069 ± 0.056
1.103MetIle: 1.103 ± 0.278
2.62MetLys: 2.62 ± 0.38
2.275MetLeu: 2.275 ± 0.342
0.896MetMet: 0.896 ± 0.347
1.517MetAsn: 1.517 ± 0.314
1.103MetPro: 1.103 ± 0.286
0.62MetGln: 0.62 ± 0.247
1.103MetArg: 1.103 ± 0.241
1.999MetSer: 1.999 ± 0.433
1.655MetThr: 1.655 ± 0.315
1.655MetVal: 1.655 ± 0.249
0.276MetTrp: 0.276 ± 0.125
0.62MetTyr: 0.62 ± 0.192
0.0MetXaa: 0.0 ± 0.0
Asn
3.999AsnAla: 3.999 ± 0.793
0.207AsnCys: 0.207 ± 0.116
1.999AsnAsp: 1.999 ± 0.43
3.861AsnGlu: 3.861 ± 0.673
1.31AsnPhe: 1.31 ± 0.268
4.619AsnGly: 4.619 ± 0.616
0.896AsnHis: 0.896 ± 0.29
4.55AsnIle: 4.55 ± 0.573
4.205AsnLys: 4.205 ± 0.465
4.412AsnLeu: 4.412 ± 0.627
1.379AsnMet: 1.379 ± 0.317
1.792AsnAsn: 1.792 ± 0.415
2.275AsnPro: 2.275 ± 0.445
1.655AsnGln: 1.655 ± 0.349
1.861AsnArg: 1.861 ± 0.265
2.896AsnSer: 2.896 ± 0.387
2.62AsnThr: 2.62 ± 0.491
2.413AsnVal: 2.413 ± 0.374
0.345AsnTrp: 0.345 ± 0.153
1.724AsnTyr: 1.724 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
2.413ProAla: 2.413 ± 0.332
0.138ProCys: 0.138 ± 0.098
2.275ProAsp: 2.275 ± 0.396
2.689ProGlu: 2.689 ± 0.445
1.448ProPhe: 1.448 ± 0.258
1.655ProGly: 1.655 ± 0.321
0.414ProHis: 0.414 ± 0.18
1.724ProIle: 1.724 ± 0.33
2.413ProLys: 2.413 ± 0.394
2.206ProLeu: 2.206 ± 0.357
0.896ProMet: 0.896 ± 0.237
1.103ProAsn: 1.103 ± 0.284
1.379ProPro: 1.379 ± 0.273
1.724ProGln: 1.724 ± 0.419
1.31ProArg: 1.31 ± 0.286
1.93ProSer: 1.93 ± 0.339
2.827ProThr: 2.827 ± 0.422
2.413ProVal: 2.413 ± 0.311
0.689ProTrp: 0.689 ± 0.193
1.792ProTyr: 1.792 ± 0.343
0.0ProXaa: 0.0 ± 0.0
Gln
1.861GlnAla: 1.861 ± 0.322
0.414GlnCys: 0.414 ± 0.218
1.724GlnAsp: 1.724 ± 0.302
1.792GlnGlu: 1.792 ± 0.378
1.172GlnPhe: 1.172 ± 0.316
1.999GlnGly: 1.999 ± 0.393
0.276GlnHis: 0.276 ± 0.125
3.033GlnIle: 3.033 ± 0.42
2.482GlnLys: 2.482 ± 0.39
2.758GlnLeu: 2.758 ± 0.608
1.379GlnMet: 1.379 ± 0.243
1.861GlnAsn: 1.861 ± 0.446
0.758GlnPro: 0.758 ± 0.222
1.448GlnGln: 1.448 ± 0.306
1.448GlnArg: 1.448 ± 0.246
1.724GlnSer: 1.724 ± 0.391
1.517GlnThr: 1.517 ± 0.33
1.724GlnVal: 1.724 ± 0.331
0.414GlnTrp: 0.414 ± 0.185
0.896GlnTyr: 0.896 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
2.482ArgAla: 2.482 ± 0.328
0.62ArgCys: 0.62 ± 0.199
2.551ArgAsp: 2.551 ± 0.444
3.033ArgGlu: 3.033 ± 0.46
1.93ArgPhe: 1.93 ± 0.341
1.792ArgGly: 1.792 ± 0.363
0.62ArgHis: 0.62 ± 0.198
3.861ArgIle: 3.861 ± 0.558
3.93ArgLys: 3.93 ± 0.499
3.861ArgLeu: 3.861 ± 0.496
1.586ArgMet: 1.586 ± 0.353
2.758ArgAsn: 2.758 ± 0.447
1.172ArgPro: 1.172 ± 0.245
1.655ArgGln: 1.655 ± 0.38
1.792ArgArg: 1.792 ± 0.284
3.033ArgSer: 3.033 ± 0.362
2.275ArgThr: 2.275 ± 0.41
3.033ArgVal: 3.033 ± 0.39
0.965ArgTrp: 0.965 ± 0.308
1.724ArgTyr: 1.724 ± 0.325
0.0ArgXaa: 0.0 ± 0.0
Ser
5.171SerAla: 5.171 ± 0.693
0.689SerCys: 0.689 ± 0.181
3.516SerAsp: 3.516 ± 0.408
4.688SerGlu: 4.688 ± 0.548
2.206SerPhe: 2.206 ± 0.358
4.688SerGly: 4.688 ± 0.556
0.827SerHis: 0.827 ± 0.23
4.688SerIle: 4.688 ± 0.642
4.205SerLys: 4.205 ± 0.633
5.584SerLeu: 5.584 ± 0.694
1.724SerMet: 1.724 ± 0.26
4.068SerAsn: 4.068 ± 0.537
1.655SerPro: 1.655 ± 0.326
2.275SerGln: 2.275 ± 0.35
3.24SerArg: 3.24 ± 0.487
4.619SerSer: 4.619 ± 0.585
3.861SerThr: 3.861 ± 0.499
5.102SerVal: 5.102 ± 0.66
1.103SerTrp: 1.103 ± 0.264
2.551SerTyr: 2.551 ± 0.396
0.0SerXaa: 0.0 ± 0.0
Thr
4.412ThrAla: 4.412 ± 0.464
0.483ThrCys: 0.483 ± 0.172
3.723ThrAsp: 3.723 ± 0.618
4.826ThrGlu: 4.826 ± 0.714
1.999ThrPhe: 1.999 ± 0.387
3.654ThrGly: 3.654 ± 0.457
1.103ThrHis: 1.103 ± 0.269
4.137ThrIle: 4.137 ± 0.597
3.999ThrLys: 3.999 ± 0.519
5.653ThrLeu: 5.653 ± 0.554
0.896ThrMet: 0.896 ± 0.248
2.137ThrAsn: 2.137 ± 0.3
1.586ThrPro: 1.586 ± 0.33
1.172ThrGln: 1.172 ± 0.272
1.724ThrArg: 1.724 ± 0.377
3.999ThrSer: 3.999 ± 0.703
3.309ThrThr: 3.309 ± 0.664
3.792ThrVal: 3.792 ± 0.444
0.689ThrTrp: 0.689 ± 0.169
2.344ThrTyr: 2.344 ± 0.424
0.0ThrXaa: 0.0 ± 0.0
Val
4.481ValAla: 4.481 ± 0.537
0.758ValCys: 0.758 ± 0.21
4.757ValAsp: 4.757 ± 0.659
4.481ValGlu: 4.481 ± 0.569
2.275ValPhe: 2.275 ± 0.489
3.93ValGly: 3.93 ± 0.4
1.241ValHis: 1.241 ± 0.276
4.481ValIle: 4.481 ± 0.503
3.723ValLys: 3.723 ± 0.585
5.998ValLeu: 5.998 ± 0.567
1.448ValMet: 1.448 ± 0.273
2.964ValAsn: 2.964 ± 0.316
1.93ValPro: 1.93 ± 0.326
1.586ValGln: 1.586 ± 0.425
3.102ValArg: 3.102 ± 0.409
4.55ValSer: 4.55 ± 0.536
4.205ValThr: 4.205 ± 0.561
3.309ValVal: 3.309 ± 0.441
0.758ValTrp: 0.758 ± 0.249
2.689ValTyr: 2.689 ± 0.365
0.0ValXaa: 0.0 ± 0.0
Trp
0.896TrpAla: 0.896 ± 0.26
0.276TrpCys: 0.276 ± 0.134
0.896TrpAsp: 0.896 ± 0.242
1.241TrpGlu: 1.241 ± 0.213
0.552TrpPhe: 0.552 ± 0.18
0.414TrpGly: 0.414 ± 0.191
0.276TrpHis: 0.276 ± 0.189
0.827TrpIle: 0.827 ± 0.298
0.965TrpLys: 0.965 ± 0.251
1.379TrpLeu: 1.379 ± 0.313
0.414TrpMet: 0.414 ± 0.171
1.034TrpAsn: 1.034 ± 0.222
0.345TrpPro: 0.345 ± 0.181
0.965TrpGln: 0.965 ± 0.262
0.62TrpArg: 0.62 ± 0.202
0.965TrpSer: 0.965 ± 0.275
0.62TrpThr: 0.62 ± 0.225
0.896TrpVal: 0.896 ± 0.253
0.138TrpTrp: 0.138 ± 0.102
0.62TrpTyr: 0.62 ± 0.19
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.585TyrAla: 3.585 ± 0.375
0.483TyrCys: 0.483 ± 0.175
3.378TyrAsp: 3.378 ± 0.508
3.033TyrGlu: 3.033 ± 0.419
1.31TyrPhe: 1.31 ± 0.303
3.378TyrGly: 3.378 ± 0.549
0.689TyrHis: 0.689 ± 0.198
2.413TyrIle: 2.413 ± 0.39
2.482TyrLys: 2.482 ± 0.479
2.551TyrLeu: 2.551 ± 0.41
0.827TyrMet: 0.827 ± 0.234
1.655TyrAsn: 1.655 ± 0.322
1.241TyrPro: 1.241 ± 0.246
1.379TyrGln: 1.379 ± 0.284
2.206TyrArg: 2.206 ± 0.369
3.309TyrSer: 3.309 ± 0.463
2.551TyrThr: 2.551 ± 0.417
1.93TyrVal: 1.93 ± 0.3
1.034TyrTrp: 1.034 ± 0.339
1.379TyrTyr: 1.379 ± 0.359
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (14506 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski