Amino acid dipeptide frequency for Streptococcus phage IPP34

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
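A minimal sketch of how such a frequency could be computed is given below. It is an illustration, not the database's own code: the function name dipeptide_frequencies is hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and dividing by the total number of dipeptides is an assumption about the normalization.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    # Assumed normalization: divide by the total number of dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only, not taken from the phage proteome.
toy = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy)
```

Here "MK" occurs twice among the six dipeptides of the toy set, so its frequency is one third of the total.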

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
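The expectation and the over/under labelling can be sketched as follows. The helper names are hypothetical, and the rule used here (a plain comparison against the uniform expectation) is an assumption, since the page does not state the exact threshold behind the colouring. The three observed values are taken from the table.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 1000 / 400
# Observed values (per mille) copied from the table.
examples = {"GluLeu": 9.511, "CysCys": 0.082, "AspAsp": 2.542}
labels = {pair: classify(value, expected) for pair, value in examples.items()}
```

With 21 symbols (including Xaa) the same function gives 1000 / 441, roughly 2.27‰.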

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.378 ± 0.769
AlaCys: 0.328 ± 0.181
AlaAsp: 5.821 ± 0.681
AlaGlu: 6.231 ± 0.747
AlaPhe: 2.378 ± 0.675
AlaGly: 5.001 ± 1.133
AlaHis: 1.066 ± 0.303
AlaIle: 4.673 ± 0.954
AlaLys: 5.329 ± 0.737
AlaLeu: 6.559 ± 0.993
AlaMet: 2.132 ± 0.512
AlaAsn: 3.689 ± 0.742
AlaPro: 1.394 ± 0.421
AlaGln: 2.87 ± 0.526
AlaArg: 2.624 ± 0.415
AlaSer: 2.132 ± 0.607
AlaThr: 4.345 ± 0.686
AlaVal: 4.755 ± 0.833
AlaTrp: 1.312 ± 0.43
AlaTyr: 1.968 ± 0.362
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.164 ± 0.095
CysCys: 0.082 ± 0.083
CysAsp: 0.492 ± 0.205
CysGlu: 0.492 ± 0.214
CysPhe: 0.492 ± 0.21
CysGly: 0.246 ± 0.152
CysHis: 0.0 ± 0.0
CysIle: 0.41 ± 0.2
CysLys: 0.82 ± 0.254
CysLeu: 0.246 ± 0.137
CysMet: 0.082 ± 0.079
CysAsn: 0.164 ± 0.118
CysPro: 0.246 ± 0.152
CysGln: 0.328 ± 0.147
CysArg: 0.328 ± 0.134
CysSer: 0.492 ± 0.196
CysThr: 0.246 ± 0.198
CysVal: 0.082 ± 0.098
CysTrp: 0.082 ± 0.083
CysTyr: 0.328 ± 0.156
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.771 ± 0.718
AspCys: 0.492 ± 0.229
AspAsp: 2.542 ± 0.539
AspGlu: 4.345 ± 0.975
AspPhe: 3.689 ± 0.541
AspGly: 4.673 ± 0.613
AspHis: 0.492 ± 0.264
AspIle: 4.673 ± 0.635
AspLys: 4.427 ± 0.664
AspLeu: 5.329 ± 0.752
AspMet: 2.214 ± 0.346
AspAsn: 2.952 ± 0.453
AspPro: 1.968 ± 0.569
AspGln: 1.968 ± 0.431
AspArg: 2.378 ± 0.486
AspSer: 3.853 ± 0.534
AspThr: 4.099 ± 0.538
AspVal: 3.607 ± 0.633
AspTrp: 1.476 ± 0.416
AspTyr: 3.361 ± 0.565
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.231 ± 0.948
GluCys: 0.328 ± 0.148
GluAsp: 4.181 ± 0.699
GluGlu: 5.985 ± 1.064
GluPhe: 3.771 ± 0.657
GluGly: 3.034 ± 0.511
GluHis: 1.148 ± 0.283
GluIle: 5.739 ± 0.593
GluLys: 8.363 ± 1.427
GluLeu: 9.511 ± 1.2
GluMet: 2.132 ± 0.521
GluAsn: 5.001 ± 0.631
GluPro: 1.476 ± 0.315
GluGln: 3.607 ± 0.591
GluArg: 5.001 ± 0.67
GluSer: 4.837 ± 0.7
GluThr: 3.689 ± 0.561
GluVal: 5.001 ± 0.569
GluTrp: 0.738 ± 0.24
GluTyr: 3.116 ± 0.538
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.05 ± 0.54
PheCys: 0.492 ± 0.19
PheAsp: 4.509 ± 0.444
PheGlu: 4.755 ± 0.752
PhePhe: 1.886 ± 0.525
PheGly: 1.968 ± 0.647
PheHis: 0.328 ± 0.207
PheIle: 2.542 ± 0.459
PheLys: 3.607 ± 0.532
PheLeu: 2.624 ± 0.443
PheMet: 1.476 ± 0.396
PheAsn: 3.034 ± 0.531
PhePro: 0.738 ± 0.336
PheGln: 1.722 ± 0.333
PheArg: 1.23 ± 0.3
PheSer: 2.952 ± 0.698
PheThr: 2.542 ± 0.398
PheVal: 1.64 ± 0.372
PheTrp: 0.574 ± 0.288
PheTyr: 1.886 ± 0.368
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.87 ± 0.504
GlyCys: 0.0 ± 0.0
GlyAsp: 3.279 ± 0.648
GlyGlu: 5.083 ± 0.781
GlyPhe: 2.624 ± 0.609
GlyGly: 4.181 ± 1.335
GlyHis: 1.066 ± 0.265
GlyIle: 3.853 ± 0.588
GlyLys: 4.755 ± 0.662
GlyLeu: 5.821 ± 1.134
GlyMet: 1.968 ± 0.409
GlyAsn: 3.116 ± 0.489
GlyPro: 0.82 ± 0.271
GlyGln: 3.525 ± 0.411
GlyArg: 3.279 ± 0.576
GlySer: 3.853 ± 0.801
GlyThr: 2.542 ± 0.48
GlyVal: 4.263 ± 0.76
GlyTrp: 0.902 ± 0.504
GlyTyr: 2.952 ± 0.559
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.984 ± 0.302
HisCys: 0.164 ± 0.115
HisAsp: 0.902 ± 0.301
HisGlu: 1.394 ± 0.343
HisPhe: 0.82 ± 0.3
HisGly: 0.656 ± 0.24
HisHis: 0.41 ± 0.212
HisIle: 0.738 ± 0.267
HisLys: 0.984 ± 0.32
HisLeu: 1.312 ± 0.368
HisMet: 0.41 ± 0.179
HisAsn: 1.23 ± 0.29
HisPro: 0.574 ± 0.233
HisGln: 0.574 ± 0.212
HisArg: 0.984 ± 0.348
HisSer: 1.394 ± 0.46
HisThr: 0.902 ± 0.304
HisVal: 0.984 ± 0.263
HisTrp: 0.164 ± 0.129
HisTyr: 0.41 ± 0.198
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.411 ± 0.78
IleCys: 0.41 ± 0.146
IleAsp: 4.673 ± 0.774
IleGlu: 7.461 ± 0.855
IlePhe: 2.788 ± 0.512
IleGly: 3.443 ± 0.896
IleHis: 0.41 ± 0.199
IleIle: 3.689 ± 0.759
IleLys: 6.887 ± 1.045
IleLeu: 4.263 ± 0.615
IleMet: 1.476 ± 0.475
IleAsn: 3.689 ± 0.54
IlePro: 1.148 ± 0.36
IleGln: 2.378 ± 0.391
IleArg: 2.87 ± 0.546
IleSer: 5.657 ± 0.797
IleThr: 4.427 ± 0.574
IleVal: 3.034 ± 0.618
IleTrp: 0.41 ± 0.225
IleTyr: 2.132 ± 0.493
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.083 ± 0.75
LysCys: 0.41 ± 0.222
LysAsp: 5.575 ± 0.613
LysGlu: 7.133 ± 0.916
LysPhe: 2.624 ± 0.551
LysGly: 4.591 ± 0.749
LysHis: 1.23 ± 0.278
LysIle: 6.641 ± 0.891
LysLys: 7.215 ± 0.961
LysLeu: 7.543 ± 0.733
LysMet: 2.624 ± 0.421
LysAsn: 5.083 ± 0.633
LysPro: 2.624 ± 0.587
LysGln: 3.525 ± 0.554
LysArg: 3.853 ± 0.472
LysSer: 4.755 ± 0.68
LysThr: 5.083 ± 0.467
LysVal: 5.247 ± 0.563
LysTrp: 0.984 ± 0.368
LysTyr: 3.116 ± 0.524
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.051 ± 0.794
LeuCys: 0.574 ± 0.265
LeuAsp: 5.739 ± 0.6
LeuGlu: 7.051 ± 0.905
LeuPhe: 2.378 ± 0.488
LeuGly: 5.657 ± 1.412
LeuHis: 1.312 ± 0.32
LeuIle: 4.755 ± 0.568
LeuLys: 7.379 ± 0.837
LeuLeu: 7.871 ± 1.099
LeuMet: 2.132 ± 0.413
LeuAsn: 3.361 ± 0.502
LeuPro: 2.46 ± 0.506
LeuGln: 3.443 ± 0.551
LeuArg: 4.509 ± 0.691
LeuSer: 6.231 ± 0.92
LeuThr: 5.411 ± 0.626
LeuVal: 3.443 ± 0.474
LeuTrp: 0.574 ± 0.184
LeuTyr: 2.05 ± 0.336
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.46 ± 0.573
MetCys: 0.0 ± 0.0
MetAsp: 1.476 ± 0.289
MetGlu: 2.05 ± 0.396
MetPhe: 1.148 ± 0.291
MetGly: 1.476 ± 0.421
MetHis: 0.41 ± 0.213
MetIle: 2.05 ± 0.446
MetLys: 2.542 ± 0.494
MetLeu: 1.558 ± 0.359
MetMet: 0.41 ± 0.264
MetAsn: 1.312 ± 0.394
MetPro: 0.82 ± 0.26
MetGln: 0.902 ± 0.305
MetArg: 1.394 ± 0.335
MetSer: 1.066 ± 0.299
MetThr: 2.05 ± 0.465
MetVal: 1.312 ± 0.274
MetTrp: 0.164 ± 0.115
MetTyr: 0.82 ± 0.228
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.427 ± 1.018
AsnCys: 0.328 ± 0.136
AsnAsp: 3.034 ± 0.537
AsnGlu: 2.87 ± 0.537
AsnPhe: 2.214 ± 0.542
AsnGly: 4.345 ± 0.626
AsnHis: 1.394 ± 0.362
AsnIle: 3.279 ± 0.482
AsnLys: 4.427 ± 0.59
AsnLeu: 4.509 ± 0.642
AsnMet: 1.066 ± 0.36
AsnAsn: 2.542 ± 0.626
AsnPro: 1.804 ± 0.433
AsnGln: 2.706 ± 0.491
AsnArg: 2.542 ± 0.536
AsnSer: 3.361 ± 0.679
AsnThr: 3.034 ± 0.6
AsnVal: 3.607 ± 0.544
AsnTrp: 0.82 ± 0.178
AsnTyr: 1.558 ± 0.3
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.64 ± 0.399
ProCys: 0.082 ± 0.092
ProAsp: 2.05 ± 0.422
ProGlu: 3.198 ± 0.487
ProPhe: 0.984 ± 0.345
ProGly: 0.984 ± 0.299
ProHis: 0.41 ± 0.184
ProIle: 1.886 ± 0.479
ProLys: 2.132 ± 0.367
ProLeu: 1.066 ± 0.262
ProMet: 0.41 ± 0.178
ProAsn: 1.312 ± 0.422
ProPro: 0.574 ± 0.234
ProGln: 1.23 ± 0.393
ProArg: 1.23 ± 0.278
ProSer: 1.066 ± 0.376
ProThr: 0.902 ± 0.317
ProVal: 1.886 ± 0.354
ProTrp: 0.41 ± 0.159
ProTyr: 1.476 ± 0.419
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.279 ± 0.627
GlnCys: 0.328 ± 0.172
GlnAsp: 1.968 ± 0.371
GlnGlu: 4.673 ± 0.669
GlnPhe: 1.312 ± 0.306
GlnGly: 2.132 ± 0.453
GlnHis: 0.492 ± 0.211
GlnIle: 3.116 ± 0.518
GlnLys: 3.525 ± 0.464
GlnLeu: 3.034 ± 0.42
GlnMet: 0.656 ± 0.221
GlnAsn: 1.968 ± 0.463
GlnPro: 1.066 ± 0.299
GlnGln: 1.722 ± 0.418
GlnArg: 2.296 ± 0.467
GlnSer: 2.214 ± 0.414
GlnThr: 2.788 ± 0.527
GlnVal: 4.181 ± 0.592
GlnTrp: 0.328 ± 0.144
GlnTyr: 0.984 ± 0.309
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.624 ± 0.42
ArgCys: 0.492 ± 0.19
ArgAsp: 1.968 ± 0.479
ArgGlu: 3.116 ± 0.509
ArgPhe: 1.968 ± 0.478
ArgGly: 1.886 ± 0.375
ArgHis: 0.738 ± 0.263
ArgIle: 3.443 ± 0.498
ArgLys: 3.198 ± 0.625
ArgLeu: 5.493 ± 0.637
ArgMet: 1.968 ± 0.43
ArgAsn: 2.952 ± 0.653
ArgPro: 1.066 ± 0.245
ArgGln: 2.296 ± 0.462
ArgArg: 2.46 ± 0.653
ArgSer: 2.132 ± 0.408
ArgThr: 2.952 ± 0.655
ArgVal: 3.116 ± 0.473
ArgTrp: 0.328 ± 0.16
ArgTyr: 2.05 ± 0.47
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.591 ± 1.021
SerCys: 0.246 ± 0.224
SerAsp: 3.935 ± 0.603
SerGlu: 4.591 ± 0.646
SerPhe: 2.378 ± 0.375
SerGly: 4.837 ± 0.662
SerHis: 1.804 ± 0.472
SerIle: 4.345 ± 0.731
SerLys: 4.837 ± 0.637
SerLeu: 4.837 ± 0.773
SerMet: 1.066 ± 0.304
SerAsn: 3.034 ± 0.602
SerPro: 1.394 ± 0.358
SerGln: 2.214 ± 0.423
SerArg: 3.116 ± 0.723
SerSer: 3.525 ± 0.799
SerThr: 4.099 ± 0.477
SerVal: 3.198 ± 0.837
SerTrp: 1.23 ± 0.359
SerTyr: 2.46 ± 0.538
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.919 ± 0.889
ThrCys: 0.164 ± 0.136
ThrAsp: 3.771 ± 0.623
ThrGlu: 4.345 ± 0.543
ThrPhe: 3.116 ± 0.749
ThrGly: 4.345 ± 0.808
ThrHis: 0.902 ± 0.34
ThrIle: 4.181 ± 0.669
ThrLys: 4.017 ± 0.691
ThrLeu: 4.509 ± 0.65
ThrMet: 1.066 ± 0.288
ThrAsn: 3.361 ± 0.429
ThrPro: 1.148 ± 0.354
ThrGln: 2.87 ± 0.593
ThrArg: 1.722 ± 0.365
ThrSer: 3.853 ± 0.552
ThrThr: 4.755 ± 0.822
ThrVal: 4.755 ± 0.795
ThrTrp: 1.148 ± 0.301
ThrTyr: 2.05 ± 0.548
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.345 ± 0.534
ValCys: 0.328 ± 0.2
ValAsp: 3.689 ± 0.491
ValGlu: 5.001 ± 0.66
ValPhe: 2.542 ± 0.476
ValGly: 4.427 ± 0.782
ValHis: 1.148 ± 0.399
ValIle: 3.116 ± 0.506
ValLys: 5.575 ± 0.663
ValLeu: 3.853 ± 0.651
ValMet: 1.066 ± 0.321
ValAsn: 3.935 ± 0.759
ValPro: 1.968 ± 0.313
ValGln: 1.558 ± 0.453
ValArg: 1.968 ± 0.379
ValSer: 5.493 ± 0.712
ValThr: 4.673 ± 0.604
ValVal: 4.427 ± 0.685
ValTrp: 0.656 ± 0.223
ValTyr: 2.378 ± 0.463
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.984 ± 0.388
TrpCys: 0.164 ± 0.101
TrpAsp: 0.82 ± 0.334
TrpGlu: 0.656 ± 0.261
TrpPhe: 1.23 ± 0.46
TrpGly: 0.656 ± 0.223
TrpHis: 0.164 ± 0.136
TrpIle: 0.41 ± 0.149
TrpLys: 1.312 ± 0.402
TrpLeu: 0.574 ± 0.308
TrpMet: 0.492 ± 0.177
TrpAsn: 0.902 ± 0.335
TrpPro: 0.164 ± 0.102
TrpGln: 0.656 ± 0.281
TrpArg: 0.82 ± 0.315
TrpSer: 0.246 ± 0.125
TrpThr: 0.574 ± 0.286
TrpVal: 1.23 ± 0.298
TrpTrp: 0.246 ± 0.127
TrpTyr: 0.738 ± 0.585
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.722 ± 0.329
TyrCys: 0.41 ± 0.165
TyrAsp: 1.886 ± 0.382
TyrGlu: 2.46 ± 0.532
TyrPhe: 2.214 ± 0.494
TyrGly: 2.214 ± 0.398
TyrHis: 1.066 ± 0.282
TyrIle: 3.198 ± 0.575
TyrLys: 3.607 ± 0.584
TyrLeu: 2.87 ± 0.62
TyrMet: 0.492 ± 0.251
TyrAsn: 1.23 ± 0.3
TyrPro: 1.476 ± 0.462
TyrGln: 1.968 ± 0.511
TyrArg: 1.64 ± 0.352
TyrSer: 2.624 ± 0.592
TyrThr: 1.886 ± 0.392
TyrVal: 2.378 ± 0.534
TyrTrp: 0.492 ± 0.26
TyrTyr: 1.476 ± 0.511
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (12198 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
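A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the database's procedure: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread is reported as a sample standard deviation. The function names, the one-letter toy sequences, the normalization and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not taken from the phage proteome.
toy = ["MKKLA", "MKA", "GAKKE"]
err = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.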

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski