Amino acid dipeptide frequency for Gordonia phage Kiko

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
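The calculation described above can be sketched in Python as follows. This is only an illustration under stated assumptions: overlapping dipeptides are counted within each protein (never across protein boundaries), frequencies are normalized by the total dipeptide count, and any residue outside the 20 standard one-letter codes is mapped to X (Xaa). The exact counting rules used by Proteome-pI are not given on this page, and the names STANDARD, dipeptide_per_mille and classify are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, kept inside one protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freqs, expected=1000.0 / 400):
    """Label each dipeptide against the uniform expectation (2.5 per mille)."""
    labels = {}
    for dp, value in freqs.items():
        if value > expected:
            labels[dp] = "over"      # red in the original table
        elif value < expected:
            labels[dp] = "under"     # blue in the original table
        else:
            labels[dp] = "expected"
    return labels
```

With 441 possible dipeptides (Xaa included), the same function can be called with expected=1000.0 / 441.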

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
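As a small worked example of the unit conversion, the Python snippet below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the AlaAla value is taken from the table on this page.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of the observed frequency to the uniform expectation 1/n."""
    return per_mille_to_fraction(value_per_mille) * n_dipeptides


ala_ala = 18.034  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))  # about 0.018, i.e. about 1.8%
print(fold_over_uniform(ala_ala))      # about 7.2 times the uniform expectation
```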

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
First residue (rows); each entry is the dipeptide frequency in ‰ ± error:

Ala
AlaAla: 18.034 ± 1.725
AlaCys: 1.049 ± 0.272
AlaAsp: 8.318 ± 1.143
AlaGlu: 7.689 ± 1.166
AlaPhe: 3.006 ± 0.506
AlaGly: 10.974 ± 1.298
AlaHis: 2.237 ± 0.37
AlaIle: 5.382 ± 0.587
AlaLys: 5.592 ± 1.025
AlaLeu: 11.044 ± 0.97
AlaMet: 3.845 ± 0.591
AlaAsn: 2.377 ± 0.419
AlaPro: 5.522 ± 0.578
AlaGln: 4.753 ± 0.526
AlaArg: 8.108 ± 0.857
AlaSer: 5.243 ± 0.558
AlaThr: 7.06 ± 0.712
AlaVal: 8.877 ± 1.249
AlaTrp: 2.097 ± 0.3
AlaTyr: 1.887 ± 0.267
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.699 ± 0.254
CysCys: 0.21 ± 0.114
CysAsp: 1.049 ± 0.304
CysGlu: 0.629 ± 0.23
CysPhe: 0.28 ± 0.155
CysGly: 1.118 ± 0.378
CysHis: 0.28 ± 0.154
CysIle: 0.35 ± 0.152
CysLys: 0.28 ± 0.123
CysLeu: 0.629 ± 0.183
CysMet: 0.07 ± 0.072
CysAsn: 0.35 ± 0.122
CysPro: 0.559 ± 0.22
CysGln: 0.419 ± 0.171
CysArg: 1.118 ± 0.28
CysSer: 0.559 ± 0.186
CysThr: 0.28 ± 0.143
CysVal: 0.559 ± 0.232
CysTrp: 0.35 ± 0.134
CysTyr: 0.14 ± 0.097
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.13 ± 0.622
AspCys: 0.979 ± 0.315
AspAsp: 6.361 ± 0.88
AspGlu: 4.963 ± 0.718
AspPhe: 2.516 ± 0.375
AspGly: 6.78 ± 0.65
AspHis: 1.748 ± 0.443
AspIle: 2.377 ± 0.427
AspLys: 1.957 ± 0.324
AspLeu: 6.431 ± 0.922
AspMet: 0.979 ± 0.322
AspAsn: 1.188 ± 0.285
AspPro: 4.823 ± 0.714
AspGln: 2.027 ± 0.325
AspArg: 5.243 ± 0.689
AspSer: 2.936 ± 0.512
AspThr: 3.914 ± 0.556
AspVal: 3.775 ± 0.654
AspTrp: 1.049 ± 0.281
AspTyr: 1.468 ± 0.328
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.641 ± 0.743
GluCys: 0.699 ± 0.302
GluAsp: 3.635 ± 0.556
GluGlu: 1.678 ± 0.474
GluPhe: 1.957 ± 0.399
GluGly: 3.355 ± 0.417
GluHis: 1.258 ± 0.28
GluIle: 2.866 ± 0.476
GluLys: 1.608 ± 0.327
GluLeu: 5.033 ± 0.66
GluMet: 1.118 ± 0.285
GluAsn: 1.748 ± 0.34
GluPro: 2.796 ± 0.568
GluGln: 2.167 ± 0.462
GluArg: 4.124 ± 0.636
GluSer: 3.775 ± 0.606
GluThr: 3.076 ± 0.421
GluVal: 4.054 ± 0.553
GluTrp: 0.769 ± 0.271
GluTyr: 2.097 ± 0.292
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.495 ± 0.585
PheCys: 0.07 ± 0.08
PheAsp: 3.146 ± 0.503
PheGlu: 2.097 ± 0.421
PhePhe: 0.769 ± 0.189
PheGly: 2.377 ± 0.347
PheHis: 0.699 ± 0.217
PheIle: 1.328 ± 0.322
PheLys: 0.559 ± 0.228
PheLeu: 2.656 ± 0.442
PheMet: 0.35 ± 0.136
PheAsn: 0.769 ± 0.274
PhePro: 1.118 ± 0.313
PheGln: 0.769 ± 0.213
PheArg: 1.608 ± 0.395
PheSer: 1.328 ± 0.32
PheThr: 1.748 ± 0.385
PheVal: 2.377 ± 0.373
PheTrp: 0.839 ± 0.237
PheTyr: 0.909 ± 0.269
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 8.039 ± 1.238
GlyCys: 0.419 ± 0.185
GlyAsp: 5.592 ± 0.874
GlyGlu: 4.823 ± 0.568
GlyPhe: 2.936 ± 0.731
GlyGly: 7.759 ± 1.266
GlyHis: 2.237 ± 0.392
GlyIle: 3.775 ± 0.671
GlyLys: 2.656 ± 0.361
GlyLeu: 6.361 ± 0.905
GlyMet: 1.538 ± 0.276
GlyAsn: 2.656 ± 0.372
GlyPro: 4.474 ± 0.587
GlyGln: 3.006 ± 0.419
GlyArg: 6.501 ± 0.688
GlySer: 4.474 ± 0.644
GlyThr: 6.011 ± 0.777
GlyVal: 6.99 ± 0.609
GlyTrp: 1.468 ± 0.319
GlyTyr: 2.656 ± 0.463
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.796 ± 0.682
HisCys: 0.14 ± 0.084
HisAsp: 1.328 ± 0.277
HisGlu: 0.559 ± 0.224
HisPhe: 0.489 ± 0.237
HisGly: 1.328 ± 0.339
HisHis: 0.629 ± 0.207
HisIle: 0.769 ± 0.224
HisLys: 0.14 ± 0.097
HisLeu: 1.957 ± 0.449
HisMet: 0.14 ± 0.087
HisAsn: 0.489 ± 0.158
HisPro: 1.957 ± 0.35
HisGln: 0.699 ± 0.194
HisArg: 1.538 ± 0.332
HisSer: 1.398 ± 0.311
HisThr: 1.328 ± 0.297
HisVal: 0.909 ± 0.233
HisTrp: 0.419 ± 0.152
HisTyr: 0.699 ± 0.231
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.872 ± 0.807
IleCys: 0.21 ± 0.117
IleAsp: 2.726 ± 0.506
IleGlu: 3.914 ± 0.65
IlePhe: 1.468 ± 0.485
IleGly: 4.963 ± 0.829
IleHis: 0.909 ± 0.237
IleIle: 1.468 ± 0.307
IleLys: 1.538 ± 0.331
IleLeu: 2.656 ± 0.397
IleMet: 0.419 ± 0.205
IleAsn: 1.608 ± 0.269
IlePro: 1.957 ± 0.383
IleGln: 1.258 ± 0.235
IleArg: 3.285 ± 0.555
IleSer: 2.866 ± 0.425
IleThr: 3.215 ± 0.447
IleVal: 3.146 ± 0.492
IleTrp: 0.979 ± 0.35
IleTyr: 1.468 ± 0.353
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.845 ± 0.569
LysCys: 0.35 ± 0.155
LysAsp: 1.748 ± 0.456
LysGlu: 0.979 ± 0.249
LysPhe: 0.629 ± 0.19
LysGly: 1.957 ± 0.414
LysHis: 0.629 ± 0.248
LysIle: 1.188 ± 0.268
LysLys: 1.678 ± 0.336
LysLeu: 2.586 ± 0.416
LysMet: 0.559 ± 0.218
LysAsn: 0.699 ± 0.182
LysPro: 2.027 ± 0.355
LysGln: 1.188 ± 0.337
LysArg: 2.377 ± 0.489
LysSer: 2.027 ± 0.412
LysThr: 1.748 ± 0.337
LysVal: 2.586 ± 0.436
LysTrp: 0.629 ± 0.216
LysTyr: 0.559 ± 0.156
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 10.555 ± 1.081
LeuCys: 0.629 ± 0.236
LeuAsp: 4.823 ± 0.692
LeuGlu: 4.544 ± 0.646
LeuPhe: 2.447 ± 0.545
LeuGly: 6.151 ± 0.829
LeuHis: 1.328 ± 0.242
LeuIle: 3.635 ± 0.527
LeuLys: 1.748 ± 0.294
LeuLeu: 6.361 ± 0.711
LeuMet: 1.748 ± 0.318
LeuAsn: 2.237 ± 0.365
LeuPro: 4.613 ± 0.514
LeuGln: 2.586 ± 0.395
LeuArg: 6.221 ± 0.652
LeuSer: 4.753 ± 0.621
LeuThr: 6.151 ± 0.86
LeuVal: 7.2 ± 0.717
LeuTrp: 1.398 ± 0.338
LeuTyr: 1.538 ± 0.28
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.796 ± 0.565
MetCys: 0.14 ± 0.106
MetAsp: 0.629 ± 0.227
MetGlu: 0.559 ± 0.194
MetPhe: 0.35 ± 0.138
MetGly: 1.328 ± 0.382
MetHis: 0.14 ± 0.095
MetIle: 0.699 ± 0.205
MetLys: 0.559 ± 0.157
MetLeu: 1.748 ± 0.407
MetMet: 0.629 ± 0.177
MetAsn: 0.419 ± 0.146
MetPro: 1.748 ± 0.316
MetGln: 0.489 ± 0.22
MetArg: 1.538 ± 0.343
MetSer: 2.377 ± 0.464
MetThr: 2.516 ± 0.395
MetVal: 1.608 ± 0.301
MetTrp: 0.559 ± 0.199
MetTyr: 0.21 ± 0.119
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.215 ± 0.764
AsnCys: 0.489 ± 0.207
AsnAsp: 1.608 ± 0.339
AsnGlu: 0.909 ± 0.233
AsnPhe: 0.629 ± 0.212
AsnGly: 2.796 ± 0.457
AsnHis: 0.629 ± 0.186
AsnIle: 1.188 ± 0.382
AsnLys: 0.699 ± 0.225
AsnLeu: 1.957 ± 0.359
AsnMet: 0.35 ± 0.132
AsnAsn: 1.049 ± 0.285
AsnPro: 2.516 ± 0.373
AsnGln: 0.909 ± 0.238
AsnArg: 1.678 ± 0.371
AsnSer: 1.258 ± 0.271
AsnThr: 1.608 ± 0.281
AsnVal: 2.027 ± 0.526
AsnTrp: 0.699 ± 0.178
AsnTyr: 0.419 ± 0.152
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.689 ± 0.739
ProCys: 0.839 ± 0.275
ProAsp: 5.173 ± 0.552
ProGlu: 3.914 ± 0.727
ProPhe: 1.468 ± 0.316
ProGly: 5.312 ± 0.703
ProHis: 0.839 ± 0.292
ProIle: 2.516 ± 0.42
ProLys: 1.608 ± 0.302
ProLeu: 3.425 ± 0.507
ProMet: 1.049 ± 0.207
ProAsn: 1.538 ± 0.259
ProPro: 3.495 ± 0.848
ProGln: 2.027 ± 0.425
ProArg: 4.613 ± 0.704
ProSer: 2.936 ± 0.645
ProThr: 4.334 ± 0.58
ProVal: 4.823 ± 0.538
ProTrp: 1.398 ± 0.307
ProTyr: 0.979 ± 0.237
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.124 ± 0.539
GlnCys: 0.21 ± 0.115
GlnAsp: 1.468 ± 0.308
GlnGlu: 1.957 ± 0.337
GlnPhe: 1.188 ± 0.31
GlnGly: 2.936 ± 0.578
GlnHis: 0.769 ± 0.262
GlnIle: 2.027 ± 0.561
GlnLys: 0.629 ± 0.194
GlnLeu: 3.355 ± 0.526
GlnMet: 1.188 ± 0.231
GlnAsn: 0.839 ± 0.177
GlnPro: 1.748 ± 0.367
GlnGln: 1.258 ± 0.321
GlnArg: 3.355 ± 0.535
GlnSer: 1.328 ± 0.287
GlnThr: 1.608 ± 0.284
GlnVal: 2.307 ± 0.428
GlnTrp: 1.328 ± 0.347
GlnTyr: 0.699 ± 0.192
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 9.437 ± 1.0
ArgCys: 1.049 ± 0.281
ArgAsp: 5.243 ± 0.647
ArgGlu: 3.215 ± 0.444
ArgPhe: 1.608 ± 0.334
ArgGly: 5.872 ± 0.63
ArgHis: 1.817 ± 0.368
ArgIle: 3.775 ± 0.525
ArgLys: 2.447 ± 0.424
ArgLeu: 6.151 ± 0.541
ArgMet: 2.097 ± 0.444
ArgAsn: 2.796 ± 0.664
ArgPro: 3.495 ± 0.742
ArgGln: 3.006 ± 0.469
ArgArg: 7.969 ± 1.096
ArgSer: 3.775 ± 0.585
ArgThr: 4.264 ± 0.602
ArgVal: 5.103 ± 0.539
ArgTrp: 1.608 ± 0.35
ArgTyr: 1.118 ± 0.319
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 8.039 ± 0.731
SerCys: 0.28 ± 0.136
SerAsp: 3.076 ± 0.464
SerGlu: 2.447 ± 0.4
SerPhe: 1.468 ± 0.38
SerGly: 5.802 ± 0.784
SerHis: 0.909 ± 0.27
SerIle: 3.565 ± 0.618
SerLys: 1.049 ± 0.229
SerLeu: 3.146 ± 0.465
SerMet: 1.118 ± 0.34
SerAsn: 1.258 ± 0.259
SerPro: 3.845 ± 0.494
SerGln: 1.887 ± 0.325
SerArg: 3.635 ± 0.483
SerSer: 3.705 ± 0.513
SerThr: 3.495 ± 0.445
SerVal: 4.264 ± 0.452
SerTrp: 0.629 ± 0.224
SerTyr: 0.979 ± 0.266
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.619 ± 0.834
ThrCys: 0.699 ± 0.215
ThrAsp: 4.404 ± 0.552
ThrGlu: 3.355 ± 0.541
ThrPhe: 2.237 ± 0.365
ThrGly: 5.522 ± 0.578
ThrHis: 0.839 ± 0.233
ThrIle: 3.006 ± 0.531
ThrLys: 1.608 ± 0.278
ThrLeu: 5.452 ± 0.604
ThrMet: 1.118 ± 0.33
ThrAsn: 0.909 ± 0.243
ThrPro: 5.802 ± 0.536
ThrGln: 1.887 ± 0.394
ThrArg: 3.775 ± 0.649
ThrSer: 3.425 ± 0.586
ThrThr: 3.775 ± 0.541
ThrVal: 5.872 ± 0.635
ThrTrp: 1.538 ± 0.307
ThrTyr: 1.608 ± 0.351
ThrXaa: 0.0 ± 0.0

Val
ValAla: 9.157 ± 0.976
ValCys: 0.699 ± 0.23
ValAsp: 5.243 ± 0.686
ValGlu: 3.425 ± 0.622
ValPhe: 1.817 ± 0.401
ValGly: 5.522 ± 0.557
ValHis: 0.909 ± 0.291
ValIle: 3.775 ± 0.652
ValLys: 2.656 ± 0.444
ValLeu: 5.802 ± 0.544
ValMet: 1.538 ± 0.314
ValAsn: 2.097 ± 0.292
ValPro: 5.103 ± 0.599
ValGln: 2.656 ± 0.344
ValArg: 5.452 ± 0.721
ValSer: 3.984 ± 0.595
ValThr: 5.662 ± 0.837
ValVal: 5.662 ± 0.602
ValTrp: 1.817 ± 0.341
ValTyr: 1.748 ± 0.43
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.538 ± 0.323
TrpCys: 0.629 ± 0.213
TrpAsp: 1.887 ± 0.429
TrpGlu: 0.979 ± 0.279
TrpPhe: 0.839 ± 0.247
TrpGly: 0.839 ± 0.181
TrpHis: 0.489 ± 0.195
TrpIle: 1.398 ± 0.331
TrpLys: 0.629 ± 0.215
TrpLeu: 1.957 ± 0.35
TrpMet: 0.769 ± 0.219
TrpAsn: 1.118 ± 0.354
TrpPro: 0.979 ± 0.284
TrpGln: 0.699 ± 0.22
TrpArg: 1.957 ± 0.311
TrpSer: 1.118 ± 0.292
TrpThr: 0.839 ± 0.234
TrpVal: 0.979 ± 0.25
TrpTrp: 0.419 ± 0.194
TrpTyr: 0.419 ± 0.164
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.377 ± 0.363
TyrCys: 0.14 ± 0.104
TyrAsp: 1.188 ± 0.277
TyrGlu: 1.817 ± 0.37
TyrPhe: 0.909 ± 0.288
TyrGly: 1.678 ± 0.377
TyrHis: 0.419 ± 0.163
TyrIle: 0.839 ± 0.29
TyrLys: 0.419 ± 0.18
TyrLeu: 1.957 ± 0.415
TyrMet: 0.35 ± 0.134
TyrAsn: 0.489 ± 0.177
TyrPro: 1.398 ± 0.364
TyrGln: 0.629 ± 0.186
TyrArg: 1.748 ± 0.39
TyrSer: 1.328 ± 0.351
TyrThr: 1.817 ± 0.348
TyrVal: 1.608 ± 0.333
TyrTrp: 0.489 ± 0.186
TyrTyr: 0.559 ± 0.192
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 67 proteins (14307 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
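Protein-level bootstrapping of this kind can be sketched in Python as below. The sketch assumes that whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the reported error is the standard deviation of those estimates. Whether Proteome-pI uses the standard deviation or another spread measure is not stated here, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of all overlapping dipeptides that equal `dipeptide`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation (per mille) over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(1000.0 * dipeptide_fraction(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each dipeptide inside its original sequence context, so the error reflects variation between proteins.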

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski