Amino acid dipeptide frequency for Acyrthosiphon pisum secondary endosymbiont phage 1 (Bacteriophage APSE-1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
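
As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs within each protein and reports them in per mille. It is not the database's own code: the function names, the one-letter alphabet with X standing in for Xaa, the normalization by the total number of counted dipeptides, and the plain comparison against 2.5‰ are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # assumption: X stands in for Xaa

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    # Assumption: normalize by the number of dipeptides counted.
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


def representation(freq_permille):
    """Hypothetical helper: compare a frequency with the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


# Toy input, not real APSE-1 sequences.
toy = ["MKAAL", "AALK"]
freqs = dipeptide_permille(toy)
print(round(freqs["AA"], 3), representation(freqs["AA"]))
```

With this helper, a table value such as AlaAla (7.346‰) would be classified as "over" and AlaCys (0.958‰) as "under". The exact rule the database uses for its red and blue marking is not stated in the text.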

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.346 ± 1.373
AlaCys: 0.958 ± 0.342
AlaAsp: 5.829 ± 0.989
AlaGlu: 6.627 ± 0.954
AlaPhe: 3.034 ± 0.5
AlaGly: 5.509 ± 0.669
AlaHis: 2.076 ± 0.56
AlaIle: 4.711 ± 0.82
AlaLys: 6.468 ± 0.841
AlaLeu: 11.019 ± 0.949
AlaMet: 2.316 ± 0.46
AlaAsn: 4.232 ± 0.74
AlaPro: 3.114 ± 0.537
AlaGln: 4.072 ± 0.622
AlaArg: 3.992 ± 0.557
AlaSer: 5.749 ± 0.763
AlaThr: 4.152 ± 0.511
AlaVal: 3.912 ± 0.586
AlaTrp: 1.517 ± 0.401
AlaTyr: 2.874 ± 0.601
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.878 ± 0.319
CysCys: 0.479 ± 0.174
CysAsp: 0.798 ± 0.333
CysGlu: 0.08 ± 0.071
CysPhe: 0.878 ± 0.34
CysGly: 0.958 ± 0.287
CysHis: 0.319 ± 0.185
CysIle: 0.798 ± 0.291
CysLys: 0.878 ± 0.266
CysLeu: 1.836 ± 0.364
CysMet: 0.319 ± 0.164
CysAsn: 0.399 ± 0.17
CysPro: 0.319 ± 0.162
CysGln: 0.559 ± 0.26
CysArg: 1.038 ± 0.311
CysSer: 0.719 ± 0.244
CysThr: 0.559 ± 0.21
CysVal: 0.479 ± 0.192
CysTrp: 0.08 ± 0.079
CysTyr: 0.319 ± 0.158
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.791 ± 0.715
AspCys: 0.559 ± 0.217
AspAsp: 3.114 ± 0.48
AspGlu: 3.433 ± 0.559
AspPhe: 1.836 ± 0.529
AspGly: 4.072 ± 0.605
AspHis: 0.719 ± 0.225
AspIle: 4.072 ± 0.532
AspLys: 4.072 ± 0.661
AspLeu: 5.19 ± 0.789
AspMet: 1.437 ± 0.416
AspAsn: 3.274 ± 0.487
AspPro: 2.555 ± 0.407
AspGln: 1.198 ± 0.308
AspArg: 2.156 ± 0.378
AspSer: 2.874 ± 0.531
AspThr: 2.954 ± 0.518
AspVal: 3.593 ± 0.72
AspTrp: 0.878 ± 0.219
AspTyr: 1.757 ± 0.375
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.669 ± 0.823
GluCys: 0.319 ± 0.172
GluAsp: 2.236 ± 0.527
GluGlu: 3.194 ± 0.694
GluPhe: 1.996 ± 0.476
GluGly: 2.795 ± 0.488
GluHis: 1.118 ± 0.27
GluIle: 3.833 ± 0.612
GluLys: 4.631 ± 0.619
GluLeu: 5.03 ± 0.577
GluMet: 1.517 ± 0.39
GluAsn: 3.354 ± 0.491
GluPro: 2.076 ± 0.573
GluGln: 3.114 ± 0.464
GluArg: 3.354 ± 0.556
GluSer: 2.635 ± 0.51
GluThr: 3.354 ± 0.569
GluVal: 3.114 ± 0.556
GluTrp: 1.038 ± 0.317
GluTyr: 1.517 ± 0.31
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.274 ± 0.544
PheCys: 1.198 ± 0.321
PheAsp: 2.954 ± 0.531
PheGlu: 2.076 ± 0.395
PhePhe: 1.517 ± 0.392
PheGly: 2.156 ± 0.391
PheHis: 0.639 ± 0.272
PheIle: 2.236 ± 0.455
PheLys: 2.156 ± 0.334
PheLeu: 2.555 ± 0.416
PheMet: 1.038 ± 0.323
PheAsn: 2.076 ± 0.401
PhePro: 1.437 ± 0.466
PheGln: 1.357 ± 0.326
PheArg: 2.076 ± 0.523
PheSer: 2.874 ± 0.508
PheThr: 2.395 ± 0.432
PheVal: 1.757 ± 0.351
PheTrp: 0.319 ± 0.16
PheTyr: 1.198 ± 0.319
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.551 ± 0.678
GlyCys: 0.798 ± 0.23
GlyAsp: 4.312 ± 0.666
GlyGlu: 3.833 ± 0.658
GlyPhe: 3.114 ± 0.421
GlyGly: 4.312 ± 0.978
GlyHis: 0.878 ± 0.241
GlyIle: 4.631 ± 0.593
GlyLys: 4.871 ± 0.407
GlyLeu: 4.711 ± 0.676
GlyMet: 1.677 ± 0.31
GlyAsn: 1.916 ± 0.38
GlyPro: 0.958 ± 0.301
GlyGln: 3.433 ± 0.72
GlyArg: 3.194 ± 0.515
GlySer: 4.152 ± 0.719
GlyThr: 3.354 ± 0.49
GlyVal: 3.513 ± 0.656
GlyTrp: 0.878 ± 0.266
GlyTyr: 2.076 ± 0.419
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.757 ± 0.292
HisCys: 0.08 ± 0.087
HisAsp: 1.357 ± 0.309
HisGlu: 1.118 ± 0.328
HisPhe: 0.639 ± 0.242
HisGly: 1.118 ± 0.308
HisHis: 0.559 ± 0.225
HisIle: 1.437 ± 0.341
HisLys: 0.719 ± 0.201
HisLeu: 2.635 ± 0.525
HisMet: 0.639 ± 0.241
HisAsn: 1.118 ± 0.276
HisPro: 1.038 ± 0.31
HisGln: 1.916 ± 0.405
HisArg: 1.597 ± 0.342
HisSer: 1.278 ± 0.319
HisThr: 1.038 ± 0.354
HisVal: 0.878 ± 0.236
HisTrp: 0.479 ± 0.201
HisTyr: 0.479 ± 0.205
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.509 ± 0.796
IleCys: 0.878 ± 0.286
IleAsp: 3.354 ± 0.622
IleGlu: 5.43 ± 0.619
IlePhe: 1.916 ± 0.403
IleGly: 3.912 ± 0.569
IleHis: 1.198 ± 0.35
IleIle: 3.513 ± 0.632
IleLys: 3.753 ± 0.599
IleLeu: 3.833 ± 0.388
IleMet: 1.198 ± 0.283
IleAsn: 4.072 ± 0.559
IlePro: 3.114 ± 0.448
IleGln: 2.555 ± 0.416
IleArg: 3.034 ± 0.577
IleSer: 5.27 ± 0.868
IleThr: 3.433 ± 0.444
IleVal: 3.034 ± 0.448
IleTrp: 0.559 ± 0.184
IleTyr: 2.156 ± 0.492
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.027 ± 0.743
LysCys: 0.559 ± 0.189
LysAsp: 3.354 ± 0.694
LysGlu: 2.874 ± 0.441
LysPhe: 1.357 ± 0.398
LysGly: 3.433 ± 0.518
LysHis: 1.836 ± 0.352
LysIle: 3.593 ± 0.684
LysLys: 4.152 ± 0.515
LysLeu: 5.27 ± 0.614
LysMet: 2.475 ± 0.535
LysAsn: 2.874 ± 0.518
LysPro: 2.954 ± 0.743
LysGln: 2.954 ± 0.479
LysArg: 5.35 ± 0.802
LysSer: 5.509 ± 0.674
LysThr: 4.072 ± 0.487
LysVal: 2.795 ± 0.446
LysTrp: 1.198 ± 0.301
LysTyr: 2.236 ± 0.437
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.544 ± 0.649
LeuCys: 1.278 ± 0.347
LeuAsp: 3.833 ± 0.452
LeuGlu: 4.072 ± 0.59
LeuPhe: 3.513 ± 0.625
LeuGly: 5.43 ± 0.753
LeuHis: 1.357 ± 0.312
LeuIle: 5.749 ± 0.849
LeuLys: 5.509 ± 0.735
LeuLeu: 7.426 ± 0.841
LeuMet: 2.475 ± 0.496
LeuAsn: 4.631 ± 0.631
LeuPro: 4.312 ± 0.892
LeuGln: 3.513 ± 0.559
LeuArg: 7.266 ± 0.955
LeuSer: 7.346 ± 0.692
LeuThr: 5.749 ± 0.608
LeuVal: 4.871 ± 0.601
LeuTrp: 0.958 ± 0.304
LeuTyr: 2.715 ± 0.603
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.395 ± 0.519
MetCys: 0.399 ± 0.162
MetAsp: 0.878 ± 0.266
MetGlu: 1.278 ± 0.333
MetPhe: 0.479 ± 0.173
MetGly: 1.836 ± 0.378
MetHis: 0.639 ± 0.209
MetIle: 1.357 ± 0.301
MetLys: 1.916 ± 0.432
MetLeu: 2.316 ± 0.449
MetMet: 0.878 ± 0.318
MetAsn: 0.878 ± 0.246
MetPro: 1.198 ± 0.323
MetGln: 1.357 ± 0.438
MetArg: 2.236 ± 0.39
MetSer: 1.916 ± 0.521
MetThr: 1.278 ± 0.345
MetVal: 1.597 ± 0.308
MetTrp: 0.24 ± 0.16
MetTyr: 0.559 ± 0.182
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.791 ± 0.834
AsnCys: 0.878 ± 0.341
AsnAsp: 1.916 ± 0.447
AsnGlu: 1.757 ± 0.31
AsnPhe: 1.198 ± 0.34
AsnGly: 2.954 ± 0.702
AsnHis: 1.757 ± 0.363
AsnIle: 3.673 ± 0.49
AsnLys: 3.513 ± 0.55
AsnLeu: 3.833 ± 0.638
AsnMet: 1.038 ± 0.26
AsnAsn: 2.954 ± 0.553
AsnPro: 2.954 ± 0.422
AsnGln: 2.475 ± 0.479
AsnArg: 3.992 ± 0.555
AsnSer: 3.114 ± 0.522
AsnThr: 3.114 ± 0.396
AsnVal: 2.156 ± 0.411
AsnTrp: 0.958 ± 0.243
AsnTyr: 1.517 ± 0.346
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.274 ± 0.502
ProCys: 0.479 ± 0.222
ProAsp: 2.715 ± 0.442
ProGlu: 2.156 ± 0.491
ProPhe: 1.517 ± 0.403
ProGly: 2.635 ± 0.387
ProHis: 0.878 ± 0.296
ProIle: 2.076 ± 0.363
ProLys: 3.034 ± 0.519
ProLeu: 3.912 ± 0.519
ProMet: 0.798 ± 0.287
ProAsn: 2.395 ± 0.586
ProPro: 1.916 ± 0.545
ProGln: 1.996 ± 0.409
ProArg: 2.156 ± 0.525
ProSer: 3.194 ± 0.57
ProThr: 3.114 ± 0.515
ProVal: 3.354 ± 0.518
ProTrp: 0.319 ± 0.16
ProTyr: 1.278 ± 0.36
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.992 ± 0.622
GlnCys: 0.319 ± 0.143
GlnAsp: 2.156 ± 0.411
GlnGlu: 2.236 ± 0.602
GlnPhe: 1.916 ± 0.367
GlnGly: 2.395 ± 0.574
GlnHis: 0.878 ± 0.249
GlnIle: 3.753 ± 0.487
GlnLys: 2.635 ± 0.488
GlnLeu: 3.912 ± 0.428
GlnMet: 1.038 ± 0.311
GlnAsn: 2.475 ± 0.684
GlnPro: 2.395 ± 0.414
GlnGln: 3.194 ± 0.561
GlnArg: 2.795 ± 0.49
GlnSer: 3.274 ± 0.622
GlnThr: 2.316 ± 0.346
GlnVal: 2.316 ± 0.524
GlnTrp: 0.559 ± 0.199
GlnTyr: 1.437 ± 0.526
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.95 ± 0.645
ArgCys: 0.878 ± 0.319
ArgAsp: 2.236 ± 0.414
ArgGlu: 2.874 ± 0.536
ArgPhe: 4.072 ± 0.661
ArgGly: 2.316 ± 0.468
ArgHis: 1.517 ± 0.321
ArgIle: 4.072 ± 0.501
ArgLys: 4.232 ± 0.818
ArgLeu: 6.148 ± 0.695
ArgMet: 1.437 ± 0.298
ArgAsn: 3.593 ± 0.531
ArgPro: 2.316 ± 0.421
ArgGln: 2.236 ± 0.431
ArgArg: 4.392 ± 0.614
ArgSer: 2.795 ± 0.469
ArgThr: 3.433 ± 0.734
ArgVal: 3.114 ± 0.418
ArgTrp: 1.597 ± 0.384
ArgTyr: 2.236 ± 0.543
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.947 ± 1.354
SerCys: 0.958 ± 0.328
SerAsp: 3.593 ± 0.479
SerGlu: 3.992 ± 0.477
SerPhe: 2.236 ± 0.417
SerGly: 4.871 ± 0.586
SerHis: 1.916 ± 0.449
SerIle: 3.513 ± 0.491
SerLys: 3.513 ± 0.591
SerLeu: 6.547 ± 0.958
SerMet: 2.156 ± 0.414
SerAsn: 3.593 ± 0.641
SerPro: 4.232 ± 0.477
SerGln: 2.395 ± 0.414
SerArg: 3.513 ± 0.5
SerSer: 5.11 ± 0.723
SerThr: 3.354 ± 0.592
SerVal: 3.833 ± 0.421
SerTrp: 0.719 ± 0.228
SerTyr: 1.597 ± 0.438
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.27 ± 0.62
ThrCys: 0.399 ± 0.178
ThrAsp: 3.354 ± 0.678
ThrGlu: 3.114 ± 0.433
ThrPhe: 2.076 ± 0.499
ThrGly: 4.631 ± 0.653
ThrHis: 1.437 ± 0.373
ThrIle: 2.635 ± 0.509
ThrLys: 3.034 ± 0.503
ThrLeu: 5.669 ± 0.722
ThrMet: 1.278 ± 0.361
ThrAsn: 1.996 ± 0.378
ThrPro: 2.635 ± 0.393
ThrGln: 3.513 ± 0.788
ThrArg: 2.475 ± 0.478
ThrSer: 3.593 ± 0.827
ThrThr: 2.316 ± 0.627
ThrVal: 3.833 ± 0.539
ThrTrp: 0.719 ± 0.26
ThrTyr: 1.437 ± 0.385
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.871 ± 0.613
ValCys: 0.798 ± 0.318
ValAsp: 3.433 ± 0.696
ValGlu: 3.833 ± 0.473
ValPhe: 1.677 ± 0.351
ValGly: 3.034 ± 0.674
ValHis: 1.118 ± 0.286
ValIle: 3.114 ± 0.515
ValLys: 4.312 ± 0.747
ValLeu: 4.312 ± 0.49
ValMet: 1.038 ± 0.285
ValAsn: 2.635 ± 0.387
ValPro: 1.996 ± 0.392
ValGln: 1.437 ± 0.301
ValArg: 2.475 ± 0.775
ValSer: 4.152 ± 0.532
ValThr: 3.354 ± 0.403
ValVal: 2.874 ± 0.672
ValTrp: 1.118 ± 0.307
ValTyr: 2.076 ± 0.482
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.198 ± 0.267
TrpCys: 0.24 ± 0.163
TrpAsp: 0.878 ± 0.281
TrpGlu: 0.399 ± 0.147
TrpPhe: 0.958 ± 0.25
TrpGly: 0.798 ± 0.236
TrpHis: 0.16 ± 0.104
TrpIle: 0.559 ± 0.3
TrpLys: 0.719 ± 0.257
TrpLeu: 2.076 ± 0.475
TrpMet: 0.319 ± 0.192
TrpAsn: 0.399 ± 0.177
TrpPro: 0.639 ± 0.213
TrpGln: 0.639 ± 0.217
TrpArg: 0.878 ± 0.295
TrpSer: 1.198 ± 0.307
TrpThr: 0.639 ± 0.171
TrpVal: 1.198 ± 0.349
TrpTrp: 0.16 ± 0.095
TrpTyr: 0.559 ± 0.173
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.475 ± 0.598
TyrCys: 0.24 ± 0.135
TyrAsp: 2.236 ± 0.548
TyrGlu: 1.597 ± 0.332
TyrPhe: 1.357 ± 0.341
TyrGly: 1.996 ± 0.454
TyrHis: 0.878 ± 0.281
TyrIle: 2.395 ± 0.557
TyrLys: 1.757 ± 0.446
TyrLeu: 2.316 ± 0.494
TyrMet: 0.479 ± 0.182
TyrAsn: 1.677 ± 0.415
TyrPro: 1.038 ± 0.292
TyrGln: 2.076 ± 0.42
TyrArg: 2.475 ± 0.542
TyrSer: 1.836 ± 0.401
TyrThr: 1.437 ± 0.307
TyrVal: 1.437 ± 0.318
TyrTrp: 0.319 ± 0.15
TyrTyr: 0.798 ± 0.211
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12525 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
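
To make the note above concrete, here is a minimal Python sketch of a protein-level bootstrap: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names are hypothetical, and using the sample standard deviation as the reported error is an assumption; the source only states that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency_permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency_permille(sample, dipeptide))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy input, not real APSE-1 sequences.
toy_proteins = ["MKAAL", "AALK", "GGGG"]
print(dipeptide_frequency_permille(toy_proteins, "AA"),
      bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins. A dipeptide that never occurs gets an error of exactly 0, consistent with the 0.0 ± 0.0 entries in the Xaa rows and columns.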

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski