Amino acid dipepetide frequency for Streptococcus phage phiARI0468-4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.947AlaAla: 3.947 ± 1.066
0.439AlaCys: 0.439 ± 0.194
5.526AlaAsp: 5.526 ± 1.4
6.14AlaGlu: 6.14 ± 0.994
3.158AlaPhe: 3.158 ± 0.496
3.947AlaGly: 3.947 ± 0.589
0.526AlaHis: 0.526 ± 0.223
5.175AlaIle: 5.175 ± 0.763
4.825AlaLys: 4.825 ± 0.584
6.667AlaLeu: 6.667 ± 1.005
2.018AlaMet: 2.018 ± 0.549
3.596AlaAsn: 3.596 ± 0.496
1.491AlaPro: 1.491 ± 0.331
2.807AlaGln: 2.807 ± 0.665
2.982AlaArg: 2.982 ± 0.526
4.298AlaSer: 4.298 ± 0.639
4.474AlaThr: 4.474 ± 0.83
4.649AlaVal: 4.649 ± 0.638
0.614AlaTrp: 0.614 ± 0.22
2.807AlaTyr: 2.807 ± 0.645
0.0AlaXaa: 0.0 ± 0.0
Cys
0.263CysAla: 0.263 ± 0.159
0.088CysCys: 0.088 ± 0.089
0.175CysAsp: 0.175 ± 0.146
0.789CysGlu: 0.789 ± 0.24
0.526CysPhe: 0.526 ± 0.241
0.351CysGly: 0.351 ± 0.156
0.088CysHis: 0.088 ± 0.091
0.263CysIle: 0.263 ± 0.138
0.439CysLys: 0.439 ± 0.17
0.175CysLeu: 0.175 ± 0.118
0.088CysMet: 0.088 ± 0.084
0.0CysAsn: 0.0 ± 0.0
0.263CysPro: 0.263 ± 0.166
0.175CysGln: 0.175 ± 0.136
0.0CysArg: 0.0 ± 0.0
0.351CysSer: 0.351 ± 0.181
0.0CysThr: 0.0 ± 0.0
0.439CysVal: 0.439 ± 0.218
0.175CysTrp: 0.175 ± 0.115
0.175CysTyr: 0.175 ± 0.144
0.0CysXaa: 0.0 ± 0.0
Asp
3.509AspAla: 3.509 ± 0.561
0.263AspCys: 0.263 ± 0.146
3.86AspAsp: 3.86 ± 0.704
3.07AspGlu: 3.07 ± 0.572
4.035AspPhe: 4.035 ± 0.532
6.93AspGly: 6.93 ± 1.157
0.614AspHis: 0.614 ± 0.222
4.298AspIle: 4.298 ± 0.587
6.491AspLys: 6.491 ± 0.719
4.123AspLeu: 4.123 ± 0.669
1.754AspMet: 1.754 ± 0.335
4.035AspAsn: 4.035 ± 0.551
1.491AspPro: 1.491 ± 0.384
1.491AspGln: 1.491 ± 0.315
2.456AspArg: 2.456 ± 0.353
3.421AspSer: 3.421 ± 0.739
3.158AspThr: 3.158 ± 0.356
3.684AspVal: 3.684 ± 0.489
0.439AspTrp: 0.439 ± 0.191
4.035AspTyr: 4.035 ± 0.646
0.0AspXaa: 0.0 ± 0.0
Glu
4.211GluAla: 4.211 ± 0.686
0.088GluCys: 0.088 ± 0.089
4.211GluAsp: 4.211 ± 0.843
6.316GluGlu: 6.316 ± 0.95
2.544GluPhe: 2.544 ± 0.477
2.368GluGly: 2.368 ± 0.373
1.14GluHis: 1.14 ± 0.311
5.965GluIle: 5.965 ± 0.88
6.316GluLys: 6.316 ± 0.797
7.895GluLeu: 7.895 ± 1.045
2.281GluMet: 2.281 ± 0.495
4.737GluAsn: 4.737 ± 0.568
1.667GluPro: 1.667 ± 0.41
4.649GluGln: 4.649 ± 0.529
3.421GluArg: 3.421 ± 0.628
3.246GluSer: 3.246 ± 0.747
3.86GluThr: 3.86 ± 0.447
4.825GluVal: 4.825 ± 0.643
0.702GluTrp: 0.702 ± 0.217
3.07GluTyr: 3.07 ± 0.573
0.0GluXaa: 0.0 ± 0.0
Phe
2.368PheAla: 2.368 ± 0.386
0.175PheCys: 0.175 ± 0.109
4.123PheAsp: 4.123 ± 0.5
4.474PheGlu: 4.474 ± 0.676
1.316PhePhe: 1.316 ± 0.442
2.105PheGly: 2.105 ± 0.388
0.614PheHis: 0.614 ± 0.246
2.368PheIle: 2.368 ± 0.374
2.105PheLys: 2.105 ± 0.402
2.105PheLeu: 2.105 ± 0.444
0.877PheMet: 0.877 ± 0.312
3.246PheAsn: 3.246 ± 0.426
0.789PhePro: 0.789 ± 0.272
1.93PheGln: 1.93 ± 0.485
1.667PheArg: 1.667 ± 0.342
3.333PheSer: 3.333 ± 0.592
3.246PheThr: 3.246 ± 0.628
2.368PheVal: 2.368 ± 0.497
0.175PheTrp: 0.175 ± 0.116
1.842PheTyr: 1.842 ± 0.446
0.0PheXaa: 0.0 ± 0.0
Gly
5.175GlyAla: 5.175 ± 0.969
0.351GlyCys: 0.351 ± 0.162
3.158GlyAsp: 3.158 ± 0.498
4.035GlyGlu: 4.035 ± 0.553
3.421GlyPhe: 3.421 ± 0.599
5.0GlyGly: 5.0 ± 0.607
1.316GlyHis: 1.316 ± 0.304
4.561GlyIle: 4.561 ± 0.48
5.175GlyLys: 5.175 ± 0.665
4.912GlyLeu: 4.912 ± 0.838
2.193GlyMet: 2.193 ± 0.405
3.509GlyAsn: 3.509 ± 0.693
0.351GlyPro: 0.351 ± 0.175
2.456GlyGln: 2.456 ± 0.496
3.947GlyArg: 3.947 ± 0.918
5.263GlySer: 5.263 ± 0.976
4.737GlyThr: 4.737 ± 0.848
5.351GlyVal: 5.351 ± 0.649
0.526GlyTrp: 0.526 ± 0.265
2.105GlyTyr: 2.105 ± 0.397
0.0GlyXaa: 0.0 ± 0.0
His
0.789HisAla: 0.789 ± 0.202
0.175HisCys: 0.175 ± 0.13
0.877HisAsp: 0.877 ± 0.257
0.965HisGlu: 0.965 ± 0.274
0.702HisPhe: 0.702 ± 0.192
0.614HisGly: 0.614 ± 0.278
0.439HisHis: 0.439 ± 0.181
1.316HisIle: 1.316 ± 0.343
1.228HisLys: 1.228 ± 0.37
1.14HisLeu: 1.14 ± 0.293
0.439HisMet: 0.439 ± 0.18
0.526HisAsn: 0.526 ± 0.223
0.702HisPro: 0.702 ± 0.253
0.526HisGln: 0.526 ± 0.173
0.526HisArg: 0.526 ± 0.2
1.491HisSer: 1.491 ± 0.269
1.053HisThr: 1.053 ± 0.234
0.439HisVal: 0.439 ± 0.191
0.175HisTrp: 0.175 ± 0.112
0.877HisTyr: 0.877 ± 0.265
0.0HisXaa: 0.0 ± 0.0
Ile
4.912IleAla: 4.912 ± 0.656
0.263IleCys: 0.263 ± 0.138
4.649IleAsp: 4.649 ± 0.581
7.018IleGlu: 7.018 ± 1.046
2.281IlePhe: 2.281 ± 0.491
5.175IleGly: 5.175 ± 0.798
0.965IleHis: 0.965 ± 0.239
4.211IleIle: 4.211 ± 0.938
7.544IleLys: 7.544 ± 0.95
3.158IleLeu: 3.158 ± 0.592
1.316IleMet: 1.316 ± 0.291
4.035IleAsn: 4.035 ± 0.399
2.632IlePro: 2.632 ± 0.51
2.895IleGln: 2.895 ± 0.466
1.754IleArg: 1.754 ± 0.408
4.561IleSer: 4.561 ± 0.496
3.509IleThr: 3.509 ± 0.501
4.123IleVal: 4.123 ± 0.571
0.614IleTrp: 0.614 ± 0.207
2.105IleTyr: 2.105 ± 0.514
0.0IleXaa: 0.0 ± 0.0
Lys
6.579LysAla: 6.579 ± 0.919
0.614LysCys: 0.614 ± 0.19
3.421LysAsp: 3.421 ± 0.541
5.789LysGlu: 5.789 ± 0.882
2.105LysPhe: 2.105 ± 0.324
6.491LysGly: 6.491 ± 1.114
1.228LysHis: 1.228 ± 0.382
4.912LysIle: 4.912 ± 0.828
6.93LysLys: 6.93 ± 1.042
6.491LysLeu: 6.491 ± 0.794
1.579LysMet: 1.579 ± 0.33
5.789LysAsn: 5.789 ± 0.704
2.368LysPro: 2.368 ± 0.43
3.596LysGln: 3.596 ± 0.65
2.895LysArg: 2.895 ± 0.45
5.526LysSer: 5.526 ± 0.608
5.614LysThr: 5.614 ± 0.636
5.965LysVal: 5.965 ± 0.537
1.491LysTrp: 1.491 ± 0.256
3.684LysTyr: 3.684 ± 0.517
0.0LysXaa: 0.0 ± 0.0
Leu
5.439LeuAla: 5.439 ± 0.733
0.439LeuCys: 0.439 ± 0.193
5.175LeuAsp: 5.175 ± 0.631
6.404LeuGlu: 6.404 ± 0.948
2.807LeuPhe: 2.807 ± 0.591
4.298LeuGly: 4.298 ± 0.639
1.228LeuHis: 1.228 ± 0.304
2.982LeuIle: 2.982 ± 0.629
7.368LeuLys: 7.368 ± 0.987
4.825LeuLeu: 4.825 ± 0.779
1.316LeuMet: 1.316 ± 0.3
5.439LeuAsn: 5.439 ± 0.764
3.07LeuPro: 3.07 ± 0.515
4.123LeuGln: 4.123 ± 0.699
2.456LeuArg: 2.456 ± 0.568
4.737LeuSer: 4.737 ± 0.57
4.912LeuThr: 4.912 ± 0.652
4.386LeuVal: 4.386 ± 0.759
0.702LeuTrp: 0.702 ± 0.247
2.193LeuTyr: 2.193 ± 0.37
0.0LeuXaa: 0.0 ± 0.0
Met
2.544MetAla: 2.544 ± 0.497
0.175MetCys: 0.175 ± 0.139
1.14MetAsp: 1.14 ± 0.268
1.579MetGlu: 1.579 ± 0.41
0.702MetPhe: 0.702 ± 0.211
0.877MetGly: 0.877 ± 0.266
0.175MetHis: 0.175 ± 0.118
1.93MetIle: 1.93 ± 0.44
2.018MetLys: 2.018 ± 0.27
1.053MetLeu: 1.053 ± 0.21
0.526MetMet: 0.526 ± 0.182
1.491MetAsn: 1.491 ± 0.371
0.789MetPro: 0.789 ± 0.243
1.228MetGln: 1.228 ± 0.331
1.667MetArg: 1.667 ± 0.491
0.965MetSer: 0.965 ± 0.274
2.544MetThr: 2.544 ± 0.338
1.053MetVal: 1.053 ± 0.374
0.263MetTrp: 0.263 ± 0.137
0.614MetTyr: 0.614 ± 0.205
0.0MetXaa: 0.0 ± 0.0
Asn
4.649AsnAla: 4.649 ± 0.573
0.351AsnCys: 0.351 ± 0.219
3.772AsnAsp: 3.772 ± 0.534
4.211AsnGlu: 4.211 ± 0.532
2.456AsnPhe: 2.456 ± 0.565
5.0AsnGly: 5.0 ± 0.898
0.965AsnHis: 0.965 ± 0.243
3.684AsnIle: 3.684 ± 0.599
5.263AsnLys: 5.263 ± 0.605
4.912AsnLeu: 4.912 ± 0.809
1.579AsnMet: 1.579 ± 0.376
4.211AsnAsn: 4.211 ± 0.899
2.632AsnPro: 2.632 ± 0.614
2.544AsnGln: 2.544 ± 0.454
1.754AsnArg: 1.754 ± 0.438
3.772AsnSer: 3.772 ± 0.559
3.158AsnThr: 3.158 ± 0.388
2.719AsnVal: 2.719 ± 0.528
0.789AsnTrp: 0.789 ± 0.238
2.193AsnTyr: 2.193 ± 0.429
0.0AsnXaa: 0.0 ± 0.0
Pro
2.018ProAla: 2.018 ± 0.361
0.263ProCys: 0.263 ± 0.143
1.93ProAsp: 1.93 ± 0.476
1.93ProGlu: 1.93 ± 0.409
1.579ProPhe: 1.579 ± 0.428
1.228ProGly: 1.228 ± 0.456
0.351ProHis: 0.351 ± 0.199
2.105ProIle: 2.105 ± 0.425
2.193ProLys: 2.193 ± 0.557
1.754ProLeu: 1.754 ± 0.455
0.263ProMet: 0.263 ± 0.201
1.93ProAsn: 1.93 ± 0.496
0.526ProPro: 0.526 ± 0.234
1.14ProGln: 1.14 ± 0.396
0.965ProArg: 0.965 ± 0.241
1.842ProSer: 1.842 ± 0.313
2.193ProThr: 2.193 ± 0.417
1.93ProVal: 1.93 ± 0.361
0.351ProTrp: 0.351 ± 0.194
1.754ProTyr: 1.754 ± 0.454
0.0ProXaa: 0.0 ± 0.0
Gln
3.509GlnAla: 3.509 ± 0.53
0.088GlnCys: 0.088 ± 0.095
1.404GlnAsp: 1.404 ± 0.376
3.509GlnGlu: 3.509 ± 0.586
1.491GlnPhe: 1.491 ± 0.41
4.298GlnGly: 4.298 ± 1.171
0.614GlnHis: 0.614 ± 0.207
3.07GlnIle: 3.07 ± 0.391
4.737GlnLys: 4.737 ± 0.794
2.982GlnLeu: 2.982 ± 0.439
0.965GlnMet: 0.965 ± 0.293
1.93GlnAsn: 1.93 ± 0.549
0.877GlnPro: 0.877 ± 0.274
2.544GlnGln: 2.544 ± 0.748
2.281GlnArg: 2.281 ± 0.439
2.456GlnSer: 2.456 ± 0.507
2.193GlnThr: 2.193 ± 0.445
2.368GlnVal: 2.368 ± 0.584
0.526GlnTrp: 0.526 ± 0.161
2.193GlnTyr: 2.193 ± 0.343
0.0GlnXaa: 0.0 ± 0.0
Arg
2.982ArgAla: 2.982 ± 0.497
0.088ArgCys: 0.088 ± 0.107
2.105ArgAsp: 2.105 ± 0.369
2.544ArgGlu: 2.544 ± 0.6
1.842ArgPhe: 1.842 ± 0.321
2.895ArgGly: 2.895 ± 0.647
0.614ArgHis: 0.614 ± 0.236
2.193ArgIle: 2.193 ± 0.395
2.368ArgLys: 2.368 ± 0.406
3.772ArgLeu: 3.772 ± 0.612
0.965ArgMet: 0.965 ± 0.257
2.544ArgAsn: 2.544 ± 0.447
1.754ArgPro: 1.754 ± 0.537
1.842ArgGln: 1.842 ± 0.392
1.754ArgArg: 1.754 ± 0.315
1.754ArgSer: 1.754 ± 0.326
2.544ArgThr: 2.544 ± 0.585
2.456ArgVal: 2.456 ± 0.434
0.965ArgTrp: 0.965 ± 0.421
2.018ArgTyr: 2.018 ± 0.401
0.0ArgXaa: 0.0 ± 0.0
Ser
4.474SerAla: 4.474 ± 0.737
0.175SerCys: 0.175 ± 0.124
4.298SerAsp: 4.298 ± 0.671
3.947SerGlu: 3.947 ± 0.633
2.456SerPhe: 2.456 ± 0.478
4.035SerGly: 4.035 ± 0.96
1.228SerHis: 1.228 ± 0.345
3.333SerIle: 3.333 ± 0.469
3.947SerLys: 3.947 ± 0.614
5.0SerLeu: 5.0 ± 0.421
1.93SerMet: 1.93 ± 0.311
3.596SerAsn: 3.596 ± 0.521
1.754SerPro: 1.754 ± 0.365
2.895SerGln: 2.895 ± 0.467
2.018SerArg: 2.018 ± 0.368
4.649SerSer: 4.649 ± 0.852
4.912SerThr: 4.912 ± 1.021
4.737SerVal: 4.737 ± 0.77
0.877SerTrp: 0.877 ± 0.214
2.018SerTyr: 2.018 ± 0.338
0.0SerXaa: 0.0 ± 0.0
Thr
3.509ThrAla: 3.509 ± 0.962
0.439ThrCys: 0.439 ± 0.183
5.351ThrAsp: 5.351 ± 1.119
4.211ThrGlu: 4.211 ± 0.682
2.982ThrPhe: 2.982 ± 0.394
4.737ThrGly: 4.737 ± 0.503
0.614ThrHis: 0.614 ± 0.359
6.053ThrIle: 6.053 ± 0.593
5.0ThrLys: 5.0 ± 0.633
5.702ThrLeu: 5.702 ± 0.862
1.053ThrMet: 1.053 ± 0.271
3.86ThrAsn: 3.86 ± 0.496
2.719ThrPro: 2.719 ± 0.492
2.544ThrGln: 2.544 ± 0.522
2.632ThrArg: 2.632 ± 0.453
2.632ThrSer: 2.632 ± 0.419
4.474ThrThr: 4.474 ± 0.721
4.386ThrVal: 4.386 ± 0.614
1.053ThrTrp: 1.053 ± 0.276
2.193ThrTyr: 2.193 ± 0.475
0.0ThrXaa: 0.0 ± 0.0
Val
5.439ValAla: 5.439 ± 0.573
0.088ValCys: 0.088 ± 0.086
4.737ValAsp: 4.737 ± 0.65
3.509ValGlu: 3.509 ± 0.566
2.368ValPhe: 2.368 ± 0.411
3.684ValGly: 3.684 ± 0.584
1.14ValHis: 1.14 ± 0.353
5.351ValIle: 5.351 ± 0.673
5.0ValLys: 5.0 ± 0.66
3.246ValLeu: 3.246 ± 0.499
1.053ValMet: 1.053 ± 0.268
3.246ValAsn: 3.246 ± 0.528
0.877ValPro: 0.877 ± 0.334
2.456ValGln: 2.456 ± 0.518
2.193ValArg: 2.193 ± 0.422
4.386ValSer: 4.386 ± 0.556
5.526ValThr: 5.526 ± 0.564
3.772ValVal: 3.772 ± 0.701
0.877ValTrp: 0.877 ± 0.382
2.895ValTyr: 2.895 ± 0.575
0.0ValXaa: 0.0 ± 0.0
Trp
1.93TrpAla: 1.93 ± 0.474
0.0TrpCys: 0.0 ± 0.0
0.526TrpAsp: 0.526 ± 0.21
0.614TrpGlu: 0.614 ± 0.191
0.526TrpPhe: 0.526 ± 0.215
0.351TrpGly: 0.351 ± 0.159
0.351TrpHis: 0.351 ± 0.18
0.789TrpIle: 0.789 ± 0.275
0.965TrpLys: 0.965 ± 0.242
0.965TrpLeu: 0.965 ± 0.304
0.263TrpMet: 0.263 ± 0.145
0.614TrpAsn: 0.614 ± 0.189
0.175TrpPro: 0.175 ± 0.153
0.526TrpGln: 0.526 ± 0.204
0.439TrpArg: 0.439 ± 0.203
0.526TrpSer: 0.526 ± 0.149
0.965TrpThr: 0.965 ± 0.361
0.526TrpVal: 0.526 ± 0.228
0.263TrpTrp: 0.263 ± 0.137
0.702TrpTyr: 0.702 ± 0.281
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.842TyrAla: 1.842 ± 0.405
0.263TyrCys: 0.263 ± 0.216
2.719TyrAsp: 2.719 ± 0.534
2.456TyrGlu: 2.456 ± 0.572
2.018TyrPhe: 2.018 ± 0.416
3.158TyrGly: 3.158 ± 0.464
0.789TyrHis: 0.789 ± 0.412
3.158TyrIle: 3.158 ± 0.655
3.158TyrLys: 3.158 ± 0.582
3.421TyrLeu: 3.421 ± 0.583
0.789TyrMet: 0.789 ± 0.312
2.281TyrAsn: 2.281 ± 0.418
1.316TyrPro: 1.316 ± 0.292
1.667TyrGln: 1.667 ± 0.36
2.193TyrArg: 2.193 ± 0.488
3.07TyrSer: 3.07 ± 0.433
3.07TyrThr: 3.07 ± 0.724
1.667TyrVal: 1.667 ± 0.361
0.439TyrTrp: 0.439 ± 0.233
1.579TyrTyr: 1.579 ± 0.533
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11401 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski