Amino acid dipepetide frequency for Lactococcus phage proPhi1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.165AlaAla: 4.165 ± 0.62
0.472AlaCys: 0.472 ± 0.156
4.323AlaAsp: 4.323 ± 0.502
4.008AlaGlu: 4.008 ± 0.812
3.458AlaPhe: 3.458 ± 0.46
2.908AlaGly: 2.908 ± 0.547
0.786AlaHis: 0.786 ± 0.237
4.637AlaIle: 4.637 ± 0.794
4.715AlaLys: 4.715 ± 0.532
5.816AlaLeu: 5.816 ± 0.737
1.415AlaMet: 1.415 ± 0.267
5.187AlaAsn: 5.187 ± 0.86
1.179AlaPro: 1.179 ± 0.323
2.829AlaGln: 2.829 ± 0.457
2.594AlaArg: 2.594 ± 0.483
4.637AlaSer: 4.637 ± 0.466
3.301AlaThr: 3.301 ± 0.575
4.165AlaVal: 4.165 ± 0.763
1.257AlaTrp: 1.257 ± 0.433
2.986AlaTyr: 2.986 ± 0.53
0.0AlaXaa: 0.0 ± 0.0
Cys
0.314CysAla: 0.314 ± 0.146
0.0CysCys: 0.0 ± 0.0
0.629CysAsp: 0.629 ± 0.251
0.629CysGlu: 0.629 ± 0.226
0.157CysPhe: 0.157 ± 0.111
0.472CysGly: 0.472 ± 0.222
0.157CysHis: 0.157 ± 0.104
0.236CysIle: 0.236 ± 0.141
0.393CysLys: 0.393 ± 0.154
0.393CysLeu: 0.393 ± 0.163
0.314CysMet: 0.314 ± 0.145
0.314CysAsn: 0.314 ± 0.148
0.314CysPro: 0.314 ± 0.164
0.157CysGln: 0.157 ± 0.104
0.314CysArg: 0.314 ± 0.143
0.707CysSer: 0.707 ± 0.188
0.0CysThr: 0.0 ± 0.0
0.157CysVal: 0.157 ± 0.096
0.0CysTrp: 0.0 ± 0.0
0.236CysTyr: 0.236 ± 0.177
0.0CysXaa: 0.0 ± 0.0
Asp
3.144AspAla: 3.144 ± 0.504
0.314AspCys: 0.314 ± 0.171
3.93AspAsp: 3.93 ± 0.656
6.366AspGlu: 6.366 ± 0.96
3.065AspPhe: 3.065 ± 0.535
4.873AspGly: 4.873 ± 0.773
0.55AspHis: 0.55 ± 0.211
3.93AspIle: 3.93 ± 0.413
5.108AspLys: 5.108 ± 0.63
5.187AspLeu: 5.187 ± 0.561
1.257AspMet: 1.257 ± 0.289
3.537AspAsn: 3.537 ± 0.451
0.865AspPro: 0.865 ± 0.304
1.179AspGln: 1.179 ± 0.351
2.515AspArg: 2.515 ± 0.443
3.694AspSer: 3.694 ± 0.529
3.065AspThr: 3.065 ± 0.441
3.851AspVal: 3.851 ± 0.456
1.415AspTrp: 1.415 ± 0.377
3.222AspTyr: 3.222 ± 0.502
0.0AspXaa: 0.0 ± 0.0
Glu
4.401GluAla: 4.401 ± 0.68
0.55GluCys: 0.55 ± 0.183
2.986GluAsp: 2.986 ± 0.56
6.209GluGlu: 6.209 ± 1.133
4.008GluPhe: 4.008 ± 0.496
2.751GluGly: 2.751 ± 0.424
0.629GluHis: 0.629 ± 0.247
4.794GluIle: 4.794 ± 0.845
7.938GluLys: 7.938 ± 1.3
8.488GluLeu: 8.488 ± 1.018
1.572GluMet: 1.572 ± 0.306
4.165GluAsn: 4.165 ± 0.583
2.358GluPro: 2.358 ± 0.421
3.144GluGln: 3.144 ± 0.535
2.672GluArg: 2.672 ± 0.513
3.694GluSer: 3.694 ± 0.568
3.851GluThr: 3.851 ± 0.62
5.03GluVal: 5.03 ± 0.511
1.179GluTrp: 1.179 ± 0.309
3.065GluTyr: 3.065 ± 0.523
0.0GluXaa: 0.0 ± 0.0
Phe
2.436PheAla: 2.436 ± 0.41
0.629PheCys: 0.629 ± 0.213
3.93PheAsp: 3.93 ± 0.568
3.222PheGlu: 3.222 ± 0.585
1.179PhePhe: 1.179 ± 0.37
2.436PheGly: 2.436 ± 0.391
0.472PheHis: 0.472 ± 0.196
2.751PheIle: 2.751 ± 0.505
5.03PheLys: 5.03 ± 0.62
2.043PheLeu: 2.043 ± 0.362
1.415PheMet: 1.415 ± 0.343
2.908PheAsn: 2.908 ± 0.422
0.786PhePro: 0.786 ± 0.261
1.965PheGln: 1.965 ± 0.411
1.179PheArg: 1.179 ± 0.297
3.537PheSer: 3.537 ± 0.442
2.751PheThr: 2.751 ± 0.544
2.908PheVal: 2.908 ± 0.369
0.236PheTrp: 0.236 ± 0.145
1.1PheTyr: 1.1 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
3.537GlyAla: 3.537 ± 0.488
0.472GlyCys: 0.472 ± 0.182
2.672GlyAsp: 2.672 ± 0.559
3.851GlyGlu: 3.851 ± 0.622
3.301GlyPhe: 3.301 ± 0.537
4.637GlyGly: 4.637 ± 0.793
0.786GlyHis: 0.786 ± 0.237
4.244GlyIle: 4.244 ± 0.613
5.737GlyLys: 5.737 ± 0.676
4.637GlyLeu: 4.637 ± 0.89
2.201GlyMet: 2.201 ± 0.481
3.301GlyAsn: 3.301 ± 0.688
0.943GlyPro: 0.943 ± 0.311
2.829GlyGln: 2.829 ± 0.629
2.358GlyArg: 2.358 ± 0.535
4.244GlySer: 4.244 ± 0.464
4.558GlyThr: 4.558 ± 0.849
3.458GlyVal: 3.458 ± 0.5
0.786GlyTrp: 0.786 ± 0.278
3.458GlyTyr: 3.458 ± 0.447
0.0GlyXaa: 0.0 ± 0.0
His
1.415HisAla: 1.415 ± 0.396
0.079HisCys: 0.079 ± 0.077
0.786HisAsp: 0.786 ± 0.247
0.786HisGlu: 0.786 ± 0.247
0.943HisPhe: 0.943 ± 0.282
0.55HisGly: 0.55 ± 0.22
0.157HisHis: 0.157 ± 0.11
0.393HisIle: 0.393 ± 0.15
0.786HisLys: 0.786 ± 0.242
0.943HisLeu: 0.943 ± 0.298
0.157HisMet: 0.157 ± 0.121
0.393HisAsn: 0.393 ± 0.169
0.472HisPro: 0.472 ± 0.143
0.707HisGln: 0.707 ± 0.254
0.157HisArg: 0.157 ± 0.103
0.865HisSer: 0.865 ± 0.222
0.707HisThr: 0.707 ± 0.205
0.707HisVal: 0.707 ± 0.187
0.157HisTrp: 0.157 ± 0.113
0.55HisTyr: 0.55 ± 0.187
0.0HisXaa: 0.0 ± 0.0
Ile
4.401IleAla: 4.401 ± 0.551
0.472IleCys: 0.472 ± 0.197
4.323IleAsp: 4.323 ± 0.492
6.052IleGlu: 6.052 ± 0.84
1.965IlePhe: 1.965 ± 0.377
4.715IleGly: 4.715 ± 0.529
0.629IleHis: 0.629 ± 0.223
4.48IleIle: 4.48 ± 0.621
5.816IleLys: 5.816 ± 0.567
4.165IleLeu: 4.165 ± 0.628
1.415IleMet: 1.415 ± 0.285
4.244IleAsn: 4.244 ± 0.504
2.122IlePro: 2.122 ± 0.402
2.515IleGln: 2.515 ± 0.398
2.594IleArg: 2.594 ± 0.5
5.973IleSer: 5.973 ± 0.646
3.772IleThr: 3.772 ± 0.573
3.694IleVal: 3.694 ± 0.72
0.865IleTrp: 0.865 ± 0.281
2.201IleTyr: 2.201 ± 0.478
0.0IleXaa: 0.0 ± 0.0
Lys
7.388LysAla: 7.388 ± 0.881
0.236LysCys: 0.236 ± 0.134
5.58LysAsp: 5.58 ± 0.675
7.466LysGlu: 7.466 ± 1.077
3.694LysPhe: 3.694 ± 0.492
5.659LysGly: 5.659 ± 0.599
1.729LysHis: 1.729 ± 0.435
5.501LysIle: 5.501 ± 0.697
9.195LysLys: 9.195 ± 1.132
7.859LysLeu: 7.859 ± 0.844
2.043LysMet: 2.043 ± 0.369
6.209LysAsn: 6.209 ± 0.701
2.751LysPro: 2.751 ± 0.429
3.537LysGln: 3.537 ± 0.628
3.694LysArg: 3.694 ± 0.603
5.108LysSer: 5.108 ± 0.685
5.58LysThr: 5.58 ± 0.725
4.637LysVal: 4.637 ± 0.627
1.336LysTrp: 1.336 ± 0.343
3.222LysTyr: 3.222 ± 0.55
0.0LysXaa: 0.0 ± 0.0
Leu
5.816LeuAla: 5.816 ± 0.681
0.393LeuCys: 0.393 ± 0.178
5.737LeuAsp: 5.737 ± 0.615
5.894LeuGlu: 5.894 ± 0.75
2.672LeuPhe: 2.672 ± 0.495
5.187LeuGly: 5.187 ± 0.611
0.865LeuHis: 0.865 ± 0.283
5.266LeuIle: 5.266 ± 0.576
8.409LeuLys: 8.409 ± 0.89
6.916LeuLeu: 6.916 ± 0.92
2.201LeuMet: 2.201 ± 0.39
4.794LeuAsn: 4.794 ± 0.555
3.379LeuPro: 3.379 ± 0.514
3.851LeuGln: 3.851 ± 0.563
2.122LeuArg: 2.122 ± 0.397
6.209LeuSer: 6.209 ± 0.588
4.794LeuThr: 4.794 ± 0.666
4.087LeuVal: 4.087 ± 0.509
1.022LeuTrp: 1.022 ± 0.559
2.908LeuTyr: 2.908 ± 0.581
0.0LeuXaa: 0.0 ± 0.0
Met
1.808MetAla: 1.808 ± 0.326
0.236MetCys: 0.236 ± 0.119
1.257MetAsp: 1.257 ± 0.267
1.886MetGlu: 1.886 ± 0.436
0.629MetPhe: 0.629 ± 0.205
1.022MetGly: 1.022 ± 0.282
0.314MetHis: 0.314 ± 0.141
1.1MetIle: 1.1 ± 0.324
3.144MetLys: 3.144 ± 0.491
1.493MetLeu: 1.493 ± 0.31
0.393MetMet: 0.393 ± 0.2
1.1MetAsn: 1.1 ± 0.285
1.022MetPro: 1.022 ± 0.275
1.1MetGln: 1.1 ± 0.34
0.943MetArg: 0.943 ± 0.269
1.65MetSer: 1.65 ± 0.408
2.986MetThr: 2.986 ± 0.438
1.493MetVal: 1.493 ± 0.351
0.079MetTrp: 0.079 ± 0.077
0.707MetTyr: 0.707 ± 0.279
0.0MetXaa: 0.0 ± 0.0
Asn
4.48AsnAla: 4.48 ± 0.753
0.393AsnCys: 0.393 ± 0.164
3.772AsnAsp: 3.772 ± 0.452
3.065AsnGlu: 3.065 ± 0.574
2.515AsnPhe: 2.515 ± 0.437
6.287AsnGly: 6.287 ± 0.904
0.786AsnHis: 0.786 ± 0.29
3.93AsnIle: 3.93 ± 0.503
5.108AsnLys: 5.108 ± 0.607
5.816AsnLeu: 5.816 ± 0.745
1.493AsnMet: 1.493 ± 0.359
2.751AsnAsn: 2.751 ± 0.544
2.594AsnPro: 2.594 ± 0.424
2.751AsnGln: 2.751 ± 0.543
2.122AsnArg: 2.122 ± 0.351
4.087AsnSer: 4.087 ± 0.494
2.751AsnThr: 2.751 ± 0.462
2.986AsnVal: 2.986 ± 0.53
1.022AsnTrp: 1.022 ± 0.228
2.436AsnTyr: 2.436 ± 0.396
0.0AsnXaa: 0.0 ± 0.0
Pro
1.572ProAla: 1.572 ± 0.42
0.079ProCys: 0.079 ± 0.068
2.201ProAsp: 2.201 ± 0.489
2.043ProGlu: 2.043 ± 0.393
1.493ProPhe: 1.493 ± 0.366
0.629ProGly: 0.629 ± 0.24
0.472ProHis: 0.472 ± 0.157
2.043ProIle: 2.043 ± 0.432
2.515ProLys: 2.515 ± 0.406
2.122ProLeu: 2.122 ± 0.366
0.629ProMet: 0.629 ± 0.221
1.729ProAsn: 1.729 ± 0.514
0.865ProPro: 0.865 ± 0.203
1.336ProGln: 1.336 ± 0.309
0.943ProArg: 0.943 ± 0.256
2.043ProSer: 2.043 ± 0.395
2.122ProThr: 2.122 ± 0.286
2.122ProVal: 2.122 ± 0.389
0.629ProTrp: 0.629 ± 0.232
0.865ProTyr: 0.865 ± 0.233
0.0ProXaa: 0.0 ± 0.0
Gln
4.323GlnAla: 4.323 ± 0.502
0.236GlnCys: 0.236 ± 0.129
1.257GlnAsp: 1.257 ± 0.297
4.165GlnGlu: 4.165 ± 0.468
1.65GlnPhe: 1.65 ± 0.311
2.515GlnGly: 2.515 ± 0.589
0.236GlnHis: 0.236 ± 0.14
2.594GlnIle: 2.594 ± 0.36
2.751GlnLys: 2.751 ± 0.474
3.222GlnLeu: 3.222 ± 0.553
0.943GlnMet: 0.943 ± 0.221
2.515GlnAsn: 2.515 ± 0.44
1.493GlnPro: 1.493 ± 0.379
2.122GlnGln: 2.122 ± 0.387
1.179GlnArg: 1.179 ± 0.24
2.515GlnSer: 2.515 ± 0.559
2.986GlnThr: 2.986 ± 0.647
2.751GlnVal: 2.751 ± 0.455
1.022GlnTrp: 1.022 ± 0.236
1.257GlnTyr: 1.257 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
1.965ArgAla: 1.965 ± 0.339
0.314ArgCys: 0.314 ± 0.159
1.729ArgAsp: 1.729 ± 0.389
2.122ArgGlu: 2.122 ± 0.492
1.808ArgPhe: 1.808 ± 0.314
1.572ArgGly: 1.572 ± 0.433
0.472ArgHis: 0.472 ± 0.178
2.829ArgIle: 2.829 ± 0.491
3.93ArgLys: 3.93 ± 0.665
4.401ArgLeu: 4.401 ± 0.831
1.022ArgMet: 1.022 ± 0.242
2.436ArgAsn: 2.436 ± 0.445
0.786ArgPro: 0.786 ± 0.395
1.1ArgGln: 1.1 ± 0.227
1.336ArgArg: 1.336 ± 0.469
1.808ArgSer: 1.808 ± 0.314
1.808ArgThr: 1.808 ± 0.347
2.279ArgVal: 2.279 ± 0.357
0.236ArgTrp: 0.236 ± 0.133
1.729ArgTyr: 1.729 ± 0.41
0.0ArgXaa: 0.0 ± 0.0
Ser
3.93SerAla: 3.93 ± 0.837
0.236SerCys: 0.236 ± 0.132
5.03SerAsp: 5.03 ± 0.575
5.03SerGlu: 5.03 ± 0.629
3.144SerPhe: 3.144 ± 0.491
4.637SerGly: 4.637 ± 0.651
0.865SerHis: 0.865 ± 0.228
4.244SerIle: 4.244 ± 0.509
4.794SerLys: 4.794 ± 0.643
5.108SerLeu: 5.108 ± 0.711
1.886SerMet: 1.886 ± 0.388
4.323SerAsn: 4.323 ± 0.574
1.1SerPro: 1.1 ± 0.248
3.222SerGln: 3.222 ± 0.457
2.201SerArg: 2.201 ± 0.386
5.108SerSer: 5.108 ± 0.749
4.165SerThr: 4.165 ± 0.591
4.715SerVal: 4.715 ± 0.612
0.943SerTrp: 0.943 ± 0.245
2.751SerTyr: 2.751 ± 0.462
0.0SerXaa: 0.0 ± 0.0
Thr
4.087ThrAla: 4.087 ± 0.738
0.157ThrCys: 0.157 ± 0.104
3.615ThrAsp: 3.615 ± 0.681
3.537ThrGlu: 3.537 ± 0.577
2.751ThrPhe: 2.751 ± 0.493
4.48ThrGly: 4.48 ± 0.71
0.55ThrHis: 0.55 ± 0.22
4.637ThrIle: 4.637 ± 0.815
5.737ThrLys: 5.737 ± 0.493
4.715ThrLeu: 4.715 ± 0.625
1.179ThrMet: 1.179 ± 0.347
3.065ThrAsn: 3.065 ± 0.466
1.729ThrPro: 1.729 ± 0.488
2.043ThrGln: 2.043 ± 0.373
2.122ThrArg: 2.122 ± 0.385
3.458ThrSer: 3.458 ± 0.423
5.737ThrThr: 5.737 ± 0.912
4.951ThrVal: 4.951 ± 0.687
0.943ThrTrp: 0.943 ± 0.349
2.436ThrTyr: 2.436 ± 0.382
0.0ThrXaa: 0.0 ± 0.0
Val
2.751ValAla: 2.751 ± 0.678
0.314ValCys: 0.314 ± 0.146
3.851ValAsp: 3.851 ± 0.696
4.48ValGlu: 4.48 ± 0.561
2.358ValPhe: 2.358 ± 0.471
3.301ValGly: 3.301 ± 0.594
0.393ValHis: 0.393 ± 0.169
4.637ValIle: 4.637 ± 0.564
6.445ValLys: 6.445 ± 0.806
5.108ValLeu: 5.108 ± 0.623
1.572ValMet: 1.572 ± 0.353
4.48ValAsn: 4.48 ± 0.724
1.886ValPro: 1.886 ± 0.471
2.594ValGln: 2.594 ± 0.613
1.65ValArg: 1.65 ± 0.315
3.93ValSer: 3.93 ± 0.573
4.087ValThr: 4.087 ± 0.656
3.694ValVal: 3.694 ± 0.579
1.1ValTrp: 1.1 ± 0.25
2.043ValTyr: 2.043 ± 0.382
0.0ValXaa: 0.0 ± 0.0
Trp
0.786TrpAla: 0.786 ± 0.258
0.0TrpCys: 0.0 ± 0.0
1.022TrpAsp: 1.022 ± 0.415
0.629TrpGlu: 0.629 ± 0.279
0.707TrpPhe: 0.707 ± 0.252
0.707TrpGly: 0.707 ± 0.24
0.393TrpHis: 0.393 ± 0.156
1.336TrpIle: 1.336 ± 0.297
1.257TrpLys: 1.257 ± 0.266
1.179TrpLeu: 1.179 ± 0.345
0.393TrpMet: 0.393 ± 0.133
1.415TrpAsn: 1.415 ± 0.445
0.157TrpPro: 0.157 ± 0.096
1.022TrpGln: 1.022 ± 0.316
0.707TrpArg: 0.707 ± 0.205
0.707TrpSer: 0.707 ± 0.254
0.707TrpThr: 0.707 ± 0.248
1.1TrpVal: 1.1 ± 0.324
0.472TrpTrp: 0.472 ± 0.221
0.786TrpTyr: 0.786 ± 0.243
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.122TyrAla: 2.122 ± 0.285
0.314TyrCys: 0.314 ± 0.171
2.436TyrAsp: 2.436 ± 0.478
2.043TyrGlu: 2.043 ± 0.429
1.415TyrPhe: 1.415 ± 0.307
2.672TyrGly: 2.672 ± 0.497
0.472TyrHis: 0.472 ± 0.181
2.829TyrIle: 2.829 ± 0.435
3.615TyrLys: 3.615 ± 0.558
2.986TyrLeu: 2.986 ± 0.524
0.786TyrMet: 0.786 ± 0.216
2.279TyrAsn: 2.279 ± 0.449
1.65TyrPro: 1.65 ± 0.331
1.729TyrGln: 1.729 ± 0.336
2.279TyrArg: 2.279 ± 0.429
3.458TyrSer: 3.458 ± 0.527
2.201TyrThr: 2.201 ± 0.476
1.886TyrVal: 1.886 ± 0.397
0.786TyrTrp: 0.786 ± 0.217
1.886TyrTyr: 1.886 ± 0.376
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (12725 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski