Amino acid dipepetide frequency for Lactococcus phage proPhi4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.311AlaAla: 5.311 ± 1.643
0.261AlaCys: 0.261 ± 0.158
4.615AlaAsp: 4.615 ± 0.615
4.441AlaGlu: 4.441 ± 0.88
2.438AlaPhe: 2.438 ± 0.393
5.05AlaGly: 5.05 ± 0.92
0.958AlaHis: 0.958 ± 0.335
5.137AlaIle: 5.137 ± 0.658
5.747AlaLys: 5.747 ± 0.974
5.224AlaLeu: 5.224 ± 0.76
1.741AlaMet: 1.741 ± 0.377
3.657AlaAsn: 3.657 ± 0.62
1.741AlaPro: 1.741 ± 0.336
3.831AlaGln: 3.831 ± 0.758
1.219AlaArg: 1.219 ± 0.465
5.485AlaSer: 5.485 ± 0.666
4.354AlaThr: 4.354 ± 0.574
3.57AlaVal: 3.57 ± 0.515
1.045AlaTrp: 1.045 ± 0.296
2.525AlaTyr: 2.525 ± 0.452
0.0AlaXaa: 0.0 ± 0.0
Cys
0.087CysAla: 0.087 ± 0.085
0.087CysCys: 0.087 ± 0.086
0.261CysAsp: 0.261 ± 0.202
0.435CysGlu: 0.435 ± 0.269
0.174CysPhe: 0.174 ± 0.134
0.174CysGly: 0.174 ± 0.127
0.261CysHis: 0.261 ± 0.207
0.174CysIle: 0.174 ± 0.128
0.435CysLys: 0.435 ± 0.197
0.348CysLeu: 0.348 ± 0.18
0.174CysMet: 0.174 ± 0.118
0.348CysAsn: 0.348 ± 0.164
0.174CysPro: 0.174 ± 0.135
0.087CysGln: 0.087 ± 0.086
0.261CysArg: 0.261 ± 0.145
0.609CysSer: 0.609 ± 0.255
0.784CysThr: 0.784 ± 0.268
0.348CysVal: 0.348 ± 0.195
0.087CysTrp: 0.087 ± 0.085
0.174CysTyr: 0.174 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
3.396AspAla: 3.396 ± 0.437
0.174AspCys: 0.174 ± 0.114
4.615AspAsp: 4.615 ± 0.805
4.702AspGlu: 4.702 ± 0.848
2.786AspPhe: 2.786 ± 0.497
6.53AspGly: 6.53 ± 1.2
0.784AspHis: 0.784 ± 0.255
3.744AspIle: 3.744 ± 0.402
5.224AspLys: 5.224 ± 0.759
4.441AspLeu: 4.441 ± 0.6
1.48AspMet: 1.48 ± 0.332
3.918AspAsn: 3.918 ± 0.627
2.177AspPro: 2.177 ± 0.441
1.48AspGln: 1.48 ± 0.351
2.003AspArg: 2.003 ± 0.384
4.789AspSer: 4.789 ± 0.601
3.309AspThr: 3.309 ± 0.454
4.092AspVal: 4.092 ± 0.568
1.219AspTrp: 1.219 ± 0.346
2.96AspTyr: 2.96 ± 0.455
0.0AspXaa: 0.0 ± 0.0
Glu
4.963GluAla: 4.963 ± 0.739
0.261GluCys: 0.261 ± 0.14
3.047GluAsp: 3.047 ± 0.633
4.179GluGlu: 4.179 ± 0.816
3.047GluPhe: 3.047 ± 0.532
3.047GluGly: 3.047 ± 0.576
0.522GluHis: 0.522 ± 0.196
6.704GluIle: 6.704 ± 0.862
7.14GluLys: 7.14 ± 0.964
6.095GluLeu: 6.095 ± 0.82
1.48GluMet: 1.48 ± 0.345
3.918GluAsn: 3.918 ± 0.687
1.916GluPro: 1.916 ± 0.442
2.786GluGln: 2.786 ± 0.519
2.003GluArg: 2.003 ± 0.519
3.047GluSer: 3.047 ± 0.381
2.699GluThr: 2.699 ± 0.394
3.222GluVal: 3.222 ± 0.683
0.784GluTrp: 0.784 ± 0.249
2.699GluTyr: 2.699 ± 0.576
0.0GluXaa: 0.0 ± 0.0
Phe
2.525PheAla: 2.525 ± 0.382
0.522PheCys: 0.522 ± 0.235
3.135PheAsp: 3.135 ± 0.591
2.003PheGlu: 2.003 ± 0.444
1.132PhePhe: 1.132 ± 0.251
2.612PheGly: 2.612 ± 0.382
0.348PheHis: 0.348 ± 0.144
3.047PheIle: 3.047 ± 0.544
3.657PheLys: 3.657 ± 0.678
3.57PheLeu: 3.57 ± 0.542
1.219PheMet: 1.219 ± 0.27
2.525PheAsn: 2.525 ± 0.544
1.132PhePro: 1.132 ± 0.226
1.48PheGln: 1.48 ± 0.305
0.871PheArg: 0.871 ± 0.236
2.786PheSer: 2.786 ± 0.527
2.873PheThr: 2.873 ± 0.558
1.654PheVal: 1.654 ± 0.42
0.348PheTrp: 0.348 ± 0.19
1.916PheTyr: 1.916 ± 0.435
0.0PheXaa: 0.0 ± 0.0
Gly
3.396GlyAla: 3.396 ± 0.736
0.261GlyCys: 0.261 ± 0.161
3.57GlyAsp: 3.57 ± 0.438
4.179GlyGlu: 4.179 ± 0.689
2.699GlyPhe: 2.699 ± 0.504
5.224GlyGly: 5.224 ± 0.938
1.045GlyHis: 1.045 ± 0.295
4.441GlyIle: 4.441 ± 0.734
5.66GlyLys: 5.66 ± 0.524
5.224GlyLeu: 5.224 ± 0.898
1.48GlyMet: 1.48 ± 0.332
4.441GlyAsn: 4.441 ± 0.705
0.958GlyPro: 0.958 ± 0.248
3.918GlyGln: 3.918 ± 0.454
1.654GlyArg: 1.654 ± 0.443
5.834GlySer: 5.834 ± 0.684
7.14GlyThr: 7.14 ± 1.52
3.744GlyVal: 3.744 ± 0.716
0.784GlyTrp: 0.784 ± 0.248
3.309GlyTyr: 3.309 ± 0.461
0.087GlyXaa: 0.087 ± 0.081
His
1.567HisAla: 1.567 ± 0.447
0.174HisCys: 0.174 ± 0.135
0.958HisAsp: 0.958 ± 0.272
0.871HisGlu: 0.871 ± 0.271
0.871HisPhe: 0.871 ± 0.291
0.784HisGly: 0.784 ± 0.239
0.174HisHis: 0.174 ± 0.096
0.784HisIle: 0.784 ± 0.235
0.609HisLys: 0.609 ± 0.243
1.306HisLeu: 1.306 ± 0.363
0.348HisMet: 0.348 ± 0.151
0.958HisAsn: 0.958 ± 0.241
0.261HisPro: 0.261 ± 0.119
0.697HisGln: 0.697 ± 0.24
0.174HisArg: 0.174 ± 0.132
0.871HisSer: 0.871 ± 0.326
0.609HisThr: 0.609 ± 0.225
0.697HisVal: 0.697 ± 0.27
0.087HisTrp: 0.087 ± 0.081
0.522HisTyr: 0.522 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
4.876IleAla: 4.876 ± 0.705
0.174IleCys: 0.174 ± 0.113
5.485IleAsp: 5.485 ± 0.72
5.05IleGlu: 5.05 ± 0.709
2.786IlePhe: 2.786 ± 0.462
5.224IleGly: 5.224 ± 0.706
0.958IleHis: 0.958 ± 0.298
3.744IleIle: 3.744 ± 0.723
4.876IleLys: 4.876 ± 0.7
4.179IleLeu: 4.179 ± 0.517
1.567IleMet: 1.567 ± 0.373
5.572IleAsn: 5.572 ± 0.733
2.264IlePro: 2.264 ± 0.419
2.438IleGln: 2.438 ± 0.551
2.264IleArg: 2.264 ± 0.415
5.485IleSer: 5.485 ± 0.766
5.311IleThr: 5.311 ± 1.177
3.396IleVal: 3.396 ± 0.53
0.609IleTrp: 0.609 ± 0.244
2.177IleTyr: 2.177 ± 0.522
0.087IleXaa: 0.087 ± 0.081
Lys
6.356LysAla: 6.356 ± 1.386
0.435LysCys: 0.435 ± 0.214
6.095LysAsp: 6.095 ± 0.632
5.921LysGlu: 5.921 ± 0.964
3.657LysPhe: 3.657 ± 0.526
5.137LysGly: 5.137 ± 0.817
1.567LysHis: 1.567 ± 0.592
5.398LysIle: 5.398 ± 0.788
7.662LysLys: 7.662 ± 1.219
6.356LysLeu: 6.356 ± 0.913
1.828LysMet: 1.828 ± 0.38
6.356LysAsn: 6.356 ± 0.796
2.177LysPro: 2.177 ± 0.537
4.354LysGln: 4.354 ± 0.859
3.135LysArg: 3.135 ± 0.586
5.05LysSer: 5.05 ± 0.701
6.356LysThr: 6.356 ± 0.674
4.266LysVal: 4.266 ± 0.695
1.045LysTrp: 1.045 ± 0.28
3.309LysTyr: 3.309 ± 0.564
0.0LysXaa: 0.0 ± 0.0
Leu
5.137LeuAla: 5.137 ± 0.682
0.522LeuCys: 0.522 ± 0.242
4.179LeuAsp: 4.179 ± 0.623
4.876LeuGlu: 4.876 ± 0.692
2.003LeuPhe: 2.003 ± 0.462
4.789LeuGly: 4.789 ± 0.614
0.784LeuHis: 0.784 ± 0.252
4.005LeuIle: 4.005 ± 0.43
6.008LeuLys: 6.008 ± 0.88
6.269LeuLeu: 6.269 ± 0.774
2.177LeuMet: 2.177 ± 0.518
7.14LeuAsn: 7.14 ± 0.781
2.525LeuPro: 2.525 ± 0.398
3.918LeuGln: 3.918 ± 0.608
2.525LeuArg: 2.525 ± 0.485
8.62LeuSer: 8.62 ± 0.978
5.311LeuThr: 5.311 ± 0.822
3.309LeuVal: 3.309 ± 0.512
1.045LeuTrp: 1.045 ± 0.3
2.003LeuTyr: 2.003 ± 0.362
0.0LeuXaa: 0.0 ± 0.0
Met
1.567MetAla: 1.567 ± 0.335
0.348MetCys: 0.348 ± 0.17
0.784MetAsp: 0.784 ± 0.199
1.219MetGlu: 1.219 ± 0.32
0.697MetPhe: 0.697 ± 0.205
1.132MetGly: 1.132 ± 0.29
0.261MetHis: 0.261 ± 0.17
1.654MetIle: 1.654 ± 0.366
3.047MetLys: 3.047 ± 0.524
1.219MetLeu: 1.219 ± 0.34
0.697MetMet: 0.697 ± 0.245
1.916MetAsn: 1.916 ± 0.483
0.958MetPro: 0.958 ± 0.262
1.132MetGln: 1.132 ± 0.38
0.958MetArg: 0.958 ± 0.27
2.438MetSer: 2.438 ± 0.543
2.09MetThr: 2.09 ± 0.456
1.045MetVal: 1.045 ± 0.231
0.435MetTrp: 0.435 ± 0.197
0.784MetTyr: 0.784 ± 0.264
0.0MetXaa: 0.0 ± 0.0
Asn
4.179AsnAla: 4.179 ± 0.582
0.174AsnCys: 0.174 ± 0.138
4.005AsnAsp: 4.005 ± 0.83
3.483AsnGlu: 3.483 ± 0.539
2.438AsnPhe: 2.438 ± 0.531
6.008AsnGly: 6.008 ± 0.847
0.958AsnHis: 0.958 ± 0.285
4.528AsnIle: 4.528 ± 0.697
5.398AsnLys: 5.398 ± 0.78
5.311AsnLeu: 5.311 ± 0.676
1.916AsnMet: 1.916 ± 0.413
4.266AsnAsn: 4.266 ± 0.595
2.351AsnPro: 2.351 ± 0.414
3.047AsnGln: 3.047 ± 0.573
2.699AsnArg: 2.699 ± 0.495
4.789AsnSer: 4.789 ± 0.804
3.483AsnThr: 3.483 ± 0.566
3.483AsnVal: 3.483 ± 0.568
0.784AsnTrp: 0.784 ± 0.307
1.654AsnTyr: 1.654 ± 0.317
0.0AsnXaa: 0.0 ± 0.0
Pro
1.654ProAla: 1.654 ± 0.436
0.087ProCys: 0.087 ± 0.085
1.393ProAsp: 1.393 ± 0.397
2.003ProGlu: 2.003 ± 0.437
1.132ProPhe: 1.132 ± 0.29
1.045ProGly: 1.045 ± 0.289
0.697ProHis: 0.697 ± 0.247
1.654ProIle: 1.654 ± 0.398
2.612ProLys: 2.612 ± 0.531
2.786ProLeu: 2.786 ± 0.501
0.871ProMet: 0.871 ± 0.285
1.741ProAsn: 1.741 ± 0.583
0.784ProPro: 0.784 ± 0.262
1.567ProGln: 1.567 ± 0.348
0.261ProArg: 0.261 ± 0.23
2.351ProSer: 2.351 ± 0.757
2.525ProThr: 2.525 ± 0.509
2.264ProVal: 2.264 ± 0.564
0.348ProTrp: 0.348 ± 0.157
0.871ProTyr: 0.871 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
3.657GlnAla: 3.657 ± 0.737
0.087GlnCys: 0.087 ± 0.087
2.525GlnAsp: 2.525 ± 0.531
3.047GlnGlu: 3.047 ± 0.498
2.003GlnPhe: 2.003 ± 0.508
2.96GlnGly: 2.96 ± 0.534
0.348GlnHis: 0.348 ± 0.125
3.57GlnIle: 3.57 ± 0.538
3.744GlnLys: 3.744 ± 0.853
3.657GlnLeu: 3.657 ± 0.533
1.306GlnMet: 1.306 ± 0.317
1.48GlnAsn: 1.48 ± 0.421
0.784GlnPro: 0.784 ± 0.269
2.351GlnGln: 2.351 ± 0.498
1.916GlnArg: 1.916 ± 0.384
3.222GlnSer: 3.222 ± 0.436
2.699GlnThr: 2.699 ± 0.528
2.264GlnVal: 2.264 ± 0.384
0.261GlnTrp: 0.261 ± 0.176
1.828GlnTyr: 1.828 ± 0.381
0.0GlnXaa: 0.0 ± 0.0
Arg
2.525ArgAla: 2.525 ± 0.446
0.261ArgCys: 0.261 ± 0.14
1.828ArgAsp: 1.828 ± 0.403
2.351ArgGlu: 2.351 ± 0.595
2.177ArgPhe: 2.177 ± 0.37
1.828ArgGly: 1.828 ± 0.38
0.522ArgHis: 0.522 ± 0.241
1.916ArgIle: 1.916 ± 0.411
2.873ArgLys: 2.873 ± 0.616
2.438ArgLeu: 2.438 ± 0.541
0.697ArgMet: 0.697 ± 0.275
2.264ArgAsn: 2.264 ± 0.344
1.132ArgPro: 1.132 ± 0.353
1.045ArgGln: 1.045 ± 0.289
1.132ArgArg: 1.132 ± 0.396
1.654ArgSer: 1.654 ± 0.346
1.567ArgThr: 1.567 ± 0.433
1.916ArgVal: 1.916 ± 0.479
0.697ArgTrp: 0.697 ± 0.228
1.393ArgTyr: 1.393 ± 0.311
0.0ArgXaa: 0.0 ± 0.0
Ser
4.179SerAla: 4.179 ± 0.657
0.435SerCys: 0.435 ± 0.216
6.008SerAsp: 6.008 ± 0.936
5.137SerGlu: 5.137 ± 0.827
3.831SerPhe: 3.831 ± 0.524
5.398SerGly: 5.398 ± 1.086
0.871SerHis: 0.871 ± 0.32
5.137SerIle: 5.137 ± 0.82
5.921SerLys: 5.921 ± 0.761
5.398SerLeu: 5.398 ± 0.66
1.828SerMet: 1.828 ± 0.436
4.266SerAsn: 4.266 ± 0.523
1.828SerPro: 1.828 ± 0.317
2.873SerGln: 2.873 ± 0.502
2.96SerArg: 2.96 ± 0.496
6.095SerSer: 6.095 ± 0.872
5.137SerThr: 5.137 ± 0.811
5.485SerVal: 5.485 ± 0.592
0.522SerTrp: 0.522 ± 0.226
2.264SerTyr: 2.264 ± 0.417
0.0SerXaa: 0.0 ± 0.0
Thr
6.008ThrAla: 6.008 ± 0.907
0.435ThrCys: 0.435 ± 0.255
4.005ThrAsp: 4.005 ± 0.72
3.918ThrGlu: 3.918 ± 0.472
2.003ThrPhe: 2.003 ± 0.384
5.66ThrGly: 5.66 ± 0.771
1.132ThrHis: 1.132 ± 0.227
5.224ThrIle: 5.224 ± 0.977
5.66ThrLys: 5.66 ± 0.709
4.354ThrLeu: 4.354 ± 0.819
1.132ThrMet: 1.132 ± 0.321
4.092ThrAsn: 4.092 ± 1.046
1.916ThrPro: 1.916 ± 0.417
2.351ThrGln: 2.351 ± 0.415
2.09ThrArg: 2.09 ± 0.448
4.615ThrSer: 4.615 ± 0.708
5.747ThrThr: 5.747 ± 1.5
6.182ThrVal: 6.182 ± 0.989
1.045ThrTrp: 1.045 ± 0.304
2.264ThrTyr: 2.264 ± 0.536
0.0ThrXaa: 0.0 ± 0.0
Val
3.309ValAla: 3.309 ± 0.546
0.174ValCys: 0.174 ± 0.134
4.354ValAsp: 4.354 ± 0.719
2.96ValGlu: 2.96 ± 0.623
1.654ValPhe: 1.654 ± 0.451
3.047ValGly: 3.047 ± 0.434
0.522ValHis: 0.522 ± 0.191
4.354ValIle: 4.354 ± 0.584
6.356ValLys: 6.356 ± 1.003
4.876ValLeu: 4.876 ± 0.596
1.219ValMet: 1.219 ± 0.296
3.657ValAsn: 3.657 ± 0.605
2.264ValPro: 2.264 ± 0.614
2.003ValGln: 2.003 ± 0.421
1.567ValArg: 1.567 ± 0.509
4.266ValSer: 4.266 ± 0.547
4.179ValThr: 4.179 ± 0.649
3.918ValVal: 3.918 ± 0.487
0.784ValTrp: 0.784 ± 0.272
1.567ValTyr: 1.567 ± 0.305
0.0ValXaa: 0.0 ± 0.0
Trp
0.871TrpAla: 0.871 ± 0.231
0.0TrpCys: 0.0 ± 0.0
0.348TrpAsp: 0.348 ± 0.174
0.697TrpGlu: 0.697 ± 0.256
0.435TrpPhe: 0.435 ± 0.198
0.871TrpGly: 0.871 ± 0.231
0.261TrpHis: 0.261 ± 0.143
0.958TrpIle: 0.958 ± 0.266
0.871TrpLys: 0.871 ± 0.288
1.132TrpLeu: 1.132 ± 0.306
0.348TrpMet: 0.348 ± 0.155
0.697TrpAsn: 0.697 ± 0.306
0.0TrpPro: 0.0 ± 0.0
0.522TrpGln: 0.522 ± 0.189
0.522TrpArg: 0.522 ± 0.204
0.784TrpSer: 0.784 ± 0.312
1.306TrpThr: 1.306 ± 0.52
0.871TrpVal: 0.871 ± 0.255
0.261TrpTrp: 0.261 ± 0.135
0.609TrpTyr: 0.609 ± 0.286
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.786TyrAla: 2.786 ± 0.481
0.609TyrCys: 0.609 ± 0.314
2.612TyrAsp: 2.612 ± 0.452
2.09TyrGlu: 2.09 ± 0.415
1.393TyrPhe: 1.393 ± 0.342
2.612TyrGly: 2.612 ± 0.48
0.348TyrHis: 0.348 ± 0.194
2.351TyrIle: 2.351 ± 0.443
2.96TyrLys: 2.96 ± 0.49
2.873TyrLeu: 2.873 ± 0.591
0.784TyrMet: 0.784 ± 0.213
1.741TyrAsn: 1.741 ± 0.41
1.219TyrPro: 1.219 ± 0.413
1.828TyrGln: 1.828 ± 0.557
2.003TyrArg: 2.003 ± 0.408
2.786TyrSer: 2.786 ± 0.494
2.351TyrThr: 2.351 ± 0.586
1.306TyrVal: 1.306 ± 0.281
0.174TyrTrp: 0.174 ± 0.116
1.219TyrTyr: 1.219 ± 0.289
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.087XaaLeu: 0.087 ± 0.081
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.087XaaPro: 0.087 ± 0.081
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11486 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski