Amino acid dipepetide frequency for Pseudoalteromonas virus vB_PspP-H6/1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.022AlaAla: 9.022 ± 1.126
0.438AlaCys: 0.438 ± 0.219
5.518AlaAsp: 5.518 ± 0.74
7.971AlaGlu: 7.971 ± 1.382
3.854AlaPhe: 3.854 ± 0.538
6.92AlaGly: 6.92 ± 1.446
0.963AlaHis: 0.963 ± 0.318
5.343AlaIle: 5.343 ± 0.576
5.606AlaLys: 5.606 ± 0.841
6.92AlaLeu: 6.92 ± 0.888
2.628AlaMet: 2.628 ± 0.501
5.518AlaAsn: 5.518 ± 0.784
2.89AlaPro: 2.89 ± 0.561
4.292AlaGln: 4.292 ± 1.132
3.416AlaArg: 3.416 ± 0.434
6.044AlaSer: 6.044 ± 0.695
4.905AlaThr: 4.905 ± 0.842
4.905AlaVal: 4.905 ± 0.744
0.963AlaTrp: 0.963 ± 0.307
2.54AlaTyr: 2.54 ± 0.471
0.0AlaXaa: 0.0 ± 0.0
Cys
0.788CysAla: 0.788 ± 0.289
0.088CysCys: 0.088 ± 0.091
0.526CysAsp: 0.526 ± 0.243
0.788CysGlu: 0.788 ± 0.289
0.438CysPhe: 0.438 ± 0.219
1.314CysGly: 1.314 ± 0.357
0.263CysHis: 0.263 ± 0.132
0.438CysIle: 0.438 ± 0.18
1.051CysLys: 1.051 ± 0.316
0.613CysLeu: 0.613 ± 0.219
0.0CysMet: 0.0 ± 0.0
0.526CysAsn: 0.526 ± 0.186
0.438CysPro: 0.438 ± 0.231
0.263CysGln: 0.263 ± 0.152
0.701CysArg: 0.701 ± 0.234
0.701CysSer: 0.701 ± 0.302
0.263CysThr: 0.263 ± 0.156
0.788CysVal: 0.788 ± 0.275
0.175CysTrp: 0.175 ± 0.152
0.526CysTyr: 0.526 ± 0.174
0.0CysXaa: 0.0 ± 0.0
Asp
6.044AspAla: 6.044 ± 0.609
1.226AspCys: 1.226 ± 0.357
3.591AspAsp: 3.591 ± 0.566
4.555AspGlu: 4.555 ± 0.646
3.416AspPhe: 3.416 ± 0.602
5.606AspGly: 5.606 ± 0.705
1.226AspHis: 1.226 ± 0.286
4.117AspIle: 4.117 ± 0.675
3.679AspLys: 3.679 ± 0.618
4.817AspLeu: 4.817 ± 0.655
1.226AspMet: 1.226 ± 0.345
3.241AspAsn: 3.241 ± 0.428
1.664AspPro: 1.664 ± 0.498
1.577AspGln: 1.577 ± 0.682
2.54AspArg: 2.54 ± 0.411
4.555AspSer: 4.555 ± 0.525
2.803AspThr: 2.803 ± 0.553
5.168AspVal: 5.168 ± 0.691
0.701AspTrp: 0.701 ± 0.215
2.365AspTyr: 2.365 ± 0.451
0.0AspXaa: 0.0 ± 0.0
Glu
6.744GluAla: 6.744 ± 1.309
0.876GluCys: 0.876 ± 0.315
2.628GluAsp: 2.628 ± 0.401
3.854GluGlu: 3.854 ± 0.979
2.19GluPhe: 2.19 ± 0.413
3.941GluGly: 3.941 ± 0.626
0.701GluHis: 0.701 ± 0.247
4.379GluIle: 4.379 ± 0.676
3.766GluLys: 3.766 ± 0.591
7.182GluLeu: 7.182 ± 0.882
1.664GluMet: 1.664 ± 0.37
2.715GluAsn: 2.715 ± 0.401
2.015GluPro: 2.015 ± 0.438
3.854GluGln: 3.854 ± 0.979
3.416GluArg: 3.416 ± 0.837
4.467GluSer: 4.467 ± 0.712
4.204GluThr: 4.204 ± 0.587
4.117GluVal: 4.117 ± 0.648
1.577GluTrp: 1.577 ± 0.38
2.628GluTyr: 2.628 ± 0.444
0.0GluXaa: 0.0 ± 0.0
Phe
2.978PheAla: 2.978 ± 0.484
0.263PheCys: 0.263 ± 0.147
3.854PheAsp: 3.854 ± 0.597
3.153PheGlu: 3.153 ± 0.641
1.401PhePhe: 1.401 ± 0.397
3.504PheGly: 3.504 ± 0.583
0.613PheHis: 0.613 ± 0.229
2.628PheIle: 2.628 ± 0.471
2.015PheLys: 2.015 ± 0.353
1.401PheLeu: 1.401 ± 0.29
0.963PheMet: 0.963 ± 0.321
2.102PheAsn: 2.102 ± 0.448
1.226PhePro: 1.226 ± 0.39
1.226PheGln: 1.226 ± 0.377
1.139PheArg: 1.139 ± 0.272
2.715PheSer: 2.715 ± 0.44
3.328PheThr: 3.328 ± 0.82
2.365PheVal: 2.365 ± 0.6
0.701PheTrp: 0.701 ± 0.231
1.839PheTyr: 1.839 ± 0.477
0.0PheXaa: 0.0 ± 0.0
Gly
8.409GlyAla: 8.409 ± 1.241
0.613GlyCys: 0.613 ± 0.213
5.255GlyAsp: 5.255 ± 0.625
5.868GlyGlu: 5.868 ± 0.588
3.066GlyPhe: 3.066 ± 0.467
6.044GlyGly: 6.044 ± 0.856
0.526GlyHis: 0.526 ± 0.238
4.379GlyIle: 4.379 ± 0.708
4.467GlyLys: 4.467 ± 0.671
7.095GlyLeu: 7.095 ± 0.875
2.365GlyMet: 2.365 ± 0.426
4.555GlyAsn: 4.555 ± 0.722
0.526GlyPro: 0.526 ± 0.233
3.241GlyGln: 3.241 ± 0.608
3.941GlyArg: 3.941 ± 0.647
4.117GlySer: 4.117 ± 0.759
4.817GlyThr: 4.817 ± 1.045
6.569GlyVal: 6.569 ± 0.766
1.051GlyTrp: 1.051 ± 0.328
2.89GlyTyr: 2.89 ± 0.593
0.0GlyXaa: 0.0 ± 0.0
His
1.314HisAla: 1.314 ± 0.322
0.175HisCys: 0.175 ± 0.112
0.701HisAsp: 0.701 ± 0.209
0.788HisGlu: 0.788 ± 0.37
0.526HisPhe: 0.526 ± 0.213
0.876HisGly: 0.876 ± 0.231
0.088HisHis: 0.088 ± 0.088
0.263HisIle: 0.263 ± 0.235
1.139HisLys: 1.139 ± 0.327
0.963HisLeu: 0.963 ± 0.237
0.263HisMet: 0.263 ± 0.135
0.963HisAsn: 0.963 ± 0.293
0.701HisPro: 0.701 ± 0.29
0.613HisGln: 0.613 ± 0.19
0.876HisArg: 0.876 ± 0.248
1.051HisSer: 1.051 ± 0.234
0.526HisThr: 0.526 ± 0.203
1.401HisVal: 1.401 ± 0.439
0.0HisTrp: 0.0 ± 0.0
0.35HisTyr: 0.35 ± 0.229
0.0HisXaa: 0.0 ± 0.0
Ile
3.941IleAla: 3.941 ± 0.559
0.876IleCys: 0.876 ± 0.24
4.817IleAsp: 4.817 ± 0.641
4.555IleGlu: 4.555 ± 0.755
1.401IlePhe: 1.401 ± 0.262
4.379IleGly: 4.379 ± 0.462
0.788IleHis: 0.788 ± 0.262
2.715IleIle: 2.715 ± 0.47
3.854IleLys: 3.854 ± 0.439
3.766IleLeu: 3.766 ± 0.562
1.839IleMet: 1.839 ± 0.337
3.241IleAsn: 3.241 ± 0.416
1.664IlePro: 1.664 ± 0.381
2.803IleGln: 2.803 ± 0.485
2.277IleArg: 2.277 ± 0.41
3.504IleSer: 3.504 ± 0.455
3.679IleThr: 3.679 ± 0.463
2.978IleVal: 2.978 ± 0.487
0.876IleTrp: 0.876 ± 0.285
1.839IleTyr: 1.839 ± 0.457
0.0IleXaa: 0.0 ± 0.0
Lys
5.956LysAla: 5.956 ± 0.74
0.613LysCys: 0.613 ± 0.222
2.978LysAsp: 2.978 ± 0.508
3.153LysGlu: 3.153 ± 0.798
1.314LysPhe: 1.314 ± 0.392
4.993LysGly: 4.993 ± 0.659
1.401LysHis: 1.401 ± 0.382
3.679LysIle: 3.679 ± 0.608
3.591LysLys: 3.591 ± 0.764
5.343LysLeu: 5.343 ± 0.61
3.066LysMet: 3.066 ± 0.656
3.066LysAsn: 3.066 ± 0.558
3.241LysPro: 3.241 ± 0.598
2.803LysGln: 2.803 ± 0.51
4.204LysArg: 4.204 ± 0.566
3.679LysSer: 3.679 ± 0.665
3.241LysThr: 3.241 ± 0.533
4.204LysVal: 4.204 ± 0.869
0.963LysTrp: 0.963 ± 0.31
2.365LysTyr: 2.365 ± 0.309
0.0LysXaa: 0.0 ± 0.0
Leu
7.182LeuAla: 7.182 ± 0.938
0.613LeuCys: 0.613 ± 0.226
5.08LeuAsp: 5.08 ± 0.65
5.08LeuGlu: 5.08 ± 0.936
2.628LeuPhe: 2.628 ± 0.362
5.168LeuGly: 5.168 ± 0.771
0.876LeuHis: 0.876 ± 0.343
3.066LeuIle: 3.066 ± 0.476
5.43LeuLys: 5.43 ± 0.819
3.416LeuLeu: 3.416 ± 0.725
2.277LeuMet: 2.277 ± 0.362
4.029LeuAsn: 4.029 ± 0.698
3.241LeuPro: 3.241 ± 0.403
3.153LeuGln: 3.153 ± 0.52
3.066LeuArg: 3.066 ± 0.458
6.044LeuSer: 6.044 ± 0.794
3.854LeuThr: 3.854 ± 0.724
3.241LeuVal: 3.241 ± 0.472
1.051LeuTrp: 1.051 ± 0.33
2.715LeuTyr: 2.715 ± 0.627
0.0LeuXaa: 0.0 ± 0.0
Met
2.277MetAla: 2.277 ± 0.501
0.526MetCys: 0.526 ± 0.277
1.664MetAsp: 1.664 ± 0.398
1.314MetGlu: 1.314 ± 0.348
0.701MetPhe: 0.701 ± 0.295
2.102MetGly: 2.102 ± 0.445
0.088MetHis: 0.088 ± 0.078
1.314MetIle: 1.314 ± 0.471
1.489MetLys: 1.489 ± 0.361
2.365MetLeu: 2.365 ± 0.537
0.438MetMet: 0.438 ± 0.169
1.577MetAsn: 1.577 ± 0.422
0.788MetPro: 0.788 ± 0.284
1.752MetGln: 1.752 ± 0.428
1.314MetArg: 1.314 ± 0.289
3.153MetSer: 3.153 ± 0.484
2.015MetThr: 2.015 ± 0.424
1.401MetVal: 1.401 ± 0.343
0.175MetTrp: 0.175 ± 0.195
0.613MetTyr: 0.613 ± 0.241
0.0MetXaa: 0.0 ± 0.0
Asn
4.817AsnAla: 4.817 ± 0.616
0.876AsnCys: 0.876 ± 0.32
3.591AsnAsp: 3.591 ± 0.571
2.89AsnGlu: 2.89 ± 0.553
2.452AsnPhe: 2.452 ± 0.328
4.993AsnGly: 4.993 ± 0.727
0.438AsnHis: 0.438 ± 0.188
2.89AsnIle: 2.89 ± 0.432
2.452AsnLys: 2.452 ± 0.424
3.854AsnLeu: 3.854 ± 0.671
1.051AsnMet: 1.051 ± 0.354
2.54AsnAsn: 2.54 ± 0.462
2.365AsnPro: 2.365 ± 0.407
3.241AsnGln: 3.241 ± 0.923
3.066AsnArg: 3.066 ± 0.423
2.803AsnSer: 2.803 ± 0.478
2.54AsnThr: 2.54 ± 0.659
3.941AsnVal: 3.941 ± 0.875
0.788AsnTrp: 0.788 ± 0.222
1.839AsnTyr: 1.839 ± 0.345
0.0AsnXaa: 0.0 ± 0.0
Pro
2.365ProAla: 2.365 ± 0.443
0.35ProCys: 0.35 ± 0.174
3.066ProAsp: 3.066 ± 0.676
2.628ProGlu: 2.628 ± 0.53
1.752ProPhe: 1.752 ± 0.334
0.438ProGly: 0.438 ± 0.161
0.35ProHis: 0.35 ± 0.151
1.839ProIle: 1.839 ± 0.399
2.015ProLys: 2.015 ± 0.563
2.102ProLeu: 2.102 ± 0.482
0.788ProMet: 0.788 ± 0.245
1.489ProAsn: 1.489 ± 0.27
0.876ProPro: 0.876 ± 0.332
1.489ProGln: 1.489 ± 0.48
0.788ProArg: 0.788 ± 0.264
2.715ProSer: 2.715 ± 0.417
1.927ProThr: 1.927 ± 0.393
2.978ProVal: 2.978 ± 0.592
0.438ProTrp: 0.438 ± 0.184
1.139ProTyr: 1.139 ± 0.32
0.0ProXaa: 0.0 ± 0.0
Gln
5.43GlnAla: 5.43 ± 1.175
0.175GlnCys: 0.175 ± 0.107
1.927GlnAsp: 1.927 ± 0.5
3.241GlnGlu: 3.241 ± 0.674
1.139GlnPhe: 1.139 ± 0.261
3.679GlnGly: 3.679 ± 0.799
0.526GlnHis: 0.526 ± 0.218
2.978GlnIle: 2.978 ± 0.563
2.19GlnLys: 2.19 ± 0.503
2.89GlnLeu: 2.89 ± 0.491
1.226GlnMet: 1.226 ± 0.362
1.489GlnAsn: 1.489 ± 0.396
1.314GlnPro: 1.314 ± 0.412
4.905GlnGln: 4.905 ± 1.727
1.927GlnArg: 1.927 ± 0.629
3.854GlnSer: 3.854 ± 0.795
2.803GlnThr: 2.803 ± 0.457
2.102GlnVal: 2.102 ± 0.351
0.613GlnTrp: 0.613 ± 0.204
1.839GlnTyr: 1.839 ± 0.382
0.0GlnXaa: 0.0 ± 0.0
Arg
4.905ArgAla: 4.905 ± 0.7
0.613ArgCys: 0.613 ± 0.249
2.628ArgAsp: 2.628 ± 0.512
2.978ArgGlu: 2.978 ± 0.85
1.839ArgPhe: 1.839 ± 0.33
2.365ArgGly: 2.365 ± 0.408
0.263ArgHis: 0.263 ± 0.154
2.978ArgIle: 2.978 ± 0.678
4.029ArgLys: 4.029 ± 0.671
3.766ArgLeu: 3.766 ± 0.598
1.664ArgMet: 1.664 ± 0.339
2.277ArgAsn: 2.277 ± 0.388
1.314ArgPro: 1.314 ± 0.383
2.015ArgGln: 2.015 ± 0.336
1.664ArgArg: 1.664 ± 0.402
2.715ArgSer: 2.715 ± 0.482
1.139ArgThr: 1.139 ± 0.307
3.941ArgVal: 3.941 ± 0.612
0.701ArgTrp: 0.701 ± 0.271
1.577ArgTyr: 1.577 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
5.168SerAla: 5.168 ± 0.73
0.438SerCys: 0.438 ± 0.218
3.679SerAsp: 3.679 ± 0.527
4.993SerGlu: 4.993 ± 0.637
3.066SerPhe: 3.066 ± 0.631
7.27SerGly: 7.27 ± 0.957
0.788SerHis: 0.788 ± 0.268
4.029SerIle: 4.029 ± 0.605
4.817SerLys: 4.817 ± 0.661
5.343SerLeu: 5.343 ± 0.495
1.314SerMet: 1.314 ± 0.295
4.117SerAsn: 4.117 ± 0.632
1.752SerPro: 1.752 ± 0.415
2.54SerGln: 2.54 ± 0.564
3.066SerArg: 3.066 ± 0.454
3.591SerSer: 3.591 ± 0.664
4.029SerThr: 4.029 ± 0.735
3.679SerVal: 3.679 ± 0.662
1.314SerTrp: 1.314 ± 0.314
2.715SerTyr: 2.715 ± 0.44
0.0SerXaa: 0.0 ± 0.0
Thr
3.766ThrAla: 3.766 ± 0.566
0.088ThrCys: 0.088 ± 0.076
3.679ThrAsp: 3.679 ± 0.676
2.978ThrGlu: 2.978 ± 0.539
3.066ThrPhe: 3.066 ± 0.626
6.92ThrGly: 6.92 ± 1.075
1.226ThrHis: 1.226 ± 0.421
2.89ThrIle: 2.89 ± 0.45
3.328ThrLys: 3.328 ± 0.664
3.679ThrLeu: 3.679 ± 0.825
1.401ThrMet: 1.401 ± 0.346
2.54ThrAsn: 2.54 ± 0.509
2.19ThrPro: 2.19 ± 0.403
2.89ThrGln: 2.89 ± 0.437
2.89ThrArg: 2.89 ± 0.491
3.416ThrSer: 3.416 ± 0.61
3.066ThrThr: 3.066 ± 0.525
4.555ThrVal: 4.555 ± 0.704
0.438ThrTrp: 0.438 ± 0.162
1.489ThrTyr: 1.489 ± 0.425
0.0ThrXaa: 0.0 ± 0.0
Val
4.642ValAla: 4.642 ± 0.63
0.963ValCys: 0.963 ± 0.342
5.693ValAsp: 5.693 ± 0.743
3.416ValGlu: 3.416 ± 0.686
2.89ValPhe: 2.89 ± 0.689
4.905ValGly: 4.905 ± 0.653
1.489ValHis: 1.489 ± 0.346
3.328ValIle: 3.328 ± 0.565
5.868ValLys: 5.868 ± 0.713
2.803ValLeu: 2.803 ± 0.579
1.577ValMet: 1.577 ± 0.3
4.817ValAsn: 4.817 ± 0.453
2.102ValPro: 2.102 ± 0.376
1.839ValGln: 1.839 ± 0.381
2.365ValArg: 2.365 ± 0.315
4.905ValSer: 4.905 ± 0.593
4.73ValThr: 4.73 ± 0.781
5.606ValVal: 5.606 ± 0.637
0.788ValTrp: 0.788 ± 0.252
2.19ValTyr: 2.19 ± 0.524
0.0ValXaa: 0.0 ± 0.0
Trp
1.489TrpAla: 1.489 ± 0.373
0.175TrpCys: 0.175 ± 0.143
0.701TrpAsp: 0.701 ± 0.235
0.701TrpGlu: 0.701 ± 0.216
0.788TrpPhe: 0.788 ± 0.23
1.139TrpGly: 1.139 ± 0.263
0.613TrpHis: 0.613 ± 0.276
0.701TrpIle: 0.701 ± 0.253
0.963TrpLys: 0.963 ± 0.32
1.489TrpLeu: 1.489 ± 0.351
0.35TrpMet: 0.35 ± 0.165
0.175TrpAsn: 0.175 ± 0.104
0.175TrpPro: 0.175 ± 0.11
0.438TrpGln: 0.438 ± 0.177
0.963TrpArg: 0.963 ± 0.277
0.788TrpSer: 0.788 ± 0.307
0.438TrpThr: 0.438 ± 0.133
1.314TrpVal: 1.314 ± 0.421
0.263TrpTrp: 0.263 ± 0.19
0.35TrpTyr: 0.35 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.416TyrAla: 3.416 ± 0.473
0.701TyrCys: 0.701 ± 0.302
2.54TyrAsp: 2.54 ± 0.49
2.015TyrGlu: 2.015 ± 0.453
1.489TyrPhe: 1.489 ± 0.3
3.241TyrGly: 3.241 ± 0.434
0.35TyrHis: 0.35 ± 0.181
1.839TyrIle: 1.839 ± 0.38
2.102TyrLys: 2.102 ± 0.503
1.401TyrLeu: 1.401 ± 0.416
0.876TyrMet: 0.876 ± 0.259
2.452TyrAsn: 2.452 ± 0.35
1.051TyrPro: 1.051 ± 0.288
1.314TyrGln: 1.314 ± 0.255
1.839TyrArg: 1.839 ± 0.408
2.89TyrSer: 2.89 ± 0.575
2.19TyrThr: 2.19 ± 0.618
1.664TyrVal: 1.664 ± 0.341
0.438TyrTrp: 0.438 ± 0.165
1.401TyrTyr: 1.401 ± 0.45
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (11418 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski