Amino acid dipepetide frequency for Flavobacterium phage vB_FspS_hattifnatt9-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.128AlaAla: 1.128 ± 0.526
0.242AlaCys: 0.242 ± 0.137
1.773AlaAsp: 1.773 ± 0.413
2.982AlaGlu: 2.982 ± 0.555
2.418AlaPhe: 2.418 ± 0.458
2.096AlaGly: 2.096 ± 0.552
0.484AlaHis: 0.484 ± 0.197
3.708AlaIle: 3.708 ± 0.429
5.562AlaLys: 5.562 ± 0.782
4.917AlaLeu: 4.917 ± 0.61
1.532AlaMet: 1.532 ± 0.594
5.562AlaAsn: 5.562 ± 0.702
0.806AlaPro: 0.806 ± 0.24
2.096AlaGln: 2.096 ± 0.569
1.209AlaArg: 1.209 ± 0.376
3.305AlaSer: 3.305 ± 0.747
4.03AlaThr: 4.03 ± 0.63
3.305AlaVal: 3.305 ± 0.74
0.806AlaTrp: 0.806 ± 0.244
1.693AlaTyr: 1.693 ± 0.357
0.0AlaXaa: 0.0 ± 0.0
Cys
0.081CysAla: 0.081 ± 0.064
0.242CysCys: 0.242 ± 0.151
0.887CysAsp: 0.887 ± 0.302
1.209CysGlu: 1.209 ± 0.303
0.887CysPhe: 0.887 ± 0.251
0.806CysGly: 0.806 ± 0.27
0.081CysHis: 0.081 ± 0.083
0.564CysIle: 0.564 ± 0.244
0.725CysLys: 0.725 ± 0.274
1.29CysLeu: 1.29 ± 0.35
0.081CysMet: 0.081 ± 0.082
0.484CysAsn: 0.484 ± 0.263
0.564CysPro: 0.564 ± 0.217
0.242CysGln: 0.242 ± 0.13
0.403CysArg: 0.403 ± 0.184
0.725CysSer: 0.725 ± 0.248
0.645CysThr: 0.645 ± 0.22
0.564CysVal: 0.564 ± 0.223
0.081CysTrp: 0.081 ± 0.091
0.403CysTyr: 0.403 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
3.627AspAla: 3.627 ± 0.517
0.806AspCys: 0.806 ± 0.224
2.982AspAsp: 2.982 ± 0.421
4.514AspGlu: 4.514 ± 0.606
3.95AspPhe: 3.95 ± 0.473
2.982AspGly: 2.982 ± 0.551
0.645AspHis: 0.645 ± 0.237
3.627AspIle: 3.627 ± 0.516
5.32AspLys: 5.32 ± 0.75
5.159AspLeu: 5.159 ± 0.639
1.612AspMet: 1.612 ± 0.421
4.353AspAsn: 4.353 ± 0.658
0.081AspPro: 0.081 ± 0.086
0.645AspGln: 0.645 ± 0.224
1.451AspArg: 1.451 ± 0.296
3.788AspSer: 3.788 ± 0.498
2.982AspThr: 2.982 ± 0.549
3.224AspVal: 3.224 ± 0.546
0.484AspTrp: 0.484 ± 0.17
2.902AspTyr: 2.902 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
2.821GluAla: 2.821 ± 0.497
0.645GluCys: 0.645 ± 0.246
3.144GluAsp: 3.144 ± 0.589
3.708GluGlu: 3.708 ± 0.53
4.192GluPhe: 4.192 ± 0.591
1.854GluGly: 1.854 ± 0.358
1.128GluHis: 1.128 ± 0.32
7.496GluIle: 7.496 ± 0.775
6.61GluLys: 6.61 ± 0.925
7.577GluLeu: 7.577 ± 0.92
1.773GluMet: 1.773 ± 0.332
6.126GluAsn: 6.126 ± 0.78
2.096GluPro: 2.096 ± 0.392
3.627GluGln: 3.627 ± 0.408
2.418GluArg: 2.418 ± 0.528
3.869GluSer: 3.869 ± 0.513
3.95GluThr: 3.95 ± 0.582
3.547GluVal: 3.547 ± 0.546
0.403GluTrp: 0.403 ± 0.178
3.385GluTyr: 3.385 ± 0.593
0.0GluXaa: 0.0 ± 0.0
Phe
2.338PheAla: 2.338 ± 0.414
0.403PheCys: 0.403 ± 0.18
3.869PheAsp: 3.869 ± 0.548
3.144PheGlu: 3.144 ± 0.527
2.499PhePhe: 2.499 ± 0.41
3.547PheGly: 3.547 ± 0.494
0.564PheHis: 0.564 ± 0.246
4.03PheIle: 4.03 ± 0.63
4.595PheLys: 4.595 ± 0.583
4.111PheLeu: 4.111 ± 0.72
0.887PheMet: 0.887 ± 0.268
5.481PheAsn: 5.481 ± 0.823
1.128PhePro: 1.128 ± 0.261
1.451PheGln: 1.451 ± 0.325
1.048PheArg: 1.048 ± 0.286
3.869PheSer: 3.869 ± 0.519
4.595PheThr: 4.595 ± 0.631
2.257PheVal: 2.257 ± 0.515
0.645PheTrp: 0.645 ± 0.178
1.854PheTyr: 1.854 ± 0.375
0.0PheXaa: 0.0 ± 0.0
Gly
2.66GlyAla: 2.66 ± 0.544
0.564GlyCys: 0.564 ± 0.222
2.418GlyAsp: 2.418 ± 0.462
2.015GlyGlu: 2.015 ± 0.437
2.902GlyPhe: 2.902 ± 0.63
1.854GlyGly: 1.854 ± 0.594
0.645GlyHis: 0.645 ± 0.233
3.869GlyIle: 3.869 ± 0.419
3.627GlyLys: 3.627 ± 0.587
4.111GlyLeu: 4.111 ± 0.499
0.806GlyMet: 0.806 ± 0.226
4.272GlyAsn: 4.272 ± 0.642
0.0GlyPro: 0.0 ± 0.0
1.773GlyGln: 1.773 ± 0.403
2.015GlyArg: 2.015 ± 0.348
3.063GlySer: 3.063 ± 0.542
4.111GlyThr: 4.111 ± 0.672
3.224GlyVal: 3.224 ± 0.398
0.242GlyTrp: 0.242 ± 0.118
2.418GlyTyr: 2.418 ± 0.362
0.0GlyXaa: 0.0 ± 0.0
His
0.322HisAla: 0.322 ± 0.152
0.484HisCys: 0.484 ± 0.183
0.887HisAsp: 0.887 ± 0.238
0.645HisGlu: 0.645 ± 0.181
0.806HisPhe: 0.806 ± 0.254
0.645HisGly: 0.645 ± 0.255
0.403HisHis: 0.403 ± 0.205
1.209HisIle: 1.209 ± 0.325
0.806HisLys: 0.806 ± 0.293
1.209HisLeu: 1.209 ± 0.318
0.0HisMet: 0.0 ± 0.0
1.128HisAsn: 1.128 ± 0.288
0.564HisPro: 0.564 ± 0.22
0.645HisGln: 0.645 ± 0.173
0.645HisArg: 0.645 ± 0.251
1.128HisSer: 1.128 ± 0.323
0.967HisThr: 0.967 ± 0.246
0.564HisVal: 0.564 ± 0.236
0.242HisTrp: 0.242 ± 0.186
1.048HisTyr: 1.048 ± 0.25
0.0HisXaa: 0.0 ± 0.0
Ile
4.111IleAla: 4.111 ± 0.404
1.048IleCys: 1.048 ± 0.311
5.401IleAsp: 5.401 ± 0.735
7.738IleGlu: 7.738 ± 0.647
3.305IlePhe: 3.305 ± 0.526
3.708IleGly: 3.708 ± 0.65
0.967IleHis: 0.967 ± 0.356
6.448IleIle: 6.448 ± 0.849
8.302IleLys: 8.302 ± 0.873
7.174IleLeu: 7.174 ± 0.792
1.451IleMet: 1.451 ± 0.372
6.771IleAsn: 6.771 ± 0.708
2.096IlePro: 2.096 ± 0.329
3.385IleGln: 3.385 ± 0.548
2.257IleArg: 2.257 ± 0.413
5.481IleSer: 5.481 ± 0.704
4.595IleThr: 4.595 ± 0.69
5.159IleVal: 5.159 ± 0.755
0.645IleTrp: 0.645 ± 0.235
2.982IleTyr: 2.982 ± 0.536
0.0IleXaa: 0.0 ± 0.0
Lys
4.917LysAla: 4.917 ± 0.67
1.29LysCys: 1.29 ± 0.364
4.675LysAsp: 4.675 ± 0.486
8.705LysGlu: 8.705 ± 1.2
4.192LysPhe: 4.192 ± 0.443
3.95LysGly: 3.95 ± 0.561
2.015LysHis: 2.015 ± 0.4
8.302LysIle: 8.302 ± 0.801
8.625LysLys: 8.625 ± 0.908
7.255LysLeu: 7.255 ± 0.767
3.708LysMet: 3.708 ± 0.607
5.562LysAsn: 5.562 ± 0.655
2.579LysPro: 2.579 ± 0.463
4.433LysGln: 4.433 ± 0.693
3.385LysArg: 3.385 ± 0.542
5.562LysSer: 5.562 ± 0.604
5.965LysThr: 5.965 ± 0.833
4.917LysVal: 4.917 ± 0.598
1.128LysTrp: 1.128 ± 0.28
4.192LysTyr: 4.192 ± 0.576
0.0LysXaa: 0.0 ± 0.0
Leu
3.788LeuAla: 3.788 ± 0.874
0.564LeuCys: 0.564 ± 0.256
5.804LeuAsp: 5.804 ± 0.832
6.126LeuGlu: 6.126 ± 0.713
3.95LeuPhe: 3.95 ± 0.59
3.869LeuGly: 3.869 ± 0.538
0.967LeuHis: 0.967 ± 0.326
8.383LeuIle: 8.383 ± 0.867
8.705LeuLys: 8.705 ± 0.879
6.61LeuLeu: 6.61 ± 0.746
2.096LeuMet: 2.096 ± 0.381
7.577LeuAsn: 7.577 ± 0.753
3.224LeuPro: 3.224 ± 0.49
3.708LeuGln: 3.708 ± 0.513
3.224LeuArg: 3.224 ± 0.611
5.884LeuSer: 5.884 ± 0.712
6.771LeuThr: 6.771 ± 0.723
5.32LeuVal: 5.32 ± 0.708
0.967LeuTrp: 0.967 ± 0.281
3.224LeuTyr: 3.224 ± 0.589
0.0LeuXaa: 0.0 ± 0.0
Met
1.773MetAla: 1.773 ± 0.454
0.081MetCys: 0.081 ± 0.084
0.806MetAsp: 0.806 ± 0.264
1.693MetGlu: 1.693 ± 0.368
1.209MetPhe: 1.209 ± 0.262
0.806MetGly: 0.806 ± 0.283
0.403MetHis: 0.403 ± 0.145
1.29MetIle: 1.29 ± 0.362
2.982MetLys: 2.982 ± 0.537
2.015MetLeu: 2.015 ± 0.359
0.242MetMet: 0.242 ± 0.126
0.887MetAsn: 0.887 ± 0.291
1.209MetPro: 1.209 ± 0.297
1.209MetGln: 1.209 ± 0.342
0.967MetArg: 0.967 ± 0.219
1.773MetSer: 1.773 ± 0.335
1.209MetThr: 1.209 ± 0.318
1.128MetVal: 1.128 ± 0.265
0.322MetTrp: 0.322 ± 0.168
0.403MetTyr: 0.403 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
4.917AsnAla: 4.917 ± 0.87
0.645AsnCys: 0.645 ± 0.204
4.595AsnAsp: 4.595 ± 0.867
6.045AsnGlu: 6.045 ± 0.819
4.111AsnPhe: 4.111 ± 0.626
3.869AsnGly: 3.869 ± 0.613
1.128AsnHis: 1.128 ± 0.286
5.32AsnIle: 5.32 ± 0.575
8.867AsnLys: 8.867 ± 1.194
7.335AsnLeu: 7.335 ± 0.816
1.29AsnMet: 1.29 ± 0.355
5.078AsnAsn: 5.078 ± 0.818
2.015AsnPro: 2.015 ± 0.393
2.499AsnGln: 2.499 ± 0.333
2.015AsnArg: 2.015 ± 0.343
4.998AsnSer: 4.998 ± 0.617
4.111AsnThr: 4.111 ± 0.49
4.836AsnVal: 4.836 ± 0.556
0.887AsnTrp: 0.887 ± 0.275
4.675AsnTyr: 4.675 ± 0.583
0.0AsnXaa: 0.0 ± 0.0
Pro
1.209ProAla: 1.209 ± 0.272
0.403ProCys: 0.403 ± 0.179
1.128ProAsp: 1.128 ± 0.294
2.015ProGlu: 2.015 ± 0.489
1.612ProPhe: 1.612 ± 0.352
0.0ProGly: 0.0 ± 0.0
0.242ProHis: 0.242 ± 0.137
1.773ProIle: 1.773 ± 0.409
1.612ProLys: 1.612 ± 0.367
2.579ProLeu: 2.579 ± 0.469
0.484ProMet: 0.484 ± 0.198
2.66ProAsn: 2.66 ± 0.446
0.322ProPro: 0.322 ± 0.143
0.806ProGln: 0.806 ± 0.21
0.322ProArg: 0.322 ± 0.141
2.176ProSer: 2.176 ± 0.444
1.854ProThr: 1.854 ± 0.405
1.048ProVal: 1.048 ± 0.302
0.0ProTrp: 0.0 ± 0.0
1.209ProTyr: 1.209 ± 0.352
0.0ProXaa: 0.0 ± 0.0
Gln
2.418GlnAla: 2.418 ± 0.755
0.403GlnCys: 0.403 ± 0.21
1.209GlnAsp: 1.209 ± 0.34
1.935GlnGlu: 1.935 ± 0.366
1.612GlnPhe: 1.612 ± 0.304
1.935GlnGly: 1.935 ± 0.364
0.887GlnHis: 0.887 ± 0.304
4.03GlnIle: 4.03 ± 0.626
3.547GlnLys: 3.547 ± 0.608
4.03GlnLeu: 4.03 ± 0.474
0.967GlnMet: 0.967 ± 0.324
2.176GlnAsn: 2.176 ± 0.351
1.048GlnPro: 1.048 ± 0.293
1.128GlnGln: 1.128 ± 0.334
2.096GlnArg: 2.096 ± 0.481
2.418GlnSer: 2.418 ± 0.422
2.176GlnThr: 2.176 ± 0.365
2.176GlnVal: 2.176 ± 0.386
0.645GlnTrp: 0.645 ± 0.233
1.451GlnTyr: 1.451 ± 0.351
0.0GlnXaa: 0.0 ± 0.0
Arg
0.887ArgAla: 0.887 ± 0.286
0.564ArgCys: 0.564 ± 0.182
1.209ArgAsp: 1.209 ± 0.298
2.176ArgGlu: 2.176 ± 0.525
1.37ArgPhe: 1.37 ± 0.378
1.37ArgGly: 1.37 ± 0.333
0.403ArgHis: 0.403 ± 0.209
2.579ArgIle: 2.579 ± 0.536
2.982ArgLys: 2.982 ± 0.451
4.111ArgLeu: 4.111 ± 0.673
0.645ArgMet: 0.645 ± 0.194
2.096ArgAsn: 2.096 ± 0.363
0.081ArgPro: 0.081 ± 0.086
1.29ArgGln: 1.29 ± 0.386
1.209ArgArg: 1.209 ± 0.329
1.854ArgSer: 1.854 ± 0.353
2.015ArgThr: 2.015 ± 0.393
2.338ArgVal: 2.338 ± 0.369
0.081ArgTrp: 0.081 ± 0.075
1.854ArgTyr: 1.854 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
2.418SerAla: 2.418 ± 0.578
0.806SerCys: 0.806 ± 0.287
4.272SerAsp: 4.272 ± 0.614
4.836SerGlu: 4.836 ± 0.747
3.869SerPhe: 3.869 ± 0.594
4.595SerGly: 4.595 ± 0.679
0.645SerHis: 0.645 ± 0.193
6.126SerIle: 6.126 ± 0.781
5.723SerLys: 5.723 ± 0.68
5.804SerLeu: 5.804 ± 0.578
1.209SerMet: 1.209 ± 0.299
5.239SerAsn: 5.239 ± 0.721
1.451SerPro: 1.451 ± 0.294
2.741SerGln: 2.741 ± 0.471
1.612SerArg: 1.612 ± 0.36
3.869SerSer: 3.869 ± 0.688
3.627SerThr: 3.627 ± 0.553
4.675SerVal: 4.675 ± 0.66
0.484SerTrp: 0.484 ± 0.218
1.693SerTyr: 1.693 ± 0.326
0.0SerXaa: 0.0 ± 0.0
Thr
4.514ThrAla: 4.514 ± 0.724
0.645ThrCys: 0.645 ± 0.215
4.192ThrAsp: 4.192 ± 0.555
4.675ThrGlu: 4.675 ± 0.718
4.03ThrPhe: 4.03 ± 0.638
3.144ThrGly: 3.144 ± 0.618
0.887ThrHis: 0.887 ± 0.212
5.884ThrIle: 5.884 ± 0.762
5.481ThrLys: 5.481 ± 0.557
5.723ThrLeu: 5.723 ± 0.588
1.612ThrMet: 1.612 ± 0.349
4.111ThrAsn: 4.111 ± 0.645
1.854ThrPro: 1.854 ± 0.381
2.821ThrGln: 2.821 ± 0.455
1.451ThrArg: 1.451 ± 0.284
4.111ThrSer: 4.111 ± 0.705
4.756ThrThr: 4.756 ± 0.843
1.209ThrVal: 1.209 ± 0.311
0.645ThrTrp: 0.645 ± 0.211
2.015ThrTyr: 2.015 ± 0.399
0.0ThrXaa: 0.0 ± 0.0
Val
3.466ValAla: 3.466 ± 0.566
0.564ValCys: 0.564 ± 0.25
2.982ValAsp: 2.982 ± 0.455
2.821ValGlu: 2.821 ± 0.398
3.385ValPhe: 3.385 ± 0.52
3.224ValGly: 3.224 ± 0.423
0.887ValHis: 0.887 ± 0.29
3.627ValIle: 3.627 ± 0.46
4.998ValLys: 4.998 ± 0.579
4.433ValLeu: 4.433 ± 0.615
1.37ValMet: 1.37 ± 0.288
5.401ValAsn: 5.401 ± 0.808
1.29ValPro: 1.29 ± 0.3
1.773ValGln: 1.773 ± 0.354
1.532ValArg: 1.532 ± 0.327
4.595ValSer: 4.595 ± 0.567
2.741ValThr: 2.741 ± 0.574
3.144ValVal: 3.144 ± 0.569
0.806ValTrp: 0.806 ± 0.258
2.338ValTyr: 2.338 ± 0.336
0.0ValXaa: 0.0 ± 0.0
Trp
0.645TrpAla: 0.645 ± 0.206
0.081TrpCys: 0.081 ± 0.084
0.806TrpAsp: 0.806 ± 0.249
0.645TrpGlu: 0.645 ± 0.241
0.242TrpPhe: 0.242 ± 0.145
0.161TrpGly: 0.161 ± 0.101
0.322TrpHis: 0.322 ± 0.155
0.725TrpIle: 0.725 ± 0.232
1.128TrpLys: 1.128 ± 0.376
1.451TrpLeu: 1.451 ± 0.372
0.081TrpMet: 0.081 ± 0.125
0.806TrpAsn: 0.806 ± 0.262
0.0TrpPro: 0.0 ± 0.0
0.564TrpGln: 0.564 ± 0.161
0.403TrpArg: 0.403 ± 0.14
0.564TrpSer: 0.564 ± 0.29
0.322TrpThr: 0.322 ± 0.131
0.484TrpVal: 0.484 ± 0.157
0.0TrpTrp: 0.0 ± 0.0
0.564TrpTyr: 0.564 ± 0.204
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.612TyrAla: 1.612 ± 0.268
0.484TyrCys: 0.484 ± 0.189
2.096TyrAsp: 2.096 ± 0.432
2.741TyrGlu: 2.741 ± 0.427
1.773TyrPhe: 1.773 ± 0.426
2.176TyrGly: 2.176 ± 0.36
0.645TyrHis: 0.645 ± 0.229
4.03TyrIle: 4.03 ± 0.581
5.159TyrLys: 5.159 ± 0.653
3.869TyrLeu: 3.869 ± 0.473
0.484TyrMet: 0.484 ± 0.195
3.466TyrAsn: 3.466 ± 0.49
1.048TyrPro: 1.048 ± 0.353
1.451TyrGln: 1.451 ± 0.292
1.451TyrArg: 1.451 ± 0.383
2.66TyrSer: 2.66 ± 0.599
2.338TyrThr: 2.338 ± 0.539
2.257TyrVal: 2.257 ± 0.385
0.564TyrTrp: 0.564 ± 0.177
1.854TyrTyr: 1.854 ± 0.472
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (12407 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski