Amino acid dipepetide frequency for Shigella phage SfII (Shigella flexneri bacteriophage II) (Bacteriophage SfII)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.403AlaAla: 8.403 ± 0.882
1.427AlaCys: 1.427 ± 0.383
4.598AlaAsp: 4.598 ± 0.732
4.756AlaGlu: 4.756 ± 0.626
3.171AlaPhe: 3.171 ± 0.695
7.689AlaGly: 7.689 ± 0.837
1.665AlaHis: 1.665 ± 0.392
5.628AlaIle: 5.628 ± 0.722
3.805AlaLys: 3.805 ± 0.443
7.531AlaLeu: 7.531 ± 0.75
2.774AlaMet: 2.774 ± 0.493
2.616AlaAsn: 2.616 ± 0.505
3.488AlaPro: 3.488 ± 0.527
2.299AlaGln: 2.299 ± 0.404
6.5AlaArg: 6.5 ± 0.885
5.153AlaSer: 5.153 ± 0.891
5.707AlaThr: 5.707 ± 0.775
5.628AlaVal: 5.628 ± 0.796
1.585AlaTrp: 1.585 ± 0.341
3.012AlaTyr: 3.012 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
1.268CysAla: 1.268 ± 0.356
0.159CysCys: 0.159 ± 0.112
0.793CysAsp: 0.793 ± 0.237
0.317CysGlu: 0.317 ± 0.143
0.238CysPhe: 0.238 ± 0.131
1.506CysGly: 1.506 ± 0.352
0.396CysHis: 0.396 ± 0.215
0.793CysIle: 0.793 ± 0.262
0.317CysLys: 0.317 ± 0.156
0.872CysLeu: 0.872 ± 0.311
0.317CysMet: 0.317 ± 0.148
0.476CysAsn: 0.476 ± 0.206
0.555CysPro: 0.555 ± 0.201
0.872CysGln: 0.872 ± 0.251
1.11CysArg: 1.11 ± 0.34
0.872CysSer: 0.872 ± 0.25
0.476CysThr: 0.476 ± 0.19
1.031CysVal: 1.031 ± 0.338
0.317CysTrp: 0.317 ± 0.192
0.317CysTyr: 0.317 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
5.232AspAla: 5.232 ± 0.746
0.396AspCys: 0.396 ± 0.214
4.518AspAsp: 4.518 ± 0.774
3.646AspGlu: 3.646 ± 0.505
2.14AspPhe: 2.14 ± 0.406
4.598AspGly: 4.598 ± 0.657
0.476AspHis: 0.476 ± 0.183
2.933AspIle: 2.933 ± 0.586
3.409AspLys: 3.409 ± 0.607
5.232AspLeu: 5.232 ± 0.76
1.427AspMet: 1.427 ± 0.348
2.457AspAsn: 2.457 ± 0.46
3.171AspPro: 3.171 ± 0.65
1.348AspGln: 1.348 ± 0.346
2.299AspArg: 2.299 ± 0.451
2.933AspSer: 2.933 ± 0.42
2.537AspThr: 2.537 ± 0.428
3.884AspVal: 3.884 ± 0.859
0.793AspTrp: 0.793 ± 0.215
1.427AspTyr: 1.427 ± 0.35
0.0AspXaa: 0.0 ± 0.0
Glu
5.073GluAla: 5.073 ± 0.549
0.793GluCys: 0.793 ± 0.297
2.299GluAsp: 2.299 ± 0.454
2.774GluGlu: 2.774 ± 0.504
1.665GluPhe: 1.665 ± 0.324
2.537GluGly: 2.537 ± 0.418
1.031GluHis: 1.031 ± 0.261
3.171GluIle: 3.171 ± 0.404
3.567GluLys: 3.567 ± 0.577
7.214GluLeu: 7.214 ± 0.748
1.902GluMet: 1.902 ± 0.334
2.457GluAsn: 2.457 ± 0.406
2.616GluPro: 2.616 ± 0.388
2.378GluGln: 2.378 ± 0.444
3.805GluArg: 3.805 ± 0.513
3.329GluSer: 3.329 ± 0.65
2.854GluThr: 2.854 ± 0.405
3.646GluVal: 3.646 ± 0.65
1.268GluTrp: 1.268 ± 0.313
1.506GluTyr: 1.506 ± 0.263
0.0GluXaa: 0.0 ± 0.0
Phe
2.695PheAla: 2.695 ± 0.43
0.159PheCys: 0.159 ± 0.12
2.299PheAsp: 2.299 ± 0.379
1.585PheGlu: 1.585 ± 0.271
1.427PhePhe: 1.427 ± 0.376
3.171PheGly: 3.171 ± 0.567
0.872PheHis: 0.872 ± 0.276
2.537PheIle: 2.537 ± 0.678
2.22PheLys: 2.22 ± 0.405
3.488PheLeu: 3.488 ± 0.712
1.506PheMet: 1.506 ± 0.328
2.14PheAsn: 2.14 ± 0.412
1.427PhePro: 1.427 ± 0.361
1.427PheGln: 1.427 ± 0.337
2.457PheArg: 2.457 ± 0.424
2.695PheSer: 2.695 ± 0.583
2.457PheThr: 2.457 ± 0.44
2.695PheVal: 2.695 ± 0.556
1.031PheTrp: 1.031 ± 0.251
1.823PheTyr: 1.823 ± 0.396
0.0PheXaa: 0.0 ± 0.0
Gly
6.025GlyAla: 6.025 ± 0.874
0.713GlyCys: 0.713 ± 0.207
4.122GlyAsp: 4.122 ± 0.638
4.518GlyGlu: 4.518 ± 0.63
3.329GlyPhe: 3.329 ± 0.49
4.915GlyGly: 4.915 ± 0.802
0.951GlyHis: 0.951 ± 0.253
3.884GlyIle: 3.884 ± 0.603
3.964GlyLys: 3.964 ± 0.534
5.707GlyLeu: 5.707 ± 1.107
2.299GlyMet: 2.299 ± 0.453
3.409GlyAsn: 3.409 ± 0.429
1.506GlyPro: 1.506 ± 0.387
2.061GlyGln: 2.061 ± 0.424
4.201GlyArg: 4.201 ± 0.623
3.646GlySer: 3.646 ± 0.548
4.122GlyThr: 4.122 ± 0.669
5.153GlyVal: 5.153 ± 0.681
2.061GlyTrp: 2.061 ± 0.377
3.25GlyTyr: 3.25 ± 0.546
0.0GlyXaa: 0.0 ± 0.0
His
1.189HisAla: 1.189 ± 0.273
0.238HisCys: 0.238 ± 0.147
1.11HisAsp: 1.11 ± 0.276
0.872HisGlu: 0.872 ± 0.268
1.11HisPhe: 1.11 ± 0.328
1.585HisGly: 1.585 ± 0.341
0.713HisHis: 0.713 ± 0.239
0.872HisIle: 0.872 ± 0.248
0.951HisLys: 0.951 ± 0.282
1.506HisLeu: 1.506 ± 0.371
0.396HisMet: 0.396 ± 0.193
0.713HisAsn: 0.713 ± 0.251
1.11HisPro: 1.11 ± 0.368
0.396HisGln: 0.396 ± 0.186
1.585HisArg: 1.585 ± 0.437
0.951HisSer: 0.951 ± 0.291
1.031HisThr: 1.031 ± 0.257
0.872HisVal: 0.872 ± 0.242
0.476HisTrp: 0.476 ± 0.18
0.713HisTyr: 0.713 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
5.39IleAla: 5.39 ± 0.668
1.11IleCys: 1.11 ± 0.358
3.567IleAsp: 3.567 ± 0.535
3.409IleGlu: 3.409 ± 0.62
1.823IlePhe: 1.823 ± 0.595
4.439IleGly: 4.439 ± 0.568
0.555IleHis: 0.555 ± 0.199
3.726IleIle: 3.726 ± 1.02
3.25IleLys: 3.25 ± 0.536
3.25IleLeu: 3.25 ± 0.478
1.189IleMet: 1.189 ± 0.279
3.567IleAsn: 3.567 ± 0.512
3.171IlePro: 3.171 ± 0.383
1.268IleGln: 1.268 ± 0.314
3.646IleArg: 3.646 ± 0.399
5.311IleSer: 5.311 ± 0.827
4.36IleThr: 4.36 ± 0.47
3.567IleVal: 3.567 ± 0.553
0.634IleTrp: 0.634 ± 0.183
2.061IleTyr: 2.061 ± 0.711
0.0IleXaa: 0.0 ± 0.0
Lys
4.915LysAla: 4.915 ± 0.703
0.555LysCys: 0.555 ± 0.179
2.616LysAsp: 2.616 ± 0.424
3.409LysGlu: 3.409 ± 0.46
2.061LysPhe: 2.061 ± 0.346
2.933LysGly: 2.933 ± 0.471
0.951LysHis: 0.951 ± 0.232
3.329LysIle: 3.329 ± 0.622
3.805LysLys: 3.805 ± 0.727
5.232LysLeu: 5.232 ± 0.756
1.982LysMet: 1.982 ± 0.519
3.329LysAsn: 3.329 ± 0.482
2.774LysPro: 2.774 ± 0.417
1.982LysGln: 1.982 ± 0.427
3.964LysArg: 3.964 ± 0.491
4.201LysSer: 4.201 ± 0.491
2.537LysThr: 2.537 ± 0.403
3.329LysVal: 3.329 ± 0.676
0.793LysTrp: 0.793 ± 0.215
1.744LysTyr: 1.744 ± 0.383
0.0LysXaa: 0.0 ± 0.0
Leu
8.244LeuAla: 8.244 ± 0.802
1.982LeuCys: 1.982 ± 0.462
3.964LeuAsp: 3.964 ± 0.505
5.311LeuGlu: 5.311 ± 0.692
4.994LeuPhe: 4.994 ± 0.618
4.836LeuGly: 4.836 ± 0.843
1.268LeuHis: 1.268 ± 0.346
5.945LeuIle: 5.945 ± 0.674
5.073LeuLys: 5.073 ± 0.808
7.769LeuLeu: 7.769 ± 0.966
2.14LeuMet: 2.14 ± 0.373
4.439LeuAsn: 4.439 ± 0.646
4.915LeuPro: 4.915 ± 0.621
3.329LeuGln: 3.329 ± 0.478
6.025LeuArg: 6.025 ± 0.738
6.5LeuSer: 6.5 ± 0.789
4.994LeuThr: 4.994 ± 0.56
5.153LeuVal: 5.153 ± 0.612
1.268LeuTrp: 1.268 ± 0.418
2.14LeuTyr: 2.14 ± 0.432
0.0LeuXaa: 0.0 ± 0.0
Met
2.14MetAla: 2.14 ± 0.355
0.317MetCys: 0.317 ± 0.148
0.793MetAsp: 0.793 ± 0.268
0.951MetGlu: 0.951 ± 0.315
0.793MetPhe: 0.793 ± 0.233
1.506MetGly: 1.506 ± 0.334
0.238MetHis: 0.238 ± 0.135
1.665MetIle: 1.665 ± 0.32
1.665MetLys: 1.665 ± 0.361
2.854MetLeu: 2.854 ± 0.422
0.713MetMet: 0.713 ± 0.341
1.506MetAsn: 1.506 ± 0.272
1.348MetPro: 1.348 ± 0.32
1.348MetGln: 1.348 ± 0.413
2.14MetArg: 2.14 ± 0.447
2.378MetSer: 2.378 ± 0.344
1.823MetThr: 1.823 ± 0.382
1.506MetVal: 1.506 ± 0.367
0.396MetTrp: 0.396 ± 0.152
0.555MetTyr: 0.555 ± 0.201
0.0MetXaa: 0.0 ± 0.0
Asn
4.122AsnAla: 4.122 ± 0.611
0.317AsnCys: 0.317 ± 0.186
2.695AsnAsp: 2.695 ± 0.305
2.14AsnGlu: 2.14 ± 0.375
1.585AsnPhe: 1.585 ± 0.361
3.805AsnGly: 3.805 ± 0.546
0.793AsnHis: 0.793 ± 0.279
2.933AsnIle: 2.933 ± 0.519
3.171AsnLys: 3.171 ± 0.605
2.14AsnLeu: 2.14 ± 0.473
1.11AsnMet: 1.11 ± 0.31
1.982AsnAsn: 1.982 ± 0.393
2.774AsnPro: 2.774 ± 0.49
1.902AsnGln: 1.902 ± 0.426
2.299AsnArg: 2.299 ± 0.487
2.854AsnSer: 2.854 ± 0.591
2.378AsnThr: 2.378 ± 0.557
2.774AsnVal: 2.774 ± 0.543
0.555AsnTrp: 0.555 ± 0.208
0.951AsnTyr: 0.951 ± 0.319
0.0AsnXaa: 0.0 ± 0.0
Pro
4.756ProAla: 4.756 ± 0.609
0.476ProCys: 0.476 ± 0.207
3.488ProAsp: 3.488 ± 0.603
3.567ProGlu: 3.567 ± 0.63
1.902ProPhe: 1.902 ± 0.402
3.329ProGly: 3.329 ± 0.553
0.951ProHis: 0.951 ± 0.262
2.22ProIle: 2.22 ± 0.373
2.14ProLys: 2.14 ± 0.397
3.884ProLeu: 3.884 ± 0.519
1.031ProMet: 1.031 ± 0.297
2.061ProAsn: 2.061 ± 0.374
1.427ProPro: 1.427 ± 0.386
1.427ProGln: 1.427 ± 0.331
1.823ProArg: 1.823 ± 0.335
3.092ProSer: 3.092 ± 0.84
2.378ProThr: 2.378 ± 0.498
4.043ProVal: 4.043 ± 0.5
0.159ProTrp: 0.159 ± 0.109
1.585ProTyr: 1.585 ± 0.353
0.0ProXaa: 0.0 ± 0.0
Gln
3.329GlnAla: 3.329 ± 0.478
0.396GlnCys: 0.396 ± 0.179
1.585GlnAsp: 1.585 ± 0.394
2.061GlnGlu: 2.061 ± 0.419
1.348GlnPhe: 1.348 ± 0.379
1.665GlnGly: 1.665 ± 0.386
0.793GlnHis: 0.793 ± 0.253
2.22GlnIle: 2.22 ± 0.488
2.14GlnLys: 2.14 ± 0.369
3.726GlnLeu: 3.726 ± 0.564
1.11GlnMet: 1.11 ± 0.333
1.348GlnAsn: 1.348 ± 0.328
1.823GlnPro: 1.823 ± 0.366
1.823GlnGln: 1.823 ± 0.362
3.012GlnArg: 3.012 ± 0.631
2.22GlnSer: 2.22 ± 0.481
2.457GlnThr: 2.457 ± 0.457
1.982GlnVal: 1.982 ± 0.343
0.872GlnTrp: 0.872 ± 0.285
0.951GlnTyr: 0.951 ± 0.275
0.0GlnXaa: 0.0 ± 0.0
Arg
5.549ArgAla: 5.549 ± 0.768
0.634ArgCys: 0.634 ± 0.228
3.171ArgAsp: 3.171 ± 0.438
3.329ArgGlu: 3.329 ± 0.532
2.537ArgPhe: 2.537 ± 0.521
3.726ArgGly: 3.726 ± 0.606
1.982ArgHis: 1.982 ± 0.388
2.537ArgIle: 2.537 ± 0.413
4.043ArgLys: 4.043 ± 0.533
6.738ArgLeu: 6.738 ± 0.792
1.506ArgMet: 1.506 ± 0.271
2.299ArgAsn: 2.299 ± 0.379
2.299ArgPro: 2.299 ± 0.36
3.964ArgGln: 3.964 ± 0.653
4.677ArgArg: 4.677 ± 1.042
3.25ArgSer: 3.25 ± 0.635
3.329ArgThr: 3.329 ± 0.535
3.964ArgVal: 3.964 ± 0.7
1.268ArgTrp: 1.268 ± 0.364
2.14ArgTyr: 2.14 ± 0.392
0.0ArgXaa: 0.0 ± 0.0
Ser
5.866SerAla: 5.866 ± 0.579
0.951SerCys: 0.951 ± 0.347
3.726SerAsp: 3.726 ± 0.63
3.25SerGlu: 3.25 ± 0.447
3.092SerPhe: 3.092 ± 0.505
4.915SerGly: 4.915 ± 0.651
1.744SerHis: 1.744 ± 0.344
3.884SerIle: 3.884 ± 0.682
3.646SerLys: 3.646 ± 0.624
6.342SerLeu: 6.342 ± 0.744
1.506SerMet: 1.506 ± 0.318
2.457SerAsn: 2.457 ± 0.372
2.695SerPro: 2.695 ± 0.487
2.854SerGln: 2.854 ± 0.49
3.409SerArg: 3.409 ± 0.534
3.726SerSer: 3.726 ± 0.451
3.329SerThr: 3.329 ± 0.613
4.518SerVal: 4.518 ± 0.641
1.031SerTrp: 1.031 ± 0.271
1.823SerTyr: 1.823 ± 0.387
0.0SerXaa: 0.0 ± 0.0
Thr
5.866ThrAla: 5.866 ± 1.013
0.634ThrCys: 0.634 ± 0.231
3.012ThrAsp: 3.012 ± 0.455
3.646ThrGlu: 3.646 ± 0.44
2.22ThrPhe: 2.22 ± 0.409
4.836ThrGly: 4.836 ± 0.848
1.823ThrHis: 1.823 ± 0.412
2.537ThrIle: 2.537 ± 0.568
2.378ThrLys: 2.378 ± 0.437
5.787ThrLeu: 5.787 ± 0.858
1.11ThrMet: 1.11 ± 0.288
1.982ThrAsn: 1.982 ± 0.537
2.854ThrPro: 2.854 ± 0.51
1.348ThrGln: 1.348 ± 0.336
3.329ThrArg: 3.329 ± 0.447
4.043ThrSer: 4.043 ± 0.584
3.646ThrThr: 3.646 ± 0.725
3.726ThrVal: 3.726 ± 0.547
1.268ThrTrp: 1.268 ± 0.327
1.665ThrTyr: 1.665 ± 0.33
0.0ThrXaa: 0.0 ± 0.0
Val
4.281ValAla: 4.281 ± 0.575
0.872ValCys: 0.872 ± 0.303
3.726ValAsp: 3.726 ± 0.635
3.567ValGlu: 3.567 ± 0.623
2.695ValPhe: 2.695 ± 0.435
4.281ValGly: 4.281 ± 0.653
0.396ValHis: 0.396 ± 0.162
5.47ValIle: 5.47 ± 0.667
4.043ValLys: 4.043 ± 0.651
5.866ValLeu: 5.866 ± 0.624
1.268ValMet: 1.268 ± 0.315
3.171ValAsn: 3.171 ± 0.692
3.488ValPro: 3.488 ± 0.565
2.378ValGln: 2.378 ± 0.442
3.567ValArg: 3.567 ± 0.573
4.439ValSer: 4.439 ± 0.522
4.439ValThr: 4.439 ± 0.741
6.025ValVal: 6.025 ± 0.717
0.951ValTrp: 0.951 ± 0.261
2.22ValTyr: 2.22 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
1.11TrpAla: 1.11 ± 0.309
0.396TrpCys: 0.396 ± 0.16
0.793TrpAsp: 0.793 ± 0.26
1.11TrpGlu: 1.11 ± 0.312
0.793TrpPhe: 0.793 ± 0.244
1.031TrpGly: 1.031 ± 0.264
0.238TrpHis: 0.238 ± 0.169
0.555TrpIle: 0.555 ± 0.167
1.348TrpLys: 1.348 ± 0.383
2.774TrpLeu: 2.774 ± 0.588
0.396TrpMet: 0.396 ± 0.167
0.317TrpAsn: 0.317 ± 0.137
0.872TrpPro: 0.872 ± 0.228
1.031TrpGln: 1.031 ± 0.236
1.189TrpArg: 1.189 ± 0.273
1.11TrpSer: 1.11 ± 0.292
0.793TrpThr: 0.793 ± 0.295
1.189TrpVal: 1.189 ± 0.426
0.555TrpTrp: 0.555 ± 0.172
0.476TrpTyr: 0.476 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.982TyrAla: 1.982 ± 0.464
0.476TyrCys: 0.476 ± 0.198
1.982TyrAsp: 1.982 ± 0.376
1.665TyrGlu: 1.665 ± 0.341
1.189TyrPhe: 1.189 ± 0.298
2.537TyrGly: 2.537 ± 0.523
0.555TyrHis: 0.555 ± 0.225
2.457TyrIle: 2.457 ± 0.702
1.585TyrLys: 1.585 ± 0.355
2.378TyrLeu: 2.378 ± 0.369
0.872TyrMet: 0.872 ± 0.255
0.555TyrAsn: 0.555 ± 0.24
1.427TyrPro: 1.427 ± 0.287
1.427TyrGln: 1.427 ± 0.354
1.902TyrArg: 1.902 ± 0.436
2.061TyrSer: 2.061 ± 0.397
2.061TyrThr: 2.061 ± 0.435
2.457TyrVal: 2.457 ± 0.578
0.793TyrTrp: 0.793 ± 0.226
0.951TyrTyr: 0.951 ± 0.226
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (12616 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski