Amino acid dipepetide frequency for Escherichia phage G4 (Bacteriophage G4)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.956AlaAla: 2.956 ± 1.257
1.267AlaCys: 1.267 ± 0.883
5.49AlaAsp: 5.49 ± 1.602
3.378AlaGlu: 3.378 ± 0.897
3.378AlaPhe: 3.378 ± 0.98
10.135AlaGly: 10.135 ± 2.85
2.956AlaHis: 2.956 ± 1.287
4.223AlaIle: 4.223 ± 1.511
5.912AlaLys: 5.912 ± 1.153
5.068AlaLeu: 5.068 ± 1.342
0.845AlaMet: 0.845 ± 0.434
3.378AlaAsn: 3.378 ± 1.164
3.378AlaPro: 3.378 ± 1.398
2.956AlaGln: 2.956 ± 1.126
1.267AlaArg: 1.267 ± 0.834
7.179AlaSer: 7.179 ± 1.626
3.801AlaThr: 3.801 ± 1.457
8.446AlaVal: 8.446 ± 1.926
0.422AlaTrp: 0.422 ± 0.427
2.111AlaTyr: 2.111 ± 0.683
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.267CysAsp: 1.267 ± 0.802
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.422CysHis: 0.422 ± 0.427
0.0CysIle: 0.0 ± 0.0
0.422CysLys: 0.422 ± 0.587
0.422CysLeu: 0.422 ± 0.408
0.0CysMet: 0.0 ± 0.0
0.845CysAsn: 0.845 ± 0.628
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.422CysArg: 0.422 ± 0.501
1.267CysSer: 1.267 ± 0.582
0.0CysThr: 0.0 ± 0.0
1.267CysVal: 1.267 ± 0.476
0.0CysTrp: 0.0 ± 0.0
1.267CysTyr: 1.267 ± 0.7
0.0CysXaa: 0.0 ± 0.0
Asp
3.378AspAla: 3.378 ± 1.079
0.845AspCys: 0.845 ± 0.434
2.956AspAsp: 2.956 ± 0.768
3.801AspGlu: 3.801 ± 1.197
2.534AspPhe: 2.534 ± 1.142
2.956AspGly: 2.956 ± 1.367
1.267AspHis: 1.267 ± 0.746
6.334AspIle: 6.334 ± 1.619
0.422AspLys: 0.422 ± 0.358
2.111AspLeu: 2.111 ± 1.273
1.689AspMet: 1.689 ± 0.522
2.534AspAsn: 2.534 ± 0.701
2.534AspPro: 2.534 ± 1.394
1.267AspGln: 1.267 ± 0.732
3.378AspArg: 3.378 ± 1.091
5.068AspSer: 5.068 ± 1.415
2.956AspThr: 2.956 ± 1.002
4.223AspVal: 4.223 ± 1.04
0.422AspTrp: 0.422 ± 0.606
3.801AspTyr: 3.801 ± 1.163
0.0AspXaa: 0.0 ± 0.0
Glu
3.801GluAla: 3.801 ± 1.019
1.267GluCys: 1.267 ± 0.706
1.267GluAsp: 1.267 ± 0.485
2.534GluGlu: 2.534 ± 1.675
2.534GluPhe: 2.534 ± 1.382
1.689GluGly: 1.689 ± 0.472
1.267GluHis: 1.267 ± 0.874
3.801GluIle: 3.801 ± 1.2
0.845GluLys: 0.845 ± 0.647
5.49GluLeu: 5.49 ± 1.402
1.267GluMet: 1.267 ± 0.724
2.534GluAsn: 2.534 ± 1.068
0.845GluPro: 0.845 ± 0.434
1.267GluGln: 1.267 ± 0.718
2.534GluArg: 2.534 ± 1.411
2.534GluSer: 2.534 ± 1.132
5.49GluThr: 5.49 ± 1.264
1.689GluVal: 1.689 ± 0.675
0.845GluTrp: 0.845 ± 0.434
0.845GluTyr: 0.845 ± 0.434
0.0GluXaa: 0.0 ± 0.0
Phe
1.689PheAla: 1.689 ± 1.124
0.845PheCys: 0.845 ± 0.434
1.689PheAsp: 1.689 ± 0.627
1.267PheGlu: 1.267 ± 0.763
0.422PhePhe: 0.422 ± 0.353
3.378PheGly: 3.378 ± 0.763
1.267PheHis: 1.267 ± 0.435
2.111PheIle: 2.111 ± 0.839
2.534PheLys: 2.534 ± 0.745
2.111PheLeu: 2.111 ± 1.894
2.111PheMet: 2.111 ± 0.661
2.956PheAsn: 2.956 ± 0.887
2.111PhePro: 2.111 ± 1.114
1.689PheGln: 1.689 ± 0.653
3.378PheArg: 3.378 ± 1.353
2.111PheSer: 2.111 ± 0.749
2.956PheThr: 2.956 ± 0.84
0.845PheVal: 0.845 ± 0.736
0.845PheTrp: 0.845 ± 0.416
3.378PheTyr: 3.378 ± 0.999
0.0PheXaa: 0.0 ± 0.0
Gly
5.912GlyAla: 5.912 ± 2.498
0.0GlyCys: 0.0 ± 0.0
1.267GlyAsp: 1.267 ± 0.675
0.845GlyGlu: 0.845 ± 0.434
2.534GlyPhe: 2.534 ± 0.723
4.223GlyGly: 4.223 ± 1.688
1.267GlyHis: 1.267 ± 0.485
4.223GlyIle: 4.223 ± 1.642
7.179GlyLys: 7.179 ± 1.5
4.645GlyLeu: 4.645 ± 1.113
1.689GlyMet: 1.689 ± 0.677
3.378GlyAsn: 3.378 ± 0.862
0.0GlyPro: 0.0 ± 0.0
2.534GlyGln: 2.534 ± 1.029
3.378GlyArg: 3.378 ± 1.205
2.956GlySer: 2.956 ± 1.051
3.378GlyThr: 3.378 ± 1.144
3.378GlyVal: 3.378 ± 1.318
2.111GlyTrp: 2.111 ± 0.794
3.378GlyTyr: 3.378 ± 0.778
0.0GlyXaa: 0.0 ± 0.0
His
2.534HisAla: 2.534 ± 0.685
0.0HisCys: 0.0 ± 0.0
1.267HisAsp: 1.267 ± 0.435
0.845HisGlu: 0.845 ± 0.568
1.267HisPhe: 1.267 ± 0.613
1.689HisGly: 1.689 ± 0.568
1.267HisHis: 1.267 ± 0.747
0.422HisIle: 0.422 ± 0.353
1.689HisLys: 1.689 ± 0.799
2.956HisLeu: 2.956 ± 0.873
0.422HisMet: 0.422 ± 0.353
0.845HisAsn: 0.845 ± 0.538
0.845HisPro: 0.845 ± 0.578
0.422HisGln: 0.422 ± 0.353
0.845HisArg: 0.845 ± 0.449
2.111HisSer: 2.111 ± 0.673
1.267HisThr: 1.267 ± 0.802
0.845HisVal: 0.845 ± 0.511
1.689HisTrp: 1.689 ± 0.755
0.845HisTyr: 0.845 ± 0.416
0.0HisXaa: 0.0 ± 0.0
Ile
7.601IleAla: 7.601 ± 1.345
0.422IleCys: 0.422 ± 0.501
3.378IleAsp: 3.378 ± 1.023
2.111IleGlu: 2.111 ± 0.696
0.845IlePhe: 0.845 ± 0.705
2.534IleGly: 2.534 ± 0.732
0.422IleHis: 0.422 ± 0.501
0.845IleIle: 0.845 ± 0.744
2.956IleLys: 2.956 ± 0.978
4.223IleLeu: 4.223 ± 1.342
2.956IleMet: 2.956 ± 1.406
5.49IleAsn: 5.49 ± 1.395
2.534IlePro: 2.534 ± 0.709
2.111IleGln: 2.111 ± 1.163
3.378IleArg: 3.378 ± 0.887
3.378IleSer: 3.378 ± 1.163
1.689IleThr: 1.689 ± 0.837
1.267IleVal: 1.267 ± 0.512
0.845IleTrp: 0.845 ± 0.416
0.845IleTyr: 0.845 ± 0.705
0.0IleXaa: 0.0 ± 0.0
Lys
2.111LysAla: 2.111 ± 0.718
0.0LysCys: 0.0 ± 0.0
6.334LysAsp: 6.334 ± 1.329
5.912LysGlu: 5.912 ± 1.535
3.378LysPhe: 3.378 ± 0.922
5.068LysGly: 5.068 ± 1.546
1.267LysHis: 1.267 ± 0.549
2.956LysIle: 2.956 ± 0.754
4.223LysLys: 4.223 ± 1.642
5.068LysLeu: 5.068 ± 1.165
2.111LysMet: 2.111 ± 1.027
1.689LysAsn: 1.689 ± 0.832
3.801LysPro: 3.801 ± 1.927
3.378LysGln: 3.378 ± 1.062
0.0LysArg: 0.0 ± 0.0
7.179LysSer: 7.179 ± 1.762
3.378LysThr: 3.378 ± 0.842
3.801LysVal: 3.801 ± 0.993
1.267LysTrp: 1.267 ± 0.435
1.689LysTyr: 1.689 ± 0.849
0.0LysXaa: 0.0 ± 0.0
Leu
7.179LeuAla: 7.179 ± 1.174
0.845LeuCys: 0.845 ± 0.616
4.223LeuAsp: 4.223 ± 1.179
2.956LeuGlu: 2.956 ± 0.98
2.111LeuPhe: 2.111 ± 1.935
3.801LeuGly: 3.801 ± 1.153
1.689LeuHis: 1.689 ± 0.663
2.956LeuIle: 2.956 ± 1.432
10.98LeuLys: 10.98 ± 2.295
9.291LeuLeu: 9.291 ± 4.26
4.645LeuMet: 4.645 ± 0.945
3.378LeuAsn: 3.378 ± 2.343
3.378LeuPro: 3.378 ± 1.453
4.223LeuGln: 4.223 ± 1.182
3.378LeuArg: 3.378 ± 0.828
8.868LeuSer: 8.868 ± 2.59
9.713LeuThr: 9.713 ± 1.897
3.801LeuVal: 3.801 ± 1.144
1.689LeuTrp: 1.689 ± 0.725
0.845LeuTyr: 0.845 ± 0.538
0.0LeuXaa: 0.0 ± 0.0
Met
2.956MetAla: 2.956 ± 1.203
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
2.956MetGlu: 2.956 ± 0.827
1.267MetPhe: 1.267 ± 0.549
0.845MetGly: 0.845 ± 0.449
0.422MetHis: 0.422 ± 0.353
0.422MetIle: 0.422 ± 0.427
3.378MetLys: 3.378 ± 0.998
2.111MetLeu: 2.111 ± 1.409
0.422MetMet: 0.422 ± 0.395
1.267MetAsn: 1.267 ± 0.7
1.689MetPro: 1.689 ± 0.663
2.111MetGln: 2.111 ± 1.16
3.801MetArg: 3.801 ± 1.003
2.534MetSer: 2.534 ± 0.497
2.534MetThr: 2.534 ± 1.299
1.689MetVal: 1.689 ± 0.557
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
5.912AsnAla: 5.912 ± 1.227
0.422AsnCys: 0.422 ± 0.587
2.111AsnAsp: 2.111 ± 0.803
2.111AsnGlu: 2.111 ± 1.021
2.111AsnPhe: 2.111 ± 0.794
2.111AsnGly: 2.111 ± 0.817
1.689AsnHis: 1.689 ± 0.756
3.378AsnIle: 3.378 ± 1.16
2.956AsnLys: 2.956 ± 0.79
8.024AsnLeu: 8.024 ± 2.121
2.534AsnMet: 2.534 ± 1.637
4.645AsnAsn: 4.645 ± 0.663
3.378AsnPro: 3.378 ± 0.723
2.534AsnGln: 2.534 ± 1.426
2.956AsnArg: 2.956 ± 0.72
4.223AsnSer: 4.223 ± 1.016
3.801AsnThr: 3.801 ± 0.979
2.956AsnVal: 2.956 ± 1.147
0.0AsnTrp: 0.0 ± 0.0
2.534AsnTyr: 2.534 ± 0.653
0.0AsnXaa: 0.0 ± 0.0
Pro
2.111ProAla: 2.111 ± 1.316
0.0ProCys: 0.0 ± 0.0
2.111ProAsp: 2.111 ± 0.978
2.534ProGlu: 2.534 ± 0.557
1.267ProPhe: 1.267 ± 0.435
0.422ProGly: 0.422 ± 0.358
1.267ProHis: 1.267 ± 0.802
2.111ProIle: 2.111 ± 1.028
2.534ProLys: 2.534 ± 0.769
6.334ProLeu: 6.334 ± 1.765
0.0ProMet: 0.0 ± 0.0
4.223ProAsn: 4.223 ± 1.511
2.534ProPro: 2.534 ± 1.742
0.422ProGln: 0.422 ± 0.372
1.689ProArg: 1.689 ± 0.85
3.801ProSer: 3.801 ± 1.736
2.111ProThr: 2.111 ± 1.089
5.912ProVal: 5.912 ± 1.487
0.845ProTrp: 0.845 ± 0.449
1.267ProTyr: 1.267 ± 0.435
0.0ProXaa: 0.0 ± 0.0
Gln
1.267GlnAla: 1.267 ± 0.598
0.422GlnCys: 0.422 ± 0.372
1.267GlnAsp: 1.267 ± 0.473
2.956GlnGlu: 2.956 ± 1.051
1.689GlnPhe: 1.689 ± 1.491
1.689GlnGly: 1.689 ± 0.829
1.267GlnHis: 1.267 ± 0.569
1.689GlnIle: 1.689 ± 0.576
2.534GlnLys: 2.534 ± 1.773
3.801GlnLeu: 3.801 ± 1.324
0.422GlnMet: 0.422 ± 0.358
4.645GlnAsn: 4.645 ± 1.815
2.534GlnPro: 2.534 ± 0.975
2.111GlnGln: 2.111 ± 0.947
1.267GlnArg: 1.267 ± 0.59
3.801GlnSer: 3.801 ± 0.894
3.801GlnThr: 3.801 ± 1.306
1.689GlnVal: 1.689 ± 0.553
0.845GlnTrp: 0.845 ± 0.705
2.111GlnTyr: 2.111 ± 0.718
0.0GlnXaa: 0.0 ± 0.0
Arg
5.068ArgAla: 5.068 ± 1.59
0.0ArgCys: 0.0 ± 0.0
4.223ArgAsp: 4.223 ± 0.946
1.689ArgGlu: 1.689 ± 0.653
2.534ArgPhe: 2.534 ± 1.261
2.111ArgGly: 2.111 ± 0.542
1.267ArgHis: 1.267 ± 0.87
3.801ArgIle: 3.801 ± 1.788
0.845ArgLys: 0.845 ± 0.694
5.068ArgLeu: 5.068 ± 1.641
1.689ArgMet: 1.689 ± 0.867
2.111ArgAsn: 2.111 ± 0.879
1.267ArgPro: 1.267 ± 0.485
1.689ArgGln: 1.689 ± 0.576
3.801ArgArg: 3.801 ± 1.447
4.645ArgSer: 4.645 ± 1.304
4.223ArgThr: 4.223 ± 1.042
3.378ArgVal: 3.378 ± 2.105
0.0ArgTrp: 0.0 ± 0.0
2.111ArgTyr: 2.111 ± 0.542
0.0ArgXaa: 0.0 ± 0.0
Ser
6.757SerAla: 6.757 ± 1.677
0.422SerCys: 0.422 ± 0.587
4.645SerAsp: 4.645 ± 1.544
2.111SerGlu: 2.111 ± 1.114
2.111SerPhe: 2.111 ± 0.689
5.068SerGly: 5.068 ± 1.819
0.845SerHis: 0.845 ± 0.507
3.801SerIle: 3.801 ± 1.192
6.757SerLys: 6.757 ± 1.572
6.334SerLeu: 6.334 ± 1.987
4.645SerMet: 4.645 ± 0.861
4.223SerAsn: 4.223 ± 0.782
3.378SerPro: 3.378 ± 1.062
3.378SerGln: 3.378 ± 1.131
6.334SerArg: 6.334 ± 1.218
7.179SerSer: 7.179 ± 1.633
5.49SerThr: 5.49 ± 1.413
5.068SerVal: 5.068 ± 1.852
0.845SerTrp: 0.845 ± 0.62
2.111SerTyr: 2.111 ± 0.714
0.0SerXaa: 0.0 ± 0.0
Thr
7.601ThrAla: 7.601 ± 1.62
0.845ThrCys: 0.845 ± 0.568
4.223ThrAsp: 4.223 ± 1.364
2.111ThrGlu: 2.111 ± 1.162
2.111ThrPhe: 2.111 ± 0.853
1.267ThrGly: 1.267 ± 0.427
1.267ThrHis: 1.267 ± 0.534
4.223ThrIle: 4.223 ± 1.463
5.49ThrLys: 5.49 ± 1.295
6.757ThrLeu: 6.757 ± 1.681
0.422ThrMet: 0.422 ± 0.372
4.645ThrAsn: 4.645 ± 0.903
1.689ThrPro: 1.689 ± 0.522
5.068ThrGln: 5.068 ± 0.975
3.801ThrArg: 3.801 ± 1.142
6.757ThrSer: 6.757 ± 1.746
4.645ThrThr: 4.645 ± 1.41
3.378ThrVal: 3.378 ± 1.008
0.845ThrTrp: 0.845 ± 0.434
1.267ThrTyr: 1.267 ± 0.856
0.0ThrXaa: 0.0 ± 0.0
Val
6.334ValAla: 6.334 ± 1.341
0.0ValCys: 0.0 ± 0.0
2.534ValAsp: 2.534 ± 0.906
2.534ValGlu: 2.534 ± 1.033
2.111ValPhe: 2.111 ± 0.794
5.912ValGly: 5.912 ± 1.509
2.956ValHis: 2.956 ± 0.978
1.267ValIle: 1.267 ± 0.718
1.689ValLys: 1.689 ± 0.655
7.179ValLeu: 7.179 ± 1.695
0.422ValMet: 0.422 ± 0.408
3.801ValAsn: 3.801 ± 1.245
4.645ValPro: 4.645 ± 1.802
3.378ValGln: 3.378 ± 1.373
3.378ValArg: 3.378 ± 0.626
2.956ValSer: 2.956 ± 1.538
3.801ValThr: 3.801 ± 0.998
2.956ValVal: 2.956 ± 0.815
0.422ValTrp: 0.422 ± 0.408
2.956ValTyr: 2.956 ± 0.918
0.0ValXaa: 0.0 ± 0.0
Trp
0.422TrpAla: 0.422 ± 0.353
0.0TrpCys: 0.0 ± 0.0
0.422TrpAsp: 0.422 ± 0.353
0.422TrpGlu: 0.422 ± 0.358
0.422TrpPhe: 0.422 ± 0.606
0.422TrpGly: 0.422 ± 0.353
0.0TrpHis: 0.0 ± 0.0
0.845TrpIle: 0.845 ± 0.534
0.845TrpLys: 0.845 ± 0.624
0.845TrpLeu: 0.845 ± 0.434
0.422TrpMet: 0.422 ± 0.372
1.689TrpAsn: 1.689 ± 0.633
1.689TrpPro: 1.689 ± 0.867
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.845TrpSer: 0.845 ± 0.507
2.534TrpThr: 2.534 ± 0.718
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
2.111TrpTyr: 2.111 ± 0.689
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.956TyrAla: 2.956 ± 0.776
0.0TyrCys: 0.0 ± 0.0
3.801TyrAsp: 3.801 ± 1.11
1.267TyrGlu: 1.267 ± 0.912
5.068TyrPhe: 5.068 ± 0.905
2.956TyrGly: 2.956 ± 0.732
0.0TyrHis: 0.0 ± 0.0
0.422TyrIle: 0.422 ± 0.353
0.845TyrLys: 0.845 ± 0.416
2.111TyrLeu: 2.111 ± 1.134
1.267TyrMet: 1.267 ± 0.57
2.111TyrAsn: 2.111 ± 0.793
1.267TyrPro: 1.267 ± 0.585
1.267TyrGln: 1.267 ± 0.7
2.534TyrArg: 2.534 ± 0.988
2.111TyrSer: 2.111 ± 0.794
0.845TyrThr: 0.845 ± 0.416
4.645TyrVal: 4.645 ± 1.413
0.0TyrTrp: 0.0 ± 0.0
0.422TyrTyr: 0.422 ± 0.408
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (2369 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski