Amino acid dipepetide frequency for Chaetoceros protobacilladnavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.893AlaAla: 3.893 ± 1.728
1.112AlaCys: 1.112 ± 0.805
6.674AlaAsp: 6.674 ± 1.271
5.562AlaGlu: 5.562 ± 1.62
3.337AlaPhe: 3.337 ± 0.705
4.449AlaGly: 4.449 ± 1.32
2.225AlaHis: 2.225 ± 1.135
5.006AlaIle: 5.006 ± 1.821
5.562AlaLys: 5.562 ± 0.936
5.006AlaLeu: 5.006 ± 0.906
2.225AlaMet: 2.225 ± 0.995
2.781AlaAsn: 2.781 ± 1.37
5.006AlaPro: 5.006 ± 1.399
2.225AlaGln: 2.225 ± 1.446
3.893AlaArg: 3.893 ± 1.728
4.449AlaSer: 4.449 ± 1.486
2.781AlaThr: 2.781 ± 1.42
2.781AlaVal: 2.781 ± 1.723
0.0AlaTrp: 0.0 ± 0.0
2.225AlaTyr: 2.225 ± 1.098
0.0AlaXaa: 0.0 ± 0.0
Cys
0.556CysAla: 0.556 ± 0.402
0.0CysCys: 0.0 ± 0.0
0.556CysAsp: 0.556 ± 0.402
0.0CysGlu: 0.0 ± 0.0
0.556CysPhe: 0.556 ± 0.467
1.669CysGly: 1.669 ± 0.667
0.0CysHis: 0.0 ± 0.0
1.669CysIle: 1.669 ± 0.836
2.225CysLys: 2.225 ± 1.135
0.0CysLeu: 0.0 ± 0.0
1.112CysMet: 1.112 ± 0.609
0.556CysAsn: 0.556 ± 0.63
1.112CysPro: 1.112 ± 0.804
0.0CysGln: 0.0 ± 0.0
0.556CysArg: 0.556 ± 0.467
0.556CysSer: 0.556 ± 0.402
0.556CysThr: 0.556 ± 0.402
1.112CysVal: 1.112 ± 0.582
0.556CysTrp: 0.556 ± 0.402
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.225AspAla: 2.225 ± 0.739
0.0AspCys: 0.0 ± 0.0
8.343AspAsp: 8.343 ± 1.999
3.337AspGlu: 3.337 ± 1.034
1.112AspPhe: 1.112 ± 0.933
6.674AspGly: 6.674 ± 0.93
2.225AspHis: 2.225 ± 1.171
5.562AspIle: 5.562 ± 1.816
0.556AspLys: 0.556 ± 0.502
3.337AspLeu: 3.337 ± 1.641
2.225AspMet: 2.225 ± 1.164
3.337AspAsn: 3.337 ± 1.08
5.006AspPro: 5.006 ± 1.886
5.562AspGln: 5.562 ± 1.616
1.669AspArg: 1.669 ± 0.649
2.781AspSer: 2.781 ± 1.383
2.225AspThr: 2.225 ± 0.873
3.893AspVal: 3.893 ± 2.12
1.112AspTrp: 1.112 ± 0.423
3.337AspTyr: 3.337 ± 1.264
0.0AspXaa: 0.0 ± 0.0
Glu
3.337GluAla: 3.337 ± 1.281
1.669GluCys: 1.669 ± 0.681
2.781GluAsp: 2.781 ± 0.69
3.893GluGlu: 3.893 ± 1.786
3.893GluPhe: 3.893 ± 0.87
3.893GluGly: 3.893 ± 0.918
0.556GluHis: 0.556 ± 0.548
2.225GluIle: 2.225 ± 1.146
0.556GluLys: 0.556 ± 0.502
6.674GluLeu: 6.674 ± 1.569
0.556GluMet: 0.556 ± 0.63
1.669GluAsn: 1.669 ± 0.548
2.781GluPro: 2.781 ± 1.009
2.781GluGln: 2.781 ± 0.973
2.781GluArg: 2.781 ± 0.982
1.669GluSer: 1.669 ± 0.821
5.562GluThr: 5.562 ± 2.05
3.893GluVal: 3.893 ± 0.491
2.225GluTrp: 2.225 ± 1.098
0.556GluTyr: 0.556 ± 0.502
0.0GluXaa: 0.0 ± 0.0
Phe
1.669PheAla: 1.669 ± 1.4
0.556PheCys: 0.556 ± 0.63
2.225PheAsp: 2.225 ± 1.345
4.449PheGlu: 4.449 ± 1.074
2.781PhePhe: 2.781 ± 0.749
1.112PheGly: 1.112 ± 0.804
2.225PheHis: 2.225 ± 1.098
1.669PheIle: 1.669 ± 0.667
0.556PheLys: 0.556 ± 0.502
2.225PheLeu: 2.225 ± 1.931
1.112PheMet: 1.112 ± 0.726
3.337PheAsn: 3.337 ± 1.191
2.225PhePro: 2.225 ± 0.63
2.225PheGln: 2.225 ± 0.63
0.556PheArg: 0.556 ± 0.402
2.781PheSer: 2.781 ± 1.184
4.449PheThr: 4.449 ± 0.911
1.669PheVal: 1.669 ± 0.817
2.781PheTrp: 2.781 ± 1.546
1.112PheTyr: 1.112 ± 0.582
0.0PheXaa: 0.0 ± 0.0
Gly
7.786GlyAla: 7.786 ± 1.012
1.112GlyCys: 1.112 ± 0.804
2.225GlyAsp: 2.225 ± 1.218
3.337GlyGlu: 3.337 ± 1.584
2.781GlyPhe: 2.781 ± 1.222
8.343GlyGly: 8.343 ± 1.963
1.669GlyHis: 1.669 ± 0.546
1.112GlyIle: 1.112 ± 0.562
4.449GlyLys: 4.449 ± 1.821
7.786GlyLeu: 7.786 ± 2.131
1.112GlyMet: 1.112 ± 0.884
2.225GlyAsn: 2.225 ± 1.867
2.781GlyPro: 2.781 ± 1.199
2.781GlyGln: 2.781 ± 1.415
5.006GlyArg: 5.006 ± 2.088
4.449GlySer: 4.449 ± 1.378
6.118GlyThr: 6.118 ± 2.902
5.006GlyVal: 5.006 ± 1.785
1.669GlyTrp: 1.669 ± 0.406
2.781GlyTyr: 2.781 ± 1.37
0.0GlyXaa: 0.0 ± 0.0
His
1.669HisAla: 1.669 ± 1.206
0.556HisCys: 0.556 ± 0.467
1.669HisAsp: 1.669 ± 0.406
2.225HisGlu: 2.225 ± 0.778
1.669HisPhe: 1.669 ± 0.817
1.112HisGly: 1.112 ± 0.804
0.556HisHis: 0.556 ± 0.402
2.225HisIle: 2.225 ± 0.581
0.556HisLys: 0.556 ± 0.402
1.669HisLeu: 1.669 ± 0.681
0.0HisMet: 0.0 ± 0.0
1.112HisAsn: 1.112 ± 0.611
3.337HisPro: 3.337 ± 1.361
1.112HisGln: 1.112 ± 1.004
1.112HisArg: 1.112 ± 0.804
1.112HisSer: 1.112 ± 0.549
0.0HisThr: 0.0 ± 0.0
0.556HisVal: 0.556 ± 0.548
0.0HisTrp: 0.0 ± 0.0
2.225HisTyr: 2.225 ± 1.555
0.0HisXaa: 0.0 ± 0.0
Ile
3.337IleAla: 3.337 ± 1.694
1.669IleCys: 1.669 ± 0.667
7.23IleAsp: 7.23 ± 3.365
3.337IleGlu: 3.337 ± 1.24
3.337IlePhe: 3.337 ± 1.165
5.562IleGly: 5.562 ± 1.695
0.556IleHis: 0.556 ± 0.402
5.562IleIle: 5.562 ± 2.112
1.669IleLys: 1.669 ± 0.681
2.781IleLeu: 2.781 ± 0.973
1.669IleMet: 1.669 ± 0.667
2.225IleAsn: 2.225 ± 0.581
2.225IlePro: 2.225 ± 0.494
1.112IleGln: 1.112 ± 0.562
0.556IleArg: 0.556 ± 0.502
3.893IleSer: 3.893 ± 0.734
2.225IleThr: 2.225 ± 1.263
5.562IleVal: 5.562 ± 1.527
1.112IleTrp: 1.112 ± 0.582
1.669IleTyr: 1.669 ± 0.667
0.0IleXaa: 0.0 ± 0.0
Lys
5.562LysAla: 5.562 ± 1.566
0.0LysCys: 0.0 ± 0.0
3.337LysAsp: 3.337 ± 0.813
3.893LysGlu: 3.893 ± 1.449
2.781LysPhe: 2.781 ± 1.176
1.669LysGly: 1.669 ± 1.468
2.781LysHis: 2.781 ± 1.315
1.669LysIle: 1.669 ± 0.959
7.23LysLys: 7.23 ± 2.984
2.781LysLeu: 2.781 ± 0.547
1.112LysMet: 1.112 ± 0.423
2.225LysAsn: 2.225 ± 0.883
2.781LysPro: 2.781 ± 1.23
2.781LysGln: 2.781 ± 0.913
3.337LysArg: 3.337 ± 1.152
5.006LysSer: 5.006 ± 1.702
3.893LysThr: 3.893 ± 0.918
0.556LysVal: 0.556 ± 0.467
1.669LysTrp: 1.669 ± 0.726
1.669LysTyr: 1.669 ± 0.821
0.0LysXaa: 0.0 ± 0.0
Leu
6.674LeuAla: 6.674 ± 2.582
0.556LeuCys: 0.556 ± 0.402
6.674LeuAsp: 6.674 ± 2.273
3.893LeuGlu: 3.893 ± 0.491
2.225LeuPhe: 2.225 ± 0.819
5.006LeuGly: 5.006 ± 1.632
1.669LeuHis: 1.669 ± 0.795
3.337LeuIle: 3.337 ± 0.763
5.006LeuLys: 5.006 ± 0.928
4.449LeuLeu: 4.449 ± 1.626
1.112LeuMet: 1.112 ± 1.097
2.781LeuAsn: 2.781 ± 0.913
3.893LeuPro: 3.893 ± 0.999
1.669LeuGln: 1.669 ± 0.833
3.893LeuArg: 3.893 ± 1.527
4.449LeuSer: 4.449 ± 1.81
6.674LeuThr: 6.674 ± 3.292
3.337LeuVal: 3.337 ± 1.361
2.225LeuTrp: 2.225 ± 0.808
1.112LeuTyr: 1.112 ± 0.549
0.0LeuXaa: 0.0 ± 0.0
Met
2.781MetAla: 2.781 ± 1.283
0.0MetCys: 0.0 ± 0.0
2.781MetAsp: 2.781 ± 0.969
1.112MetGlu: 1.112 ± 0.423
0.0MetPhe: 0.0 ± 0.0
0.556MetGly: 0.556 ± 0.594
1.112MetHis: 1.112 ± 0.933
0.556MetIle: 0.556 ± 0.402
0.556MetLys: 0.556 ± 0.467
2.225MetLeu: 2.225 ± 0.679
0.556MetMet: 0.556 ± 0.548
2.225MetAsn: 2.225 ± 0.719
0.556MetPro: 0.556 ± 0.402
1.112MetGln: 1.112 ± 0.808
1.112MetArg: 1.112 ± 0.674
0.556MetSer: 0.556 ± 0.63
0.0MetThr: 0.0 ± 0.0
1.669MetVal: 1.669 ± 1.277
0.0MetTrp: 0.0 ± 0.0
2.781MetTyr: 2.781 ± 1.42
0.0MetXaa: 0.0 ± 0.0
Asn
2.781AsnAla: 2.781 ± 0.895
0.0AsnCys: 0.0 ± 0.0
2.225AsnAsp: 2.225 ± 1.146
1.112AsnGlu: 1.112 ± 0.78
1.112AsnPhe: 1.112 ± 0.582
3.337AsnGly: 3.337 ± 1.643
1.669AsnHis: 1.669 ± 0.667
2.781AsnIle: 2.781 ± 1.494
1.112AsnLys: 1.112 ± 0.933
6.118AsnLeu: 6.118 ± 2.458
0.556AsnMet: 0.556 ± 0.467
3.337AsnAsn: 3.337 ± 1.643
4.449AsnPro: 4.449 ± 0.473
0.556AsnGln: 0.556 ± 0.548
2.225AsnArg: 2.225 ± 0.873
3.337AsnSer: 3.337 ± 1.589
3.337AsnThr: 3.337 ± 1.012
3.893AsnVal: 3.893 ± 1.099
0.556AsnTrp: 0.556 ± 0.467
1.112AsnTyr: 1.112 ± 0.804
0.0AsnXaa: 0.0 ± 0.0
Pro
5.006ProAla: 5.006 ± 1.399
0.556ProCys: 0.556 ± 0.63
2.781ProAsp: 2.781 ± 1.546
2.225ProGlu: 2.225 ± 1.218
2.781ProPhe: 2.781 ± 0.973
3.337ProGly: 3.337 ± 0.813
1.112ProHis: 1.112 ± 0.582
4.449ProIle: 4.449 ± 2.032
3.893ProLys: 3.893 ± 0.685
5.562ProLeu: 5.562 ± 1.945
1.112ProMet: 1.112 ± 0.808
2.225ProAsn: 2.225 ± 0.947
3.337ProPro: 3.337 ± 1.805
3.337ProGln: 3.337 ± 0.855
4.449ProArg: 4.449 ± 1.568
3.337ProSer: 3.337 ± 1.69
5.562ProThr: 5.562 ± 1.038
3.893ProVal: 3.893 ± 1.237
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.225GlnAla: 2.225 ± 1.662
1.112GlnCys: 1.112 ± 0.804
1.112GlnAsp: 1.112 ± 0.804
1.669GlnGlu: 1.669 ± 0.795
2.225GlnPhe: 2.225 ± 1.176
2.225GlnGly: 2.225 ± 0.788
0.556GlnHis: 0.556 ± 0.467
2.225GlnIle: 2.225 ± 1.608
5.006GlnLys: 5.006 ± 1.014
2.781GlnLeu: 2.781 ± 0.53
0.0GlnMet: 0.0 ± 0.0
1.669GlnAsn: 1.669 ± 0.904
2.225GlnPro: 2.225 ± 1.233
1.669GlnGln: 1.669 ± 0.959
3.893GlnArg: 3.893 ± 1.449
3.893GlnSer: 3.893 ± 1.086
1.669GlnThr: 1.669 ± 0.795
0.556GlnVal: 0.556 ± 0.548
0.556GlnTrp: 0.556 ± 0.548
1.112GlnTyr: 1.112 ± 1.004
0.0GlnXaa: 0.0 ± 0.0
Arg
4.449ArgAla: 4.449 ± 1.478
1.112ArgCys: 1.112 ± 0.582
2.781ArgAsp: 2.781 ± 1.315
2.225ArgGlu: 2.225 ± 1.171
1.669ArgPhe: 1.669 ± 0.681
3.893ArgGly: 3.893 ± 2.17
0.0ArgHis: 0.0 ± 0.0
4.449ArgIle: 4.449 ± 1.413
4.449ArgLys: 4.449 ± 1.515
3.893ArgLeu: 3.893 ± 0.886
2.225ArgMet: 2.225 ± 0.549
2.781ArgAsn: 2.781 ± 0.561
2.781ArgPro: 2.781 ± 1.853
2.781ArgGln: 2.781 ± 0.8
11.123ArgArg: 11.123 ± 2.298
3.893ArgSer: 3.893 ± 1.62
3.337ArgThr: 3.337 ± 1.361
4.449ArgVal: 4.449 ± 1.704
0.556ArgTrp: 0.556 ± 0.402
1.112ArgTyr: 1.112 ± 0.582
0.0ArgXaa: 0.0 ± 0.0
Ser
5.006SerAla: 5.006 ± 1.21
1.112SerCys: 1.112 ± 0.804
3.893SerAsp: 3.893 ± 1.392
2.781SerGlu: 2.781 ± 0.832
1.112SerPhe: 1.112 ± 0.423
6.674SerGly: 6.674 ± 2.025
0.0SerHis: 0.0 ± 0.0
0.556SerIle: 0.556 ± 0.548
4.449SerLys: 4.449 ± 1.479
3.337SerLeu: 3.337 ± 1.114
0.556SerMet: 0.556 ± 0.559
5.562SerAsn: 5.562 ± 1.141
3.893SerPro: 3.893 ± 1.359
2.225SerGln: 2.225 ± 1.233
6.118SerArg: 6.118 ± 1.888
7.786SerSer: 7.786 ± 2.275
4.449SerThr: 4.449 ± 1.869
1.669SerVal: 1.669 ± 0.681
1.669SerTrp: 1.669 ± 0.817
2.225SerTyr: 2.225 ± 0.654
0.0SerXaa: 0.0 ± 0.0
Thr
4.449ThrAla: 4.449 ± 0.879
0.556ThrCys: 0.556 ± 0.402
1.669ThrAsp: 1.669 ± 0.726
2.781ThrGlu: 2.781 ± 0.846
2.225ThrPhe: 2.225 ± 0.501
7.786ThrGly: 7.786 ± 1.334
1.112ThrHis: 1.112 ± 0.423
5.562ThrIle: 5.562 ± 0.918
1.112ThrLys: 1.112 ± 0.423
4.449ThrLeu: 4.449 ± 1.101
3.337ThrMet: 3.337 ± 1.838
3.337ThrAsn: 3.337 ± 1.191
5.562ThrPro: 5.562 ± 1.638
2.225ThrGln: 2.225 ± 0.966
0.556ThrArg: 0.556 ± 0.63
5.562ThrSer: 5.562 ± 1.71
8.343ThrThr: 8.343 ± 3.293
5.006ThrVal: 5.006 ± 0.861
1.112ThrTrp: 1.112 ± 0.804
2.225ThrTyr: 2.225 ± 0.819
0.0ThrXaa: 0.0 ± 0.0
Val
2.781ValAla: 2.781 ± 0.675
0.556ValCys: 0.556 ± 0.402
2.781ValAsp: 2.781 ± 1.116
5.006ValGlu: 5.006 ± 1.826
1.112ValPhe: 1.112 ± 0.726
3.337ValGly: 3.337 ± 2.043
2.781ValHis: 2.781 ± 1.546
3.893ValIle: 3.893 ± 0.569
2.781ValLys: 2.781 ± 1.545
1.669ValLeu: 1.669 ± 0.871
0.556ValMet: 0.556 ± 0.402
0.556ValAsn: 0.556 ± 0.467
3.337ValPro: 3.337 ± 1.839
1.112ValGln: 1.112 ± 1.261
7.23ValArg: 7.23 ± 1.835
4.449ValSer: 4.449 ± 0.769
3.893ValThr: 3.893 ± 0.588
2.781ValVal: 2.781 ± 1.252
0.556ValTrp: 0.556 ± 0.548
3.893ValTyr: 3.893 ± 1.786
0.0ValXaa: 0.0 ± 0.0
Trp
1.669TrpAla: 1.669 ± 1.046
0.556TrpCys: 0.556 ± 0.402
1.112TrpAsp: 1.112 ± 0.804
0.0TrpGlu: 0.0 ± 0.0
1.669TrpPhe: 1.669 ± 0.681
2.225TrpGly: 2.225 ± 1.171
0.556TrpHis: 0.556 ± 0.402
1.669TrpIle: 1.669 ± 0.548
1.112TrpLys: 1.112 ± 0.549
1.669TrpLeu: 1.669 ± 0.821
0.0TrpMet: 0.0 ± 0.0
0.556TrpAsn: 0.556 ± 0.402
0.556TrpPro: 0.556 ± 0.402
0.0TrpGln: 0.0 ± 0.0
2.225TrpArg: 2.225 ± 0.846
0.556TrpSer: 0.556 ± 0.502
2.781TrpThr: 2.781 ± 1.086
0.556TrpVal: 0.556 ± 0.63
0.0TrpTrp: 0.0 ± 0.0
0.556TrpTyr: 0.556 ± 0.548
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.337TyrAla: 3.337 ± 1.251
0.556TyrCys: 0.556 ± 0.402
1.669TyrAsp: 1.669 ± 0.546
1.112TyrGlu: 1.112 ± 0.562
3.337TyrPhe: 3.337 ± 0.949
1.669TyrGly: 1.669 ± 0.972
1.112TyrHis: 1.112 ± 0.804
1.112TyrIle: 1.112 ± 1.004
3.893TyrLys: 3.893 ± 1.786
2.225TyrLeu: 2.225 ± 1.423
0.556TyrMet: 0.556 ± 0.548
1.112TyrAsn: 1.112 ± 0.582
1.112TyrPro: 1.112 ± 0.582
1.112TyrGln: 1.112 ± 0.582
1.669TyrArg: 1.669 ± 0.821
1.112TyrSer: 1.112 ± 0.423
1.112TyrThr: 1.112 ± 0.78
2.225TyrVal: 2.225 ± 0.501
1.669TyrTrp: 1.669 ± 0.972
1.669TyrTyr: 1.669 ± 0.667
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1799 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski