Amino acid dipeptide frequency for Chaetoceros protobacilladnavirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins and all amino acids were equally frequent, each dipeptide would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
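The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function names, the one-letter sequence input, the choice to count overlapping residue pairs within each protein, and the handling of non-standard letters as X are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences, include_xaa=False):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each protein, never across proteins.
    """
    alphabet = STANDARD + ("X" if include_xaa else "")
    counts = Counter()
    for seq in sequences:
        # Map every letter outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    if not include_xaa:
        # Ignore pairs that contain an unknown residue.
        counts = Counter({k: v for k, v in counts.items() if "X" not in k})
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def uniform_expectation_per_mille(include_xaa=False):
    """Frequency of each dipeptide if all were equally likely (2.5 per mille for 400)."""
    n = 21 if include_xaa else 20
    return 1000.0 / (n * n)


# Toy input, not real proteome data.
freqs = dipeptide_per_mille(["MKRKRA", "GGSR"])
expected = uniform_expectation_per_mille()
over = sorted(dp for dp, f in freqs.items() if f > expected)
```

Dipeptides listed in `over` correspond to the cells that would be marked red under the uniform expectation; those below `expected` would be marked blue.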

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
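As a worked example of the unit conversion, the short Python sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the input value is the LysArg entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille, n_residue_types=20):
    """Ratio of an observed frequency to the uniform expectation 1 / n^2."""
    return per_mille_to_fraction(value_per_mille) * n_residue_types ** 2


lys_arg = 12.79  # LysArg value from the table, in per mille
fraction = per_mille_to_fraction(lys_arg)   # about 0.01279, i.e. about 1.279%
fold = round(fold_over_uniform(lys_arg), 3)  # about 5.1 times the 0.25% expectation
```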

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.599 ± 1.419
AlaCys: 0.799 ± 0.71
AlaAsp: 3.197 ± 2.038
AlaGlu: 6.395 ± 0.915
AlaPhe: 4.796 ± 0.635
AlaGly: 4.796 ± 0.635
AlaHis: 2.398 ± 1.303
AlaIle: 3.997 ± 2.774
AlaLys: 5.596 ± 2.463
AlaLeu: 2.398 ± 2.288
AlaMet: 2.398 ± 1.479
AlaAsn: 2.398 ± 1.134
AlaPro: 3.197 ± 2.038
AlaGln: 0.799 ± 0.763
AlaArg: 2.398 ± 1.003
AlaSer: 3.197 ± 1.331
AlaThr: 2.398 ± 1.398
AlaVal: 4.796 ± 0.587
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.398 ± 1.303
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.799 ± 0.592
CysCys: 0.0 ± 0.0
CysAsp: 0.799 ± 0.592
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.799 ± 0.592
CysLeu: 0.799 ± 0.592
CysMet: 0.799 ± 0.71
CysAsn: 0.799 ± 0.592
CysPro: 1.599 ± 1.184
CysGln: 0.0 ± 0.0
CysArg: 2.398 ± 1.134
CysSer: 2.398 ± 1.776
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.799 ± 0.763
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.599 ± 0.666
AspCys: 0.0 ± 0.0
AspAsp: 2.398 ± 1.776
AspGlu: 3.997 ± 0.913
AspPhe: 2.398 ± 0.909
AspGly: 6.395 ± 1.716
AspHis: 0.0 ± 0.0
AspIle: 1.599 ± 0.561
AspLys: 1.599 ± 1.525
AspLeu: 4.796 ± 1.471
AspMet: 1.599 ± 0.561
AspAsn: 1.599 ± 1.419
AspPro: 6.395 ± 0.66
AspGln: 3.997 ± 0.913
AspArg: 4.796 ± 2.606
AspSer: 3.997 ± 1.988
AspThr: 3.997 ± 0.913
AspVal: 3.197 ± 1.807
AspTrp: 1.599 ± 0.561
AspTyr: 1.599 ± 0.666
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.197 ± 1.01
GluCys: 0.799 ± 0.592
GluAsp: 4.796 ± 2.627
GluGlu: 3.997 ± 0.468
GluPhe: 3.197 ± 0.33
GluGly: 3.197 ± 1.01
GluHis: 0.799 ± 0.592
GluIle: 1.599 ± 1.184
GluLys: 3.197 ± 1.331
GluLeu: 4.796 ± 1.997
GluMet: 1.599 ± 1.184
GluAsn: 3.997 ± 1.547
GluPro: 2.398 ± 1.776
GluGln: 2.398 ± 0.293
GluArg: 1.599 ± 0.666
GluSer: 6.395 ± 1.696
GluThr: 3.197 ± 2.368
GluVal: 4.796 ± 0.587
GluTrp: 2.398 ± 1.303
GluTyr: 0.799 ± 0.71
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.799 ± 0.71
PheCys: 0.799 ± 0.592
PheAsp: 1.599 ± 0.666
PheGlu: 3.997 ± 1.642
PhePhe: 1.599 ± 1.184
PheGly: 0.799 ± 0.592
PheHis: 5.596 ± 3.204
PheIle: 2.398 ± 0.293
PheLys: 0.799 ± 0.592
PheLeu: 1.599 ± 0.874
PheMet: 0.799 ± 0.71
PheAsn: 4.796 ± 2.268
PhePro: 2.398 ± 0.909
PheGln: 3.197 ± 2.185
PheArg: 0.799 ± 0.71
PheSer: 3.197 ± 1.428
PheThr: 1.599 ± 1.184
PheVal: 0.799 ± 0.71
PheTrp: 2.398 ± 1.776
PheTyr: 0.799 ± 0.592
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.395 ± 2.047
GlyCys: 0.0 ± 0.0
GlyAsp: 0.799 ± 0.592
GlyGlu: 0.799 ± 0.592
GlyPhe: 3.197 ± 2.038
GlyGly: 7.194 ± 0.391
GlyHis: 1.599 ± 0.666
GlyIle: 4.796 ± 2.958
GlyLys: 1.599 ± 1.184
GlyLeu: 6.395 ± 2.568
GlyMet: 0.799 ± 0.71
GlyAsn: 3.197 ± 0.858
GlyPro: 1.599 ± 0.666
GlyGln: 5.596 ± 2.32
GlyArg: 7.994 ± 1.554
GlySer: 5.596 ± 1.861
GlyThr: 4.796 ± 1.503
GlyVal: 6.395 ± 2.895
GlyTrp: 1.599 ± 0.561
GlyTyr: 2.398 ± 1.398
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.796 ± 0.635
HisCys: 0.799 ± 0.71
HisAsp: 1.599 ± 0.874
HisGlu: 1.599 ± 1.184
HisPhe: 0.799 ± 0.592
HisGly: 0.799 ± 0.592
HisHis: 0.799 ± 0.71
HisIle: 1.599 ± 1.184
HisLys: 3.997 ± 1.596
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 1.599 ± 1.525
HisPro: 3.997 ± 0.913
HisGln: 1.599 ± 0.666
HisArg: 0.799 ± 0.592
HisSer: 2.398 ± 1.003
HisThr: 0.799 ± 0.763
HisVal: 2.398 ± 0.909
HisTrp: 0.0 ± 0.0
HisTyr: 1.599 ± 1.184
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.796 ± 2.623
IleCys: 0.0 ± 0.0
IleAsp: 3.997 ± 2.221
IleGlu: 3.197 ± 1.01
IlePhe: 0.799 ± 0.592
IleGly: 3.997 ± 0.468
IleHis: 0.0 ± 0.0
IleIle: 2.398 ± 0.293
IleLys: 2.398 ± 1.479
IleLeu: 2.398 ± 1.776
IleMet: 2.398 ± 1.413
IleAsn: 3.197 ± 0.33
IlePro: 1.599 ± 1.419
IleGln: 2.398 ± 0.909
IleArg: 3.197 ± 0.33
IleSer: 3.197 ± 2.185
IleThr: 0.799 ± 0.763
IleVal: 2.398 ± 1.776
IleTrp: 0.0 ± 0.0
IleTyr: 1.599 ± 0.561
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.395 ± 3.776
LysCys: 0.799 ± 0.592
LysAsp: 2.398 ± 0.293
LysGlu: 5.596 ± 2.23
LysPhe: 2.398 ± 1.776
LysGly: 2.398 ± 0.909
LysHis: 3.197 ± 0.33
LysIle: 0.799 ± 0.71
LysLys: 7.194 ± 3.737
LysLeu: 5.596 ± 1.381
LysMet: 2.398 ± 1.009
LysAsn: 1.599 ± 1.525
LysPro: 1.599 ± 0.561
LysGln: 4.796 ± 1.471
LysArg: 12.79 ± 1.83
LysSer: 3.997 ± 0.468
LysThr: 4.796 ± 1.156
LysVal: 3.197 ± 2.029
LysTrp: 0.0 ± 0.0
LysTyr: 3.197 ± 1.331
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.796 ± 3.665
LeuCys: 0.799 ± 0.592
LeuAsp: 5.596 ± 1.381
LeuGlu: 3.997 ± 0.468
LeuPhe: 3.197 ± 0.33
LeuGly: 4.796 ± 1.768
LeuHis: 3.197 ± 0.858
LeuIle: 1.599 ± 1.419
LeuLys: 5.596 ± 0.991
LeuLeu: 7.194 ± 0.863
LeuMet: 0.0 ± 0.0
LeuAsn: 4.796 ± 1.503
LeuPro: 1.599 ± 1.184
LeuGln: 1.599 ± 0.874
LeuArg: 0.799 ± 0.71
LeuSer: 5.596 ± 0.198
LeuThr: 3.997 ± 0.913
LeuVal: 2.398 ± 1.303
LeuTrp: 2.398 ± 1.134
LeuTyr: 0.799 ± 0.592
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.599 ± 1.419
MetCys: 0.0 ± 0.0
MetAsp: 0.799 ± 0.71
MetGlu: 2.398 ± 1.303
MetPhe: 0.799 ± 0.592
MetGly: 2.398 ± 0.293
MetHis: 0.799 ± 0.71
MetIle: 1.599 ± 0.874
MetLys: 0.0 ± 0.0
MetLeu: 1.599 ± 0.561
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 0.799 ± 0.592
MetSer: 0.799 ± 0.763
MetThr: 2.398 ± 1.134
MetVal: 2.398 ± 1.134
MetTrp: 0.799 ± 0.592
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.398 ± 1.134
AsnCys: 1.599 ± 1.184
AsnAsp: 2.398 ± 0.909
AsnGlu: 3.197 ± 1.331
AsnPhe: 0.799 ± 0.71
AsnGly: 3.197 ± 1.807
AsnHis: 1.599 ± 1.184
AsnIle: 0.799 ± 0.763
AsnLys: 3.197 ± 0.858
AsnLeu: 2.398 ± 1.479
AsnMet: 0.799 ± 0.71
AsnAsn: 3.197 ± 2.038
AsnPro: 4.796 ± 1.156
AsnGln: 0.0 ± 0.0
AsnArg: 0.799 ± 0.592
AsnSer: 3.197 ± 2.839
AsnThr: 0.0 ± 0.0
AsnVal: 8.793 ± 3.095
AsnTrp: 0.799 ± 0.71
AsnTyr: 2.398 ± 1.303
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.599 ± 0.561
ProCys: 0.799 ± 0.592
ProAsp: 4.796 ± 1.818
ProGlu: 2.398 ± 1.776
ProPhe: 4.796 ± 0.635
ProGly: 3.997 ± 1.642
ProHis: 0.799 ± 0.592
ProIle: 1.599 ± 1.184
ProLys: 4.796 ± 1.471
ProLeu: 5.596 ± 0.198
ProMet: 0.0 ± 0.0
ProAsn: 2.398 ± 1.398
ProPro: 3.197 ± 0.33
ProGln: 0.799 ± 0.592
ProArg: 5.596 ± 1.068
ProSer: 4.796 ± 1.682
ProThr: 7.194 ± 2.399
ProVal: 2.398 ± 0.293
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.599 ± 1.525
GlnCys: 0.799 ± 0.592
GlnAsp: 0.799 ± 0.592
GlnGlu: 3.197 ± 0.33
GlnPhe: 1.599 ± 0.666
GlnGly: 0.799 ± 0.71
GlnHis: 0.0 ± 0.0
GlnIle: 1.599 ± 1.184
GlnLys: 4.796 ± 2.525
GlnLeu: 2.398 ± 1.003
GlnMet: 0.0 ± 0.0
GlnAsn: 3.197 ± 2.038
GlnPro: 2.398 ± 1.134
GlnGln: 1.599 ± 1.184
GlnArg: 4.796 ± 0.888
GlnSer: 3.997 ± 1.596
GlnThr: 3.197 ± 1.121
GlnVal: 0.799 ± 0.592
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.799 ± 0.592
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.197 ± 0.858
ArgCys: 1.599 ± 0.666
ArgAsp: 5.596 ± 1.277
ArgGlu: 1.599 ± 1.184
ArgPhe: 3.997 ± 0.468
ArgGly: 4.796 ± 1.997
ArgHis: 2.398 ± 1.303
ArgIle: 3.997 ± 1.162
ArgLys: 10.392 ± 2.703
ArgLeu: 4.796 ± 1.818
ArgMet: 0.799 ± 0.592
ArgAsn: 2.398 ± 1.134
ArgPro: 7.194 ± 2.774
ArgGln: 2.398 ± 1.003
ArgArg: 7.994 ± 2.804
ArgSer: 6.395 ± 2.047
ArgThr: 3.997 ± 1.642
ArgVal: 3.197 ± 1.428
ArgTrp: 0.799 ± 0.592
ArgTyr: 1.599 ± 0.666
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.796 ± 1.471
SerCys: 0.0 ± 0.0
SerAsp: 7.194 ± 1.249
SerGlu: 3.997 ± 1.596
SerPhe: 1.599 ± 0.561
SerGly: 8.793 ± 2.893
SerHis: 1.599 ± 1.184
SerIle: 2.398 ± 0.909
SerLys: 4.796 ± 2.627
SerLeu: 3.997 ± 0.468
SerMet: 0.799 ± 0.819
SerAsn: 2.398 ± 1.398
SerPro: 2.398 ± 0.293
SerGln: 1.599 ± 1.419
SerArg: 10.392 ± 1.074
SerSer: 6.395 ± 2.766
SerThr: 4.796 ± 1.682
SerVal: 3.197 ± 0.33
SerTrp: 1.599 ± 1.184
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.398 ± 1.003
ThrCys: 0.0 ± 0.0
ThrAsp: 1.599 ± 1.184
ThrGlu: 3.197 ± 1.506
ThrPhe: 0.799 ± 0.592
ThrGly: 5.596 ± 0.991
ThrHis: 2.398 ± 0.293
ThrIle: 2.398 ± 1.134
ThrLys: 6.395 ± 2.28
ThrLeu: 3.997 ± 1.547
ThrMet: 0.0 ± 0.0
ThrAsn: 2.398 ± 0.909
ThrPro: 4.796 ± 1.818
ThrGln: 1.599 ± 0.561
ThrArg: 3.197 ± 2.185
ThrSer: 3.997 ± 2.221
ThrThr: 9.592 ± 2.629
ThrVal: 3.197 ± 1.121
ThrTrp: 2.398 ± 0.909
ThrTyr: 1.599 ± 1.184
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.997 ± 0.468
ValCys: 1.599 ± 0.561
ValAsp: 3.997 ± 0.744
ValGlu: 3.997 ± 0.468
ValPhe: 1.599 ± 0.561
ValGly: 3.197 ± 2.038
ValHis: 3.997 ± 0.913
ValIle: 4.796 ± 0.587
ValLys: 5.596 ± 1.381
ValLeu: 3.197 ± 0.858
ValMet: 2.398 ± 1.134
ValAsn: 2.398 ± 1.003
ValPro: 3.197 ± 1.807
ValGln: 2.398 ± 1.003
ValArg: 3.197 ± 1.807
ValSer: 3.997 ± 1.162
ValThr: 1.599 ± 0.666
ValVal: 3.197 ± 0.858
ValTrp: 0.799 ± 0.592
ValTyr: 2.398 ± 0.293
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.599 ± 0.666
TrpCys: 0.0 ± 0.0
TrpAsp: 1.599 ± 1.184
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.599 ± 0.561
TrpGly: 2.398 ± 1.003
TrpHis: 0.0 ± 0.0
TrpIle: 0.799 ± 0.71
TrpLys: 0.0 ± 0.0
TrpLeu: 0.799 ± 0.71
TrpMet: 0.799 ± 0.592
TrpAsn: 0.0 ± 0.0
TrpPro: 1.599 ± 0.561
TrpGln: 0.799 ± 0.592
TrpArg: 3.197 ± 1.121
TrpSer: 0.0 ± 0.0
TrpThr: 1.599 ± 1.184
TrpVal: 1.599 ± 0.666
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.599 ± 0.666
TyrCys: 0.799 ± 0.592
TyrAsp: 1.599 ± 0.666
TyrGlu: 1.599 ± 1.184
TyrPhe: 0.799 ± 0.592
TyrGly: 3.197 ± 0.858
TyrHis: 0.799 ± 0.592
TyrIle: 3.997 ± 1.766
TyrLys: 2.398 ± 1.479
TyrLeu: 0.0 ± 0.0
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 1.599 ± 0.561
TyrGln: 0.799 ± 0.763
TyrArg: 1.599 ± 1.184
TyrSer: 0.0 ± 0.0
TyrThr: 0.799 ± 0.592
TyrVal: 2.398 ± 1.003
TyrTrp: 0.799 ± 0.763
TyrTyr: 1.599 ± 0.561
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1252 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
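A protein-level bootstrap of this kind could look like the Python sketch below. The page does not describe the exact procedure, so the details here are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the standard deviation of the replicate values. The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs in all proteins."""
    hits = total = 0
    for seq in sequences:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input, not the real proteome.
proteins = ["MKRKRA", "GGSR", "AKRT"]
kr_error = bootstrap_error(proteins, "KR")
ww_error = bootstrap_error(proteins, "WW")  # dipeptide absent from every protein
```

A dipeptide that never occurs has the same value (zero) in every replicate, which is consistent with the `0.0 ± 0.0` entries in the table. With only 3 proteins, each replicate draws from very few distinct units, so the estimated errors are necessarily coarse.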

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski