Amino acid dipeptide frequency for Chaetoceros tenuissimus RNA virus type-II

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
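The counting behind such a table can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: the function names, the use of one-letter codes, and the mapping of every non-standard residue to X (standing in for Xaa) are assumptions. Pairs are counted with overlap inside each protein and never across two proteins; that choice is consistent with the table, where the smallest non-zero value, 0.382‰, matches 1 occurrence among 2622 - 2 = 2620 pairs.

```python
from collections import Counter

# 20 standard residues in one-letter code (the page itself uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Frequency of each ordered adjacent pair, in per mille of all pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def expected_uniform_permille(alphabet_size=20):
    """Per mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_permille(["MKLLA", "MKV"])
expected = expected_uniform_permille()  # 2.5 for 400 dipeptides
over = sorted(p for p, f in freqs.items() if f > expected)
```

With real data, `over` would correspond to the cells marked in red; passing `alphabet_size=21` gives the expectation for the 441-dipeptide case instead.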

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
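As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and the value 8.394 is simply the LeuSer entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def fold_over_uniform(value_permille, alphabet_size=20):
    """Ratio of an observed per mille value to the uniform expectation."""
    expected_permille = 1000.0 / alphabet_size ** 2  # 2.5 for 400 dipeptides
    return value_permille / expected_permille

leu_ser = 8.394                              # LeuSer, in per mille
fraction = permille_to_fraction(leu_ser)     # about 0.008394
percent = 100.0 * fraction                   # about 0.8394 %
fold = fold_over_uniform(leu_ser)            # about 3.36 times the uniform value
```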

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.105 ± 1.774
AlaCys: 1.526 ± 0.444
AlaAsp: 5.723 ± 0.677
AlaGlu: 2.289 ± 0.008
AlaPhe: 3.815 ± 0.206
AlaGly: 3.815 ± 1.766
AlaHis: 1.145 ± 0.004
AlaIle: 2.671 ± 0.868
AlaLys: 4.96 ± 0.86
AlaLeu: 6.105 ± 0.856
AlaMet: 1.908 ± 1.089
AlaAsn: 3.052 ± 0.887
AlaPro: 4.96 ± 1.77
AlaGln: 2.671 ± 0.21
AlaArg: 3.434 ± 0.646
AlaSer: 6.105 ± 1.774
AlaThr: 4.96 ± 2.428
AlaVal: 4.578 ± 1.331
AlaTrp: 1.908 ± 0.226
AlaTyr: 1.908 ± 1.089
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.145 ± 0.654
CysCys: 0.0 ± 0.0
CysAsp: 0.763 ± 0.436
CysGlu: 1.145 ± 0.004
CysPhe: 0.0 ± 0.0
CysGly: 3.052 ± 1.086
CysHis: 0.0 ± 0.0
CysIle: 1.908 ± 0.883
CysLys: 2.289 ± 0.008
CysLeu: 1.526 ± 0.444
CysMet: 0.763 ± 0.436
CysAsn: 0.763 ± 0.436
CysPro: 0.763 ± 0.436
CysGln: 0.763 ± 0.222
CysArg: 0.382 ± 0.218
CysSer: 1.145 ± 1.319
CysThr: 1.145 ± 0.654
CysVal: 1.145 ± 0.654
CysTrp: 0.0 ± 0.0
CysTyr: 0.763 ± 0.436
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.197 ± 0.233
AspCys: 0.0 ± 0.0
AspAsp: 5.723 ± 1.296
AspGlu: 4.197 ± 0.233
AspPhe: 3.052 ± 0.887
AspGly: 4.197 ± 0.891
AspHis: 1.145 ± 0.654
AspIle: 3.052 ± 0.887
AspLys: 2.671 ± 0.868
AspLeu: 5.723 ± 1.296
AspMet: 1.145 ± 0.654
AspAsn: 3.434 ± 1.327
AspPro: 3.434 ± 1.327
AspGln: 2.289 ± 1.323
AspArg: 3.434 ± 0.646
AspSer: 3.815 ± 0.451
AspThr: 3.434 ± 1.327
AspVal: 4.578 ± 0.673
AspTrp: 1.526 ± 0.214
AspTyr: 3.052 ± 0.428
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.526 ± 0.872
GluCys: 1.908 ± 0.226
GluAsp: 4.197 ± 0.891
GluGlu: 3.815 ± 2.179
GluPhe: 5.341 ± 0.42
GluGly: 3.052 ± 1.086
GluHis: 1.526 ± 0.214
GluIle: 4.96 ± 0.202
GluLys: 3.815 ± 1.521
GluLeu: 6.105 ± 1.117
GluMet: 0.763 ± 0.222
GluAsn: 1.526 ± 0.444
GluPro: 1.526 ± 0.872
GluGln: 1.526 ± 0.872
GluArg: 2.671 ± 0.868
GluSer: 2.671 ± 0.21
GluThr: 4.197 ± 0.891
GluVal: 2.671 ± 1.525
GluTrp: 1.908 ± 1.089
GluTyr: 2.289 ± 0.65
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.434 ± 0.012
PheCys: 1.908 ± 0.226
PheAsp: 4.578 ± 0.673
PheGlu: 1.908 ± 0.226
PhePhe: 1.908 ± 0.226
PheGly: 3.052 ± 0.23
PheHis: 1.526 ± 0.444
PheIle: 1.526 ± 0.444
PheLys: 4.578 ± 0.642
PheLeu: 4.197 ± 1.082
PheMet: 1.145 ± 0.654
PheAsn: 3.434 ± 0.012
PhePro: 2.671 ± 0.21
PheGln: 3.434 ± 0.646
PheArg: 3.434 ± 0.646
PheSer: 0.763 ± 0.436
PheThr: 3.434 ± 0.012
PheVal: 3.052 ± 0.887
PheTrp: 0.382 ± 0.44
PheTyr: 1.145 ± 0.661
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.815 ± 0.451
GlyCys: 0.0 ± 0.0
GlyAsp: 4.96 ± 0.202
GlyGlu: 4.197 ± 0.233
GlyPhe: 3.052 ± 0.23
GlyGly: 4.578 ± 1.331
GlyHis: 0.382 ± 0.218
GlyIle: 3.052 ± 0.428
GlyLys: 2.671 ± 0.868
GlyLeu: 4.96 ± 0.202
GlyMet: 1.908 ± 0.226
GlyAsn: 3.815 ± 1.766
GlyPro: 1.526 ± 0.444
GlyGln: 1.526 ± 1.101
GlyArg: 2.289 ± 0.008
GlySer: 3.434 ± 0.012
GlyThr: 5.723 ± 0.677
GlyVal: 5.723 ± 0.638
GlyTrp: 1.145 ± 0.661
GlyTyr: 2.671 ± 0.868
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.434 ± 0.012
HisCys: 0.0 ± 0.0
HisAsp: 1.145 ± 0.661
HisGlu: 0.382 ± 0.218
HisPhe: 1.145 ± 0.661
HisGly: 1.908 ± 0.432
HisHis: 0.382 ± 0.218
HisIle: 1.526 ± 0.214
HisLys: 1.908 ± 0.432
HisLeu: 1.526 ± 0.872
HisMet: 0.382 ± 0.218
HisAsn: 1.526 ± 0.872
HisPro: 1.145 ± 0.004
HisGln: 0.763 ± 0.436
HisArg: 0.763 ± 0.222
HisSer: 2.671 ± 0.21
HisThr: 0.763 ± 0.436
HisVal: 1.526 ± 0.872
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.578 ± 0.673
IleCys: 0.763 ± 0.436
IleAsp: 3.052 ± 2.202
IleGlu: 5.341 ± 1.735
IlePhe: 2.671 ± 0.21
IleGly: 2.289 ± 0.665
IleHis: 2.671 ± 0.447
IleIle: 4.96 ± 2.175
IleLys: 2.671 ± 0.21
IleLeu: 1.908 ± 0.432
IleMet: 2.671 ± 0.868
IleAsn: 2.289 ± 0.008
IlePro: 2.671 ± 0.21
IleGln: 1.526 ± 1.101
IleArg: 2.671 ± 0.868
IleSer: 5.341 ± 1.078
IleThr: 3.815 ± 0.206
IleVal: 2.671 ± 0.868
IleTrp: 1.145 ± 0.004
IleTyr: 1.526 ± 0.214
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.815 ± 0.206
LysCys: 1.145 ± 0.654
LysAsp: 2.671 ± 0.21
LysGlu: 5.341 ± 2.393
LysPhe: 3.052 ± 1.086
LysGly: 3.052 ± 1.743
LysHis: 1.908 ± 0.432
LysIle: 5.723 ± 0.638
LysLys: 4.197 ± 0.424
LysLeu: 3.815 ± 0.206
LysMet: 0.382 ± 0.218
LysAsn: 4.197 ± 1.739
LysPro: 2.289 ± 0.665
LysGln: 1.526 ± 0.872
LysArg: 3.052 ± 1.743
LysSer: 4.197 ± 1.082
LysThr: 3.052 ± 1.086
LysVal: 4.96 ± 0.202
LysTrp: 1.526 ± 0.444
LysTyr: 1.908 ± 1.089
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.105 ± 1.774
LeuCys: 1.908 ± 0.432
LeuAsp: 6.868 ± 0.681
LeuGlu: 5.723 ± 1.953
LeuPhe: 4.197 ± 1.082
LeuGly: 3.052 ± 0.23
LeuHis: 1.526 ± 0.872
LeuIle: 4.96 ± 0.202
LeuLys: 4.578 ± 1.3
LeuLeu: 6.486 ± 0.899
LeuMet: 0.763 ± 0.436
LeuAsn: 6.486 ± 1.074
LeuPro: 4.197 ± 0.424
LeuGln: 2.289 ± 0.008
LeuArg: 3.815 ± 1.766
LeuSer: 8.394 ± 1.782
LeuThr: 4.578 ± 0.642
LeuVal: 5.723 ± 1.296
LeuTrp: 0.763 ± 0.436
LeuTyr: 1.526 ± 0.872
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.526 ± 0.872
MetCys: 1.526 ± 0.214
MetAsp: 1.145 ± 0.654
MetGlu: 1.526 ± 0.214
MetPhe: 0.382 ± 0.44
MetGly: 1.908 ± 0.226
MetHis: 0.382 ± 0.218
MetIle: 1.908 ± 0.432
MetLys: 1.908 ± 1.089
MetLeu: 0.763 ± 0.436
MetMet: 0.382 ± 0.218
MetAsn: 2.671 ± 1.525
MetPro: 2.671 ± 0.21
MetGln: 0.763 ± 0.222
MetArg: 1.145 ± 0.654
MetSer: 1.526 ± 0.214
MetThr: 1.526 ± 0.444
MetVal: 1.145 ± 0.004
MetTrp: 0.0 ± 0.0
MetTyr: 0.763 ± 0.222
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.197 ± 0.233
AsnCys: 1.145 ± 0.654
AsnAsp: 4.197 ± 0.891
AsnGlu: 1.908 ± 0.226
AsnPhe: 1.908 ± 0.432
AsnGly: 4.96 ± 0.455
AsnHis: 0.382 ± 0.218
AsnIle: 2.289 ± 1.307
AsnLys: 2.671 ± 0.21
AsnLeu: 4.197 ± 1.082
AsnMet: 0.763 ± 0.436
AsnAsn: 3.815 ± 0.206
AsnPro: 3.815 ± 1.109
AsnGln: 2.289 ± 0.008
AsnArg: 1.908 ± 0.226
AsnSer: 4.197 ± 1.549
AsnThr: 3.434 ± 0.669
AsnVal: 3.052 ± 1.086
AsnTrp: 0.382 ± 0.218
AsnTyr: 2.289 ± 1.323
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.289 ± 0.665
ProCys: 1.908 ± 0.432
ProAsp: 2.289 ± 0.65
ProGlu: 3.434 ± 0.012
ProPhe: 2.671 ± 1.763
ProGly: 2.671 ± 0.447
ProHis: 1.526 ± 0.214
ProIle: 3.434 ± 1.327
ProLys: 3.052 ± 0.428
ProLeu: 6.486 ± 0.416
ProMet: 0.382 ± 0.218
ProAsn: 1.526 ± 0.214
ProPro: 0.763 ± 0.436
ProGln: 1.908 ± 0.432
ProArg: 1.145 ± 0.004
ProSer: 1.908 ± 0.883
ProThr: 1.908 ± 0.226
ProVal: 2.671 ± 1.763
ProTrp: 0.763 ± 0.879
ProTyr: 3.052 ± 2.202
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.052 ± 0.428
GlnCys: 0.382 ± 0.218
GlnAsp: 1.908 ± 0.226
GlnGlu: 1.526 ± 0.444
GlnPhe: 1.526 ± 0.444
GlnGly: 2.289 ± 1.323
GlnHis: 0.763 ± 0.436
GlnIle: 0.763 ± 0.436
GlnLys: 1.908 ± 1.089
GlnLeu: 2.671 ± 0.21
GlnMet: 1.526 ± 0.444
GlnAsn: 1.908 ± 0.432
GlnPro: 0.763 ± 0.879
GlnGln: 1.526 ± 1.101
GlnArg: 2.289 ± 0.665
GlnSer: 3.052 ± 1.086
GlnThr: 1.526 ± 0.444
GlnVal: 1.526 ± 1.101
GlnTrp: 0.763 ± 0.222
GlnTyr: 1.908 ± 0.883
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.289 ± 0.008
ArgCys: 0.763 ± 0.222
ArgAsp: 3.052 ± 1.743
ArgGlu: 2.289 ± 0.65
ArgPhe: 3.434 ± 0.012
ArgGly: 2.289 ± 0.665
ArgHis: 2.289 ± 0.665
ArgIle: 1.908 ± 0.226
ArgLys: 2.289 ± 1.307
ArgLeu: 4.578 ± 0.673
ArgMet: 2.289 ± 0.65
ArgAsn: 1.908 ± 0.226
ArgPro: 1.145 ± 0.661
ArgGln: 1.908 ± 0.432
ArgArg: 3.815 ± 0.864
ArgSer: 2.671 ± 0.447
ArgThr: 2.671 ± 0.868
ArgVal: 2.671 ± 0.868
ArgTrp: 0.763 ± 0.436
ArgTyr: 1.526 ± 1.101
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.868 ± 1.338
SerCys: 0.763 ± 0.436
SerAsp: 3.052 ± 2.202
SerGlu: 4.197 ± 0.233
SerPhe: 3.052 ± 0.887
SerGly: 3.434 ± 0.646
SerHis: 1.145 ± 0.654
SerIle: 4.96 ± 0.455
SerLys: 7.249 ± 0.852
SerLeu: 5.723 ± 1.953
SerMet: 3.815 ± 0.571
SerAsn: 2.671 ± 0.447
SerPro: 3.052 ± 0.23
SerGln: 1.145 ± 0.661
SerArg: 2.289 ± 1.981
SerSer: 3.815 ± 1.521
SerThr: 4.197 ± 2.206
SerVal: 5.723 ± 0.019
SerTrp: 0.382 ± 0.218
SerTyr: 1.908 ± 0.883
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.341 ± 3.525
ThrCys: 0.763 ± 0.436
ThrAsp: 3.052 ± 0.23
ThrGlu: 1.908 ± 0.226
ThrPhe: 2.671 ± 0.447
ThrGly: 4.578 ± 0.016
ThrHis: 1.145 ± 0.654
ThrIle: 3.815 ± 0.206
ThrLys: 3.434 ± 0.646
ThrLeu: 5.341 ± 0.895
ThrMet: 1.526 ± 0.679
ThrAsn: 1.908 ± 0.226
ThrPro: 4.197 ± 1.549
ThrGln: 1.908 ± 0.432
ThrArg: 2.289 ± 0.008
ThrSer: 6.486 ± 1.556
ThrThr: 3.052 ± 2.202
ThrVal: 3.052 ± 1.545
ThrTrp: 1.145 ± 0.004
ThrTyr: 2.289 ± 0.65
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.868 ± 0.023
ValCys: 2.289 ± 0.008
ValAsp: 3.052 ± 0.428
ValGlu: 3.434 ± 0.646
ValPhe: 3.052 ± 1.743
ValGly: 4.578 ± 0.673
ValHis: 2.289 ± 0.65
ValIle: 1.526 ± 0.872
ValLys: 3.815 ± 1.521
ValLeu: 7.249 ± 0.463
ValMet: 0.763 ± 0.436
ValAsn: 3.434 ± 0.669
ValPro: 3.052 ± 0.428
ValGln: 1.526 ± 1.101
ValArg: 2.289 ± 0.65
ValSer: 2.671 ± 1.763
ValThr: 4.197 ± 1.549
ValVal: 2.671 ± 0.21
ValTrp: 0.382 ± 0.44
ValTyr: 2.289 ± 1.323
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.908 ± 0.432
TrpCys: 0.0 ± 0.0
TrpAsp: 0.763 ± 0.222
TrpGlu: 1.526 ± 0.214
TrpPhe: 1.145 ± 0.654
TrpGly: 0.382 ± 0.218
TrpHis: 0.0 ± 0.0
TrpIle: 1.145 ± 0.654
TrpLys: 0.0 ± 0.0
TrpLeu: 0.763 ± 0.436
TrpMet: 0.382 ± 0.218
TrpAsn: 0.763 ± 0.222
TrpPro: 1.145 ± 1.319
TrpGln: 1.145 ± 0.004
TrpArg: 1.145 ± 0.004
TrpSer: 1.145 ± 0.004
TrpThr: 1.145 ± 1.319
TrpVal: 0.763 ± 0.879
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.382 ± 0.218
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.145 ± 0.004
TyrCys: 0.763 ± 0.222
TyrAsp: 1.908 ± 0.432
TyrGlu: 1.908 ± 0.432
TyrPhe: 3.434 ± 0.012
TyrGly: 2.289 ± 0.008
TyrHis: 0.763 ± 0.222
TyrIle: 1.145 ± 0.004
TyrLys: 1.145 ± 0.654
TyrLeu: 3.815 ± 1.109
TyrMet: 1.526 ± 0.872
TyrAsn: 2.671 ± 1.105
TyrPro: 0.0 ± 0.0
TyrGln: 1.145 ± 1.319
TyrArg: 2.289 ± 0.008
TyrSer: 3.434 ± 0.012
TyrThr: 1.526 ± 0.214
TyrVal: 1.526 ± 0.444
TyrTrp: 0.763 ± 0.222
TyrTyr: 1.145 ± 1.319
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2622 amino acids)
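Because the red and blue markings do not survive in plain text, the table entries can be read back programmatically and re-labelled. The following is a rough sketch: the regular expression targets entries of the form `LeuSer: 8.394 ± 1.782` as they appear above, the function names are hypothetical, and the 2.5‰ threshold is the uniform expectation for 400 dipeptides, which may differ from the exact rule used for the colouring on the original page.

```python
import re

# Two three-letter residue codes, a colon, then "mean ± error".
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")

def parse_cell(line):
    """Return (first, second, mean, error) for a table entry, else None."""
    match = CELL.search(line)
    if match is None:
        return None  # e.g. a row label such as "Leu"
    first, second, mean, error = match.groups()
    return first, second, float(mean), float(error)

def label(mean_permille, expected_permille=2.5):
    """Classify a value against the assumed uniform expectation."""
    if mean_permille > expected_permille:
        return "over"
    if mean_permille < expected_permille:
        return "under"
    return "as expected"

entry = parse_cell("LeuSer: 8.394 ± 1.782")
status = label(entry[2])
```

`CELL.search` ignores any text before the residue pair, so a line with a leading copy of the value parses to the same tuple.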

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
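One plausible reading of bootstrapping at the protein level is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The page does not spell out the procedure, so the function names, the fixed seed, and the use of the sample standard deviation are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev

def pair_counts(seq):
    """Counts of overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread (in per mille) of a dipeptide frequency over protein resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Toy proteome (hypothetical sequences, one-letter codes).
error = bootstrap_error(["MKLLAAK", "AAKV"], "AA")
```

With only two proteins, each resample is one of four combinations (first twice, second twice, or one of each in either order), so the estimate can take only a few distinct values.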

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski