Amino acid dipepetide frequency for Tianjin totivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.496AlaAla: 9.496 ± 3.02
2.064AlaCys: 2.064 ± 1.115
2.477AlaAsp: 2.477 ± 1.378
2.064AlaGlu: 2.064 ± 1.149
4.129AlaPhe: 4.129 ± 0.788
6.606AlaGly: 6.606 ± 2.921
0.826AlaHis: 0.826 ± 0.459
6.606AlaIle: 6.606 ± 2.167
7.019AlaLys: 7.019 ± 2.13
6.193AlaLeu: 6.193 ± 1.183
1.652AlaMet: 1.652 ± 0.919
8.258AlaAsn: 8.258 ± 0.068
6.606AlaPro: 6.606 ± 2.167
2.064AlaGln: 2.064 ± 1.149
2.89AlaArg: 2.89 ± 0.099
4.542AlaSer: 4.542 ± 0.264
7.432AlaThr: 7.432 ± 1.117
5.78AlaVal: 5.78 ± 0.953
1.239AlaTrp: 1.239 ± 0.065
4.129AlaTyr: 4.129 ± 1.475
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.413CysAsp: 0.413 ± 0.23
1.652CysGlu: 1.652 ± 2.099
0.413CysPhe: 0.413 ± 0.23
0.826CysGly: 0.826 ± 0.295
0.413CysHis: 0.413 ± 0.525
0.826CysIle: 0.826 ± 0.295
1.652CysLys: 1.652 ± 1.344
0.826CysLeu: 0.826 ± 0.295
0.413CysMet: 0.413 ± 0.23
1.652CysAsn: 1.652 ± 0.919
0.413CysPro: 0.413 ± 0.23
0.826CysGln: 0.826 ± 0.459
0.826CysArg: 0.826 ± 0.295
0.0CysSer: 0.0 ± 0.0
0.413CysThr: 0.413 ± 0.525
0.413CysVal: 0.413 ± 0.23
0.826CysTrp: 0.826 ± 0.459
1.239CysTyr: 1.239 ± 0.065
0.0CysXaa: 0.0 ± 0.0
Asp
3.716AspAla: 3.716 ± 2.067
1.652AspCys: 1.652 ± 0.164
2.064AspAsp: 2.064 ± 0.394
4.542AspGlu: 4.542 ± 0.264
0.826AspPhe: 0.826 ± 0.295
2.064AspGly: 2.064 ± 1.115
0.413AspHis: 0.413 ± 0.23
4.129AspIle: 4.129 ± 1.475
1.239AspLys: 1.239 ± 0.82
4.955AspLeu: 4.955 ± 0.261
1.239AspMet: 1.239 ± 0.065
2.064AspAsn: 2.064 ± 0.394
2.477AspPro: 2.477 ± 1.378
2.477AspGln: 2.477 ± 0.624
2.064AspArg: 2.064 ± 0.394
2.064AspSer: 2.064 ± 0.36
2.477AspThr: 2.477 ± 1.639
3.303AspVal: 3.303 ± 0.329
0.0AspTrp: 0.0 ± 0.0
2.064AspTyr: 2.064 ± 0.36
0.0AspXaa: 0.0 ± 0.0
Glu
3.303GluAla: 3.303 ± 0.329
0.413GluCys: 0.413 ± 0.23
1.652GluAsp: 1.652 ± 0.919
4.955GluGlu: 4.955 ± 1.248
2.064GluPhe: 2.064 ± 2.623
2.89GluGly: 2.89 ± 0.099
1.239GluHis: 1.239 ± 0.065
3.303GluIle: 3.303 ± 1.18
2.064GluLys: 2.064 ± 1.115
4.542GluLeu: 4.542 ± 0.264
0.826GluMet: 0.826 ± 0.459
2.477GluAsn: 2.477 ± 0.131
2.89GluPro: 2.89 ± 0.099
2.064GluGln: 2.064 ± 1.149
2.477GluArg: 2.477 ± 0.624
1.652GluSer: 1.652 ± 0.164
2.477GluThr: 2.477 ± 0.885
3.303GluVal: 3.303 ± 1.083
2.89GluTrp: 2.89 ± 2.164
1.239GluTyr: 1.239 ± 0.689
0.0GluXaa: 0.0 ± 0.0
Phe
3.716PheAla: 3.716 ± 1.313
0.413PheCys: 0.413 ± 0.23
4.955PheAsp: 4.955 ± 0.261
2.477PheGlu: 2.477 ± 1.639
0.826PhePhe: 0.826 ± 0.295
2.89PheGly: 2.89 ± 0.655
0.826PheHis: 0.826 ± 0.459
1.239PheIle: 1.239 ± 0.065
2.89PheLys: 2.89 ± 2.164
1.652PheLeu: 1.652 ± 0.59
0.0PheMet: 0.0 ± 0.306
2.477PheAsn: 2.477 ± 0.624
2.064PhePro: 2.064 ± 0.394
0.413PheGln: 0.413 ± 0.525
0.826PheArg: 0.826 ± 0.295
2.477PheSer: 2.477 ± 0.624
1.239PheThr: 1.239 ± 0.689
2.064PheVal: 2.064 ± 1.115
0.826PheTrp: 0.826 ± 0.295
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.78GlyAla: 5.78 ± 2.462
0.0GlyCys: 0.0 ± 0.0
2.477GlyAsp: 2.477 ± 0.131
4.129GlyGlu: 4.129 ± 0.788
2.064GlyPhe: 2.064 ± 1.115
5.78GlyGly: 5.78 ± 0.198
0.413GlyHis: 0.413 ± 0.525
2.064GlyIle: 2.064 ± 1.115
3.716GlyLys: 3.716 ± 0.196
3.303GlyLeu: 3.303 ± 0.425
0.826GlyMet: 0.826 ± 0.295
3.716GlyAsn: 3.716 ± 0.559
3.303GlyPro: 3.303 ± 0.329
3.303GlyGln: 3.303 ± 1.083
2.89GlyArg: 2.89 ± 0.854
2.477GlySer: 2.477 ± 0.131
4.129GlyThr: 4.129 ± 1.543
4.129GlyVal: 4.129 ± 1.543
2.064GlyTrp: 2.064 ± 1.115
1.652GlyTyr: 1.652 ± 1.344
0.0GlyXaa: 0.0 ± 0.0
His
1.652HisAla: 1.652 ± 0.919
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.239HisGlu: 1.239 ± 0.689
0.413HisPhe: 0.413 ± 0.23
1.652HisGly: 1.652 ± 0.59
1.239HisHis: 1.239 ± 0.82
1.652HisIle: 1.652 ± 0.164
1.239HisLys: 1.239 ± 0.065
1.652HisLeu: 1.652 ± 2.099
0.826HisMet: 0.826 ± 0.295
1.239HisAsn: 1.239 ± 0.689
0.826HisPro: 0.826 ± 1.049
0.826HisGln: 0.826 ± 1.049
1.652HisArg: 1.652 ± 0.59
1.239HisSer: 1.239 ± 0.065
0.413HisThr: 0.413 ± 0.23
1.652HisVal: 1.652 ± 0.164
0.0HisTrp: 0.0 ± 0.0
0.413HisTyr: 0.413 ± 0.525
0.0HisXaa: 0.0 ± 0.0
Ile
7.845IleAla: 7.845 ± 3.179
0.413IleCys: 0.413 ± 0.23
2.477IleAsp: 2.477 ± 0.131
2.064IleGlu: 2.064 ± 0.36
1.239IlePhe: 1.239 ± 0.689
2.89IleGly: 2.89 ± 0.655
0.826IleHis: 0.826 ± 0.459
2.477IleIle: 2.477 ± 0.624
4.129IleLys: 4.129 ± 2.229
4.129IleLeu: 4.129 ± 0.72
0.826IleMet: 0.826 ± 0.295
2.89IleAsn: 2.89 ± 0.099
7.019IlePro: 7.019 ± 0.133
1.239IleGln: 1.239 ± 0.82
1.239IleArg: 1.239 ± 0.065
3.303IleSer: 3.303 ± 0.329
3.716IleThr: 3.716 ± 1.313
4.955IleVal: 4.955 ± 0.261
1.652IleTrp: 1.652 ± 0.164
2.064IleTyr: 2.064 ± 1.115
0.0IleXaa: 0.0 ± 0.0
Lys
2.477LysAla: 2.477 ± 0.624
0.0LysCys: 0.0 ± 0.0
2.064LysAsp: 2.064 ± 1.115
3.303LysGlu: 3.303 ± 0.425
2.89LysPhe: 2.89 ± 0.655
2.064LysGly: 2.064 ± 0.36
1.652LysHis: 1.652 ± 1.344
3.303LysIle: 3.303 ± 1.934
1.239LysLys: 1.239 ± 0.689
4.129LysLeu: 4.129 ± 0.788
2.477LysMet: 2.477 ± 0.131
3.716LysAsn: 3.716 ± 1.705
3.716LysPro: 3.716 ± 0.95
2.89LysGln: 2.89 ± 0.655
5.367LysArg: 5.367 ± 3.049
2.477LysSer: 2.477 ± 1.378
3.716LysThr: 3.716 ± 0.196
3.716LysVal: 3.716 ± 1.705
1.652LysTrp: 1.652 ± 0.59
1.652LysTyr: 1.652 ± 0.164
0.0LysXaa: 0.0 ± 0.0
Leu
7.845LeuAla: 7.845 ± 0.162
0.826LeuCys: 0.826 ± 1.049
3.716LeuAsp: 3.716 ± 0.559
2.064LeuGlu: 2.064 ± 0.394
2.064LeuPhe: 2.064 ± 0.36
3.303LeuGly: 3.303 ± 0.329
2.064LeuHis: 2.064 ± 1.115
3.303LeuIle: 3.303 ± 0.425
3.716LeuLys: 3.716 ± 0.95
6.606LeuLeu: 6.606 ± 0.658
2.064LeuMet: 2.064 ± 0.36
3.716LeuAsn: 3.716 ± 1.705
5.367LeuPro: 5.367 ± 0.786
4.955LeuGln: 4.955 ± 2.002
2.064LeuArg: 2.064 ± 1.869
7.432LeuSer: 7.432 ± 1.872
9.496LeuThr: 9.496 ± 2.266
5.367LeuVal: 5.367 ± 0.786
1.652LeuTrp: 1.652 ± 1.344
2.89LeuTyr: 2.89 ± 1.41
0.0LeuXaa: 0.0 ± 0.0
Met
2.477MetAla: 2.477 ± 0.885
1.239MetCys: 1.239 ± 0.689
1.239MetAsp: 1.239 ± 0.689
0.0MetGlu: 0.0 ± 0.0
0.413MetPhe: 0.413 ± 0.23
0.413MetGly: 0.413 ± 0.23
1.652MetHis: 1.652 ± 0.164
2.064MetIle: 2.064 ± 0.36
0.0MetLys: 0.0 ± 0.0
2.064MetLeu: 2.064 ± 0.36
0.826MetMet: 0.826 ± 0.295
2.064MetAsn: 2.064 ± 1.115
0.413MetPro: 0.413 ± 0.23
1.652MetGln: 1.652 ± 0.59
0.826MetArg: 0.826 ± 0.295
1.652MetSer: 1.652 ± 0.919
2.064MetThr: 2.064 ± 0.394
1.239MetVal: 1.239 ± 0.689
0.0MetTrp: 0.0 ± 0.0
0.826MetTyr: 0.826 ± 0.295
0.0MetXaa: 0.0 ± 0.0
Asn
5.78AsnAla: 5.78 ± 0.198
1.239AsnCys: 1.239 ± 0.82
2.477AsnAsp: 2.477 ± 0.885
2.477AsnGlu: 2.477 ± 0.131
2.064AsnPhe: 2.064 ± 0.36
2.89AsnGly: 2.89 ± 0.854
0.413AsnHis: 0.413 ± 0.23
4.129AsnIle: 4.129 ± 1.475
2.477AsnLys: 2.477 ± 0.624
3.716AsnLeu: 3.716 ± 0.196
1.652AsnMet: 1.652 ± 0.59
3.303AsnAsn: 3.303 ± 1.083
4.542AsnPro: 4.542 ± 0.491
3.716AsnGln: 3.716 ± 0.196
3.303AsnArg: 3.303 ± 1.18
4.542AsnSer: 4.542 ± 1.773
4.542AsnThr: 4.542 ± 0.264
3.303AsnVal: 3.303 ± 1.838
0.0AsnTrp: 0.0 ± 0.0
1.652AsnTyr: 1.652 ± 0.59
0.0AsnXaa: 0.0 ± 0.0
Pro
7.845ProAla: 7.845 ± 2.856
0.0ProCys: 0.0 ± 0.0
1.652ProAsp: 1.652 ± 0.919
2.89ProGlu: 2.89 ± 1.41
2.89ProPhe: 2.89 ± 0.655
2.477ProGly: 2.477 ± 0.624
1.239ProHis: 1.239 ± 0.065
4.129ProIle: 4.129 ± 0.788
3.303ProLys: 3.303 ± 1.083
5.78ProLeu: 5.78 ± 0.556
1.239ProMet: 1.239 ± 0.689
3.303ProAsn: 3.303 ± 0.425
3.303ProPro: 3.303 ± 0.329
2.064ProGln: 2.064 ± 0.394
2.064ProArg: 2.064 ± 0.36
5.367ProSer: 5.367 ± 0.031
4.955ProThr: 4.955 ± 0.261
4.955ProVal: 4.955 ± 1.248
0.826ProTrp: 0.826 ± 1.049
1.652ProTyr: 1.652 ± 0.919
0.0ProXaa: 0.0 ± 0.0
Gln
4.955GlnAla: 4.955 ± 2.002
0.413GlnCys: 0.413 ± 0.525
1.239GlnAsp: 1.239 ± 0.82
2.477GlnGlu: 2.477 ± 0.131
0.413GlnPhe: 0.413 ± 0.23
2.064GlnGly: 2.064 ± 0.394
1.239GlnHis: 1.239 ± 0.82
2.064GlnIle: 2.064 ± 0.36
1.652GlnLys: 1.652 ± 0.164
2.89GlnLeu: 2.89 ± 0.099
0.413GlnMet: 0.413 ± 0.525
2.064GlnAsn: 2.064 ± 0.36
2.89GlnPro: 2.89 ± 0.099
2.477GlnGln: 2.477 ± 0.131
2.477GlnArg: 2.477 ± 0.131
3.303GlnSer: 3.303 ± 0.329
4.129GlnThr: 4.129 ± 0.034
2.89GlnVal: 2.89 ± 0.854
0.413GlnTrp: 0.413 ± 0.23
2.89GlnTyr: 2.89 ± 1.608
0.0GlnXaa: 0.0 ± 0.0
Arg
3.716ArgAla: 3.716 ± 0.95
0.826ArgCys: 0.826 ± 1.049
2.064ArgAsp: 2.064 ± 1.115
2.064ArgGlu: 2.064 ± 1.149
1.239ArgPhe: 1.239 ± 0.065
1.652ArgGly: 1.652 ± 0.164
1.652ArgHis: 1.652 ± 0.164
2.064ArgIle: 2.064 ± 0.394
3.303ArgLys: 3.303 ± 1.18
4.129ArgLeu: 4.129 ± 0.72
0.0ArgMet: 0.0 ± 0.0
1.652ArgAsn: 1.652 ± 0.164
2.477ArgPro: 2.477 ± 0.624
1.239ArgGln: 1.239 ± 0.82
3.303ArgArg: 3.303 ± 0.425
2.477ArgSer: 2.477 ± 0.624
2.064ArgThr: 2.064 ± 1.115
2.89ArgVal: 2.89 ± 0.655
1.239ArgTrp: 1.239 ± 0.065
1.239ArgTyr: 1.239 ± 0.065
0.0ArgXaa: 0.0 ± 0.0
Ser
5.367SerAla: 5.367 ± 2.232
1.239SerCys: 1.239 ± 0.689
3.303SerAsp: 3.303 ± 1.083
3.303SerGlu: 3.303 ± 0.329
3.303SerPhe: 3.303 ± 0.329
6.606SerGly: 6.606 ± 0.851
0.826SerHis: 0.826 ± 0.295
2.477SerIle: 2.477 ± 0.885
3.303SerLys: 3.303 ± 1.18
6.606SerLeu: 6.606 ± 0.658
2.477SerMet: 2.477 ± 0.537
4.542SerAsn: 4.542 ± 1.018
3.303SerPro: 3.303 ± 0.329
1.239SerGln: 1.239 ± 0.689
0.826SerArg: 0.826 ± 0.459
6.193SerSer: 6.193 ± 1.835
4.542SerThr: 4.542 ± 2.527
4.129SerVal: 4.129 ± 0.788
1.239SerTrp: 1.239 ± 0.065
2.89SerTyr: 2.89 ± 1.41
0.0SerXaa: 0.0 ± 0.0
Thr
3.716ThrAla: 3.716 ± 2.067
0.413ThrCys: 0.413 ± 0.23
4.955ThrAsp: 4.955 ± 1.015
2.064ThrGlu: 2.064 ± 1.115
2.89ThrPhe: 2.89 ± 0.655
4.129ThrGly: 4.129 ± 0.72
1.239ThrHis: 1.239 ± 0.065
5.78ThrIle: 5.78 ± 0.198
5.367ThrLys: 5.367 ± 0.031
6.606ThrLeu: 6.606 ± 0.097
1.652ThrMet: 1.652 ± 0.919
4.129ThrAsn: 4.129 ± 0.034
2.064ThrPro: 2.064 ± 1.149
4.129ThrGln: 4.129 ± 0.034
1.652ThrArg: 1.652 ± 0.919
5.78ThrSer: 5.78 ± 0.953
7.432ThrThr: 7.432 ± 1.872
5.78ThrVal: 5.78 ± 2.462
2.064ThrTrp: 2.064 ± 1.149
3.303ThrTyr: 3.303 ± 1.18
0.0ThrXaa: 0.0 ± 0.0
Val
6.606ValAla: 6.606 ± 0.658
1.652ValCys: 1.652 ± 0.59
2.477ValAsp: 2.477 ± 0.131
3.716ValGlu: 3.716 ± 0.559
2.89ValPhe: 2.89 ± 0.655
5.78ValGly: 5.78 ± 2.462
0.413ValHis: 0.413 ± 0.525
2.477ValIle: 2.477 ± 0.624
2.89ValLys: 2.89 ± 0.099
4.129ValLeu: 4.129 ± 1.543
2.064ValMet: 2.064 ± 0.36
2.89ValAsn: 2.89 ± 0.099
5.367ValPro: 5.367 ± 1.478
2.477ValGln: 2.477 ± 0.131
3.716ValArg: 3.716 ± 0.196
3.716ValSer: 3.716 ± 0.196
4.955ValThr: 4.955 ± 0.493
4.955ValVal: 4.955 ± 2.757
0.413ValTrp: 0.413 ± 0.23
2.89ValTyr: 2.89 ± 1.608
0.0ValXaa: 0.0 ± 0.0
Trp
1.239TrpAla: 1.239 ± 0.689
0.826TrpCys: 0.826 ± 0.295
1.239TrpAsp: 1.239 ± 0.065
0.413TrpGlu: 0.413 ± 0.23
0.0TrpPhe: 0.0 ± 0.0
0.413TrpGly: 0.413 ± 0.525
0.413TrpHis: 0.413 ± 0.525
0.413TrpIle: 0.413 ± 0.23
0.413TrpLys: 0.413 ± 0.525
2.477TrpLeu: 2.477 ± 0.885
0.413TrpMet: 0.413 ± 0.23
1.652TrpAsn: 1.652 ± 0.59
1.652TrpPro: 1.652 ± 0.164
0.0TrpGln: 0.0 ± 0.0
0.413TrpArg: 0.413 ± 0.23
2.477TrpSer: 2.477 ± 2.394
2.89TrpThr: 2.89 ± 0.655
0.826TrpVal: 0.826 ± 0.295
0.413TrpTrp: 0.413 ± 0.525
1.652TrpTyr: 1.652 ± 0.164
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.542TyrAla: 4.542 ± 1.245
0.413TyrCys: 0.413 ± 0.23
2.89TyrAsp: 2.89 ± 0.655
0.826TyrGlu: 0.826 ± 0.295
1.652TyrPhe: 1.652 ± 0.919
0.826TyrGly: 0.826 ± 0.459
0.826TyrHis: 0.826 ± 0.459
2.89TyrIle: 2.89 ± 1.41
2.89TyrLys: 2.89 ± 0.099
4.129TyrLeu: 4.129 ± 2.229
0.826TyrMet: 0.826 ± 0.295
0.413TyrAsn: 0.413 ± 0.23
1.239TyrPro: 1.239 ± 0.82
3.303TyrGln: 3.303 ± 0.425
0.413TyrArg: 0.413 ± 0.23
4.542TyrSer: 4.542 ± 1.018
2.064TyrThr: 2.064 ± 0.36
0.826TyrVal: 0.826 ± 0.295
0.826TyrTrp: 0.826 ± 0.295
0.826TyrTyr: 0.826 ± 0.459
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2423 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski