Amino acid dipeptide frequency for Changjiang tombus-like virus 18

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
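As an illustration of how such a frequency can be obtained, here is a minimal Python sketch. It assumes that dipeptides are overlapping residue pairs counted within each protein, that the counts are normalized by the total number of dipeptides, and that any non-standard symbol is mapped to X (Xaa). The function name and the toy sequences are hypothetical; the exact procedure used by the database is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are overlapping residue pairs counted within each protein,
    so a pair never spans two different sequences. Any symbol outside the
    20 standard one-letter codes is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of observed dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from this proteome.
freqs = dipeptide_frequencies(["MAAL", "AAU"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

In the toy input, "U" is not one of the 20 standard codes, so the pair "AU" is counted as "AX".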

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
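The arithmetic behind the expected frequency and the per mille unit can be sketched as follows. The function names are hypothetical, and the use of the uniform expectation as the exact threshold for the red/blue marking is an assumption; the two example values (LeuLys and AlaHis) are taken from the table.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=None):
    """Label a table value relative to the uniform expectation.

    Assumption: the threshold is the uniform expectation for 20 residue
    types; the page does not state the exact rule used for the colouring.
    """
    if expected is None:
        expected = expected_per_mille()
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille())      # 20 residue types: 1000 / 400
print(expected_per_mille(21))    # with Xaa as a 21st type: 1000 / 441
print(per_mille_to_fraction(2.698))
print(classify(10.791))          # LeuLys value from the table
print(classify(0.899))           # AlaHis value from the table
```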

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.698AlaAla: 2.698 ± 1.125
0.0AlaCys: 0.0 ± 0.0
3.597AlaAsp: 3.597 ± 1.253
2.698AlaGlu: 2.698 ± 1.022
5.396AlaPhe: 5.396 ± 1.291
5.396AlaGly: 5.396 ± 1.858
0.899AlaHis: 0.899 ± 0.666
5.396AlaIle: 5.396 ± 0.754
3.597AlaLys: 3.597 ± 1.259
7.194AlaLeu: 7.194 ± 2.652
2.698AlaMet: 2.698 ± 0.373
4.496AlaAsn: 4.496 ± 2.416
0.899AlaPro: 0.899 ± 0.666
1.799AlaGln: 1.799 ± 0.871
5.396AlaArg: 5.396 ± 0.901
0.899AlaSer: 0.899 ± 0.593
5.396AlaThr: 5.396 ± 3.07
5.396AlaVal: 5.396 ± 1.858
0.899AlaTrp: 0.899 ± 0.666
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.799CysAsp: 1.799 ± 0.871
0.899CysGlu: 0.899 ± 0.593
0.0CysPhe: 0.0 ± 0.0
1.799CysGly: 1.799 ± 0.626
0.0CysHis: 0.0 ± 0.0
1.799CysIle: 1.799 ± 1.186
0.0CysLys: 0.0 ± 0.0
0.899CysLeu: 0.899 ± 0.593
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.899CysGln: 0.899 ± 0.593
0.899CysArg: 0.899 ± 0.666
0.899CysSer: 0.899 ± 0.593
0.0CysThr: 0.0 ± 0.0
0.899CysVal: 0.899 ± 0.666
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.799AspAla: 1.799 ± 1.332
0.0AspCys: 0.0 ± 0.0
2.698AspAsp: 2.698 ± 1.149
1.799AspGlu: 1.799 ± 1.885
3.597AspPhe: 3.597 ± 1.549
3.597AspGly: 3.597 ± 0.574
2.698AspHis: 2.698 ± 0.317
2.698AspIle: 2.698 ± 1.779
1.799AspLys: 1.799 ± 0.626
2.698AspLeu: 2.698 ± 0.317
1.799AspMet: 1.799 ± 1.332
5.396AspAsn: 5.396 ± 2.387
1.799AspPro: 1.799 ± 0.871
1.799AspGln: 1.799 ± 1.186
4.496AspArg: 4.496 ± 1.07
2.698AspSer: 2.698 ± 1.154
1.799AspThr: 1.799 ± 0.626
1.799AspVal: 1.799 ± 1.332
0.0AspTrp: 0.0 ± 0.0
2.698AspTyr: 2.698 ± 1.154
0.0AspXaa: 0.0 ± 0.0
Glu
2.698GluAla: 2.698 ± 1.125
0.899GluCys: 0.899 ± 0.593
2.698GluAsp: 2.698 ± 1.715
3.597GluGlu: 3.597 ± 2.518
1.799GluPhe: 1.799 ± 0.871
5.396GluGly: 5.396 ± 1.879
0.899GluHis: 0.899 ± 0.593
4.496GluIle: 4.496 ± 1.07
0.899GluLys: 0.899 ± 0.593
0.899GluLeu: 0.899 ± 0.593
0.899GluMet: 0.899 ± 0.942
1.799GluAsn: 1.799 ± 1.186
2.698GluPro: 2.698 ± 1.154
2.698GluGln: 2.698 ± 1.154
0.899GluArg: 0.899 ± 0.593
2.698GluSer: 2.698 ± 1.998
3.597GluThr: 3.597 ± 0.574
5.396GluVal: 5.396 ± 3.224
0.0GluTrp: 0.0 ± 0.0
2.698GluTyr: 2.698 ± 1.779
0.0GluXaa: 0.0 ± 0.0
Phe
2.698PheAla: 2.698 ± 1.779
0.899PheCys: 0.899 ± 0.593
1.799PheAsp: 1.799 ± 0.871
1.799PheGlu: 1.799 ± 0.626
1.799PhePhe: 1.799 ± 0.871
1.799PheGly: 1.799 ± 1.186
0.0PheHis: 0.0 ± 0.0
4.496PheIle: 4.496 ± 2.142
1.799PheLys: 1.799 ± 1.332
1.799PheLeu: 1.799 ± 1.885
1.799PheMet: 1.799 ± 0.871
1.799PheAsn: 1.799 ± 1.186
0.899PhePro: 0.899 ± 0.942
0.899PheGln: 0.899 ± 0.666
8.094PheArg: 8.094 ± 1.575
5.396PheSer: 5.396 ± 2.044
0.0PheThr: 0.0 ± 0.0
0.899PheVal: 0.899 ± 0.593
0.899PheTrp: 0.899 ± 0.666
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
0.899GlyAla: 0.899 ± 0.666
0.0GlyCys: 0.0 ± 0.0
1.799GlyAsp: 1.799 ± 0.626
0.899GlyGlu: 0.899 ± 0.593
0.0GlyPhe: 0.0 ± 0.0
4.496GlyGly: 4.496 ± 1.203
2.698GlyHis: 2.698 ± 1.715
4.496GlyIle: 4.496 ± 1.128
3.597GlyLys: 3.597 ± 1.771
6.295GlyLeu: 6.295 ± 1.553
0.899GlyMet: 0.899 ± 0.666
4.496GlyAsn: 4.496 ± 1.056
0.899GlyPro: 0.899 ± 0.666
3.597GlyGln: 3.597 ± 1.253
1.799GlyArg: 1.799 ± 0.871
7.194GlySer: 7.194 ± 2.865
0.899GlyThr: 0.899 ± 0.666
6.295GlyVal: 6.295 ± 2.33
0.0GlyTrp: 0.0 ± 0.0
2.698GlyTyr: 2.698 ± 0.317
0.0GlyXaa: 0.0 ± 0.0
His
3.597HisAla: 3.597 ± 1.669
0.899HisCys: 0.899 ± 0.666
0.899HisAsp: 0.899 ± 0.593
1.799HisGlu: 1.799 ± 0.871
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
3.597HisLeu: 3.597 ± 1.591
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.899HisArg: 0.899 ± 0.593
4.496HisSer: 4.496 ± 1.07
1.799HisThr: 1.799 ± 1.885
2.698HisVal: 2.698 ± 1.612
0.0HisTrp: 0.0 ± 0.0
0.899HisTyr: 0.899 ± 0.593
0.0HisXaa: 0.0 ± 0.0
Ile
3.597IleAla: 3.597 ± 1.615
0.899IleCys: 0.899 ± 0.593
3.597IleAsp: 3.597 ± 0.521
2.698IleGlu: 2.698 ± 1.154
2.698IlePhe: 2.698 ± 1.149
1.799IleGly: 1.799 ± 0.796
1.799IleHis: 1.799 ± 0.626
3.597IleIle: 3.597 ± 0.574
4.496IleLys: 4.496 ± 2.142
7.194IleLeu: 7.194 ± 1.041
0.0IleMet: 0.0 ± 0.0
3.597IleAsn: 3.597 ± 1.669
3.597IlePro: 3.597 ± 1.771
2.698IleGln: 2.698 ± 1.715
2.698IleArg: 2.698 ± 1.154
5.396IleSer: 5.396 ± 2.044
4.496IleThr: 4.496 ± 2.142
2.698IleVal: 2.698 ± 1.125
0.0IleTrp: 0.0 ± 0.0
1.799IleTyr: 1.799 ± 0.626
0.0IleXaa: 0.0 ± 0.0
Lys
5.396LysAla: 5.396 ± 3.07
0.899LysCys: 0.899 ± 0.593
3.597LysAsp: 3.597 ± 1.741
3.597LysGlu: 3.597 ± 1.741
3.597LysPhe: 3.597 ± 1.741
0.0LysGly: 0.0 ± 0.0
2.698LysHis: 2.698 ± 0.317
4.496LysIle: 4.496 ± 0.312
5.396LysLys: 5.396 ± 2.387
3.597LysLeu: 3.597 ± 1.615
2.698LysMet: 2.698 ± 1.036
3.597LysAsn: 3.597 ± 2.626
2.698LysPro: 2.698 ± 1.022
2.698LysGln: 2.698 ± 1.149
4.496LysArg: 4.496 ± 1.056
4.496LysSer: 4.496 ± 1.957
6.295LysThr: 6.295 ± 1.575
6.295LysVal: 6.295 ± 4.122
0.0LysTrp: 0.0 ± 0.0
3.597LysTyr: 3.597 ± 1.549
0.0LysXaa: 0.0 ± 0.0
Leu
7.194LeuAla: 7.194 ± 1.149
1.799LeuCys: 1.799 ± 0.626
3.597LeuAsp: 3.597 ± 1.771
6.295LeuGlu: 6.295 ± 1.553
0.0LeuPhe: 0.0 ± 0.0
4.496LeuGly: 4.496 ± 1.056
1.799LeuHis: 1.799 ± 0.871
2.698LeuIle: 2.698 ± 0.317
10.791LeuLys: 10.791 ± 1.562
9.892LeuLeu: 9.892 ± 3.671
3.597LeuMet: 3.597 ± 0.521
6.295LeuAsn: 6.295 ± 2.518
2.698LeuPro: 2.698 ± 1.125
3.597LeuGln: 3.597 ± 1.253
5.396LeuArg: 5.396 ± 2.612
8.094LeuSer: 8.094 ± 3.656
4.496LeuThr: 4.496 ± 1.056
2.698LeuVal: 2.698 ± 1.125
0.899LeuTrp: 0.899 ± 0.593
0.899LeuTyr: 0.899 ± 0.666
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.799MetAsp: 1.799 ± 0.626
2.698MetGlu: 2.698 ± 1.022
1.799MetPhe: 1.799 ± 0.796
1.799MetGly: 1.799 ± 0.871
0.0MetHis: 0.0 ± 0.0
1.799MetIle: 1.799 ± 0.796
0.0MetLys: 0.0 ± 0.0
4.496MetLeu: 4.496 ± 0.312
0.899MetMet: 0.899 ± 0.942
0.899MetAsn: 0.899 ± 0.593
2.698MetPro: 2.698 ± 1.715
0.899MetGln: 0.899 ± 0.942
0.0MetArg: 0.0 ± 0.0
3.597MetSer: 3.597 ± 1.253
0.899MetThr: 0.899 ± 0.666
3.597MetVal: 3.597 ± 1.259
0.899MetTrp: 0.899 ± 0.942
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.899AsnAla: 0.899 ± 0.666
0.899AsnCys: 0.899 ± 0.666
0.899AsnAsp: 0.899 ± 0.666
0.0AsnGlu: 0.0 ± 0.0
2.698AsnPhe: 2.698 ± 1.154
5.396AsnGly: 5.396 ± 1.858
1.799AsnHis: 1.799 ± 0.626
1.799AsnIle: 1.799 ± 0.626
4.496AsnLys: 4.496 ± 3.554
3.597AsnLeu: 3.597 ± 0.574
1.799AsnMet: 1.799 ± 1.062
4.496AsnAsn: 4.496 ± 1.727
4.496AsnPro: 4.496 ± 1.831
1.799AsnGln: 1.799 ± 0.626
2.698AsnArg: 2.698 ± 1.612
7.194AsnSer: 7.194 ± 1.441
6.295AsnThr: 6.295 ± 0.938
0.899AsnVal: 0.899 ± 0.666
1.799AsnTrp: 1.799 ± 0.626
0.899AsnTyr: 0.899 ± 0.593
0.0AsnXaa: 0.0 ± 0.0
Pro
7.194ProAla: 7.194 ± 2.243
0.0ProCys: 0.0 ± 0.0
2.698ProAsp: 2.698 ± 0.317
2.698ProGlu: 2.698 ± 1.125
2.698ProPhe: 2.698 ± 1.149
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
1.799ProIle: 1.799 ± 0.871
3.597ProLys: 3.597 ± 1.259
4.496ProLeu: 4.496 ± 1.203
2.698ProMet: 2.698 ± 1.149
1.799ProAsn: 1.799 ± 1.332
1.799ProPro: 1.799 ± 0.626
0.899ProGln: 0.899 ± 0.942
1.799ProArg: 1.799 ± 1.186
2.698ProSer: 2.698 ± 1.612
4.496ProThr: 4.496 ± 1.056
4.496ProVal: 4.496 ± 1.128
0.0ProTrp: 0.0 ± 0.0
1.799ProTyr: 1.799 ± 0.796
0.0ProXaa: 0.0 ± 0.0
Gln
3.597GlnAla: 3.597 ± 0.574
0.899GlnCys: 0.899 ± 0.593
0.0GlnAsp: 0.0 ± 0.0
0.899GlnGlu: 0.899 ± 0.593
3.597GlnPhe: 3.597 ± 1.549
1.799GlnGly: 1.799 ± 0.626
0.899GlnHis: 0.899 ± 0.666
0.0GlnIle: 0.0 ± 0.0
1.799GlnLys: 1.799 ± 0.796
4.496GlnLeu: 4.496 ± 0.312
1.799GlnMet: 1.799 ± 0.871
1.799GlnAsn: 1.799 ± 0.626
3.597GlnPro: 3.597 ± 1.549
1.799GlnGln: 1.799 ± 0.626
0.899GlnArg: 0.899 ± 0.942
0.899GlnSer: 0.899 ± 0.942
4.496GlnThr: 4.496 ± 2.551
3.597GlnVal: 3.597 ± 2.518
0.0GlnTrp: 0.0 ± 0.0
0.899GlnTyr: 0.899 ± 0.593
0.0GlnXaa: 0.0 ± 0.0
Arg
3.597ArgAla: 3.597 ± 1.615
0.899ArgCys: 0.899 ± 0.593
3.597ArgAsp: 3.597 ± 1.591
2.698ArgGlu: 2.698 ± 1.154
0.899ArgPhe: 0.899 ± 0.593
1.799ArgGly: 1.799 ± 0.626
0.0ArgHis: 0.0 ± 0.0
7.194ArgIle: 7.194 ± 2.826
3.597ArgLys: 3.597 ± 0.521
4.496ArgLeu: 4.496 ± 1.727
1.799ArgMet: 1.799 ± 0.871
2.698ArgAsn: 2.698 ± 1.154
1.799ArgPro: 1.799 ± 0.796
7.194ArgGln: 7.194 ± 2.834
2.698ArgArg: 2.698 ± 1.154
6.295ArgSer: 6.295 ± 2.187
3.597ArgThr: 3.597 ± 0.574
0.899ArgVal: 0.899 ± 0.942
0.899ArgTrp: 0.899 ± 0.593
4.496ArgTyr: 4.496 ± 1.203
0.0ArgXaa: 0.0 ± 0.0
Ser
3.597SerAla: 3.597 ± 1.259
0.0SerCys: 0.0 ± 0.0
1.799SerAsp: 1.799 ± 0.626
2.698SerGlu: 2.698 ± 1.154
5.396SerPhe: 5.396 ± 2.697
5.396SerGly: 5.396 ± 2.044
0.0SerHis: 0.0 ± 0.0
6.295SerIle: 6.295 ± 2.91
8.993SerLys: 8.993 ± 2.14
5.396SerLeu: 5.396 ± 1.291
0.899SerMet: 0.899 ± 0.666
2.698SerAsn: 2.698 ± 1.022
8.094SerPro: 8.094 ± 2.345
0.899SerGln: 0.899 ± 0.942
5.396SerArg: 5.396 ± 2.044
8.993SerSer: 8.993 ± 0.843
3.597SerThr: 3.597 ± 2.664
2.698SerVal: 2.698 ± 1.154
0.899SerTrp: 0.899 ± 0.593
6.295SerTyr: 6.295 ± 2.187
0.0SerXaa: 0.0 ± 0.0
Thr
2.698ThrAla: 2.698 ± 1.022
0.899ThrCys: 0.899 ± 0.942
1.799ThrAsp: 1.799 ± 1.186
1.799ThrGlu: 1.799 ± 0.626
0.0ThrPhe: 0.0 ± 0.0
4.496ThrGly: 4.496 ± 1.056
0.899ThrHis: 0.899 ± 0.666
2.698ThrIle: 2.698 ± 0.317
7.194ThrLys: 7.194 ± 2.001
5.396ThrLeu: 5.396 ± 0.754
1.799ThrMet: 1.799 ± 0.796
2.698ThrAsn: 2.698 ± 0.317
5.396ThrPro: 5.396 ± 2.387
0.0ThrGln: 0.0 ± 0.0
7.194ThrArg: 7.194 ± 1.441
5.396ThrSer: 5.396 ± 0.901
8.993ThrThr: 8.993 ± 4.832
10.791ThrVal: 10.791 ± 3.912
0.899ThrTrp: 0.899 ± 0.593
3.597ThrTyr: 3.597 ± 0.521
0.0ThrXaa: 0.0 ± 0.0
Val
8.993ValAla: 8.993 ± 2.406
0.899ValCys: 0.899 ± 0.593
7.194ValAsp: 7.194 ± 1.149
2.698ValGlu: 2.698 ± 1.612
1.799ValPhe: 1.799 ± 0.626
2.698ValGly: 2.698 ± 1.998
2.698ValHis: 2.698 ± 2.827
3.597ValIle: 3.597 ± 1.591
6.295ValLys: 6.295 ± 4.335
3.597ValLeu: 3.597 ± 2.664
1.799ValMet: 1.799 ± 1.885
4.496ValAsn: 4.496 ± 1.128
1.799ValPro: 1.799 ± 0.871
1.799ValGln: 1.799 ± 1.332
0.899ValArg: 0.899 ± 0.942
1.799ValSer: 1.799 ± 0.626
9.892ValThr: 9.892 ± 4.43
2.698ValVal: 2.698 ± 0.317
0.0ValTrp: 0.0 ± 0.0
1.799ValTyr: 1.799 ± 0.796
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.899TrpAsp: 0.899 ± 0.593
1.799TrpGlu: 1.799 ± 0.626
0.899TrpPhe: 0.899 ± 0.593
0.899TrpGly: 0.899 ± 0.666
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
3.597TrpLeu: 3.597 ± 0.521
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.899TrpArg: 0.899 ± 0.593
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.597TyrAla: 3.597 ± 0.521
0.0TyrCys: 0.0 ± 0.0
0.899TyrAsp: 0.899 ± 0.593
3.597TyrGlu: 3.597 ± 0.521
0.899TyrPhe: 0.899 ± 0.593
0.899TyrGly: 0.899 ± 0.593
1.799TyrHis: 1.799 ± 0.796
0.0TyrIle: 0.0 ± 0.0
1.799TyrLys: 1.799 ± 1.186
3.597TyrLeu: 3.597 ± 1.259
0.0TyrMet: 0.0 ± 0.0
2.698TyrAsn: 2.698 ± 1.149
1.799TyrPro: 1.799 ± 1.332
1.799TyrGln: 1.799 ± 0.871
3.597TyrArg: 3.597 ± 1.253
0.899TyrSer: 0.899 ± 0.666
3.597TyrThr: 3.597 ± 2.372
2.698TyrVal: 2.698 ± 1.149
0.899TyrTrp: 0.899 ± 0.593
0.899TyrTyr: 0.899 ± 0.942
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1113 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
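A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each resample, and that the reported error is the standard deviation over 100 resamples. The function names and toy sequences are hypothetical, and the exact scheme used by the database may differ.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) across a list of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by protein-level bootstrap.

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency; the spread of these estimates is returned.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample, pair))
    return stdev(estimates)


# Hypothetical toy sequences, not taken from this proteome.
toy_proteins = ["MAAL", "AAGK", "LKLK"]
print(dipeptide_per_mille(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

With only a few proteins, as here, each resample can differ strongly from the original set, which is one reason the errors can be large relative to the values themselves.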

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski