Amino acid dipeptide frequency for Sanxia Qinvirus-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, the AlaAla value of 13.855‰ corresponds to a fraction of 0.013855.
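The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the use of one-letter sequences, the toy input, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter

# Uniform expectation over the 20 x 20 standard dipeptides: 1/400 = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq, expected=EXPECTED_PERMILLE):
    """Classify a per mille frequency against the uniform expectation."""
    if freq > expected:
        return "over"   # would be marked red in the table
    if freq < expected:
        return "under"  # would be marked blue in the table
    return "as expected"


toy_proteins = ["MAAALK", "MKAA"]  # hypothetical toy sequences, not the viral proteome
freqs = dipeptide_permille(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), representation(freqs[pair]))
```

In the toy input, AA occurs in 3 of the 8 dipeptide positions, so its frequency is 375‰. Applied to the values in the table, 13.855‰ (AlaAla) would be classified as overrepresented and 0.407‰ (AlaCys) as underrepresented relative to 2.5‰.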

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 13.855 ± 0.252 | 0.407 ± 0.192 | 5.297 ± 0.403 | 5.297 ± 1.447 | 3.26 ± 1.532 | 4.075 ± 1.215 | 2.037 ± 0.086 | 3.26 ± 0.554 | 3.26 ± 0.554 | 8.965 ± 2.127 | 4.075 ± 0.872 | 3.667 ± 1.406 | 4.075 ± 1.915 | 2.445 ± 0.106 | 5.705 ± 1.492 | 8.15 ± 1.386 | 6.52 ± 3.195 | 4.89 ± 0.212 | 1.63 ± 0.277 | 3.667 ± 0.363 | 0.0 ± 0.0 |
| Cys | 2.445 ± 0.106 | 0.407 ± 0.192 | 0.0 ± 0.0 | 0.815 ± 0.66 | 1.222 ± 0.575 | 1.63 ± 1.321 | 0.0 ± 0.0 | 1.63 ± 0.766 | 0.0 ± 0.0 | 2.037 ± 1.129 | 0.407 ± 0.192 | 1.222 ± 0.575 | 0.815 ± 0.66 | 0.815 ± 0.383 | 1.222 ± 0.575 | 0.815 ± 0.383 | 2.037 ± 0.086 | 0.815 ± 0.383 | 0.0 ± 0.0 | 2.445 ± 1.149 | 0.0 ± 0.0 |
| Asp | 4.075 ± 0.872 | 2.037 ± 0.958 | 3.667 ± 2.449 | 6.112 ± 1.3 | 1.63 ± 1.321 | 1.63 ± 2.364 | 1.63 ± 0.277 | 3.26 ± 0.554 | 3.26 ± 0.554 | 6.112 ± 2.344 | 2.445 ± 0.24 | 1.63 ± 1.321 | 2.445 ± 0.937 | 2.037 ± 1.129 | 2.445 ± 0.937 | 4.075 ± 0.171 | 2.037 ± 0.958 | 4.482 ± 2.107 | 2.445 ± 0.937 | 2.852 ± 0.297 | 0.0 ± 0.0 |
| Glu | 4.89 ± 0.212 | 1.63 ± 0.766 | 3.667 ± 3.493 | 2.445 ± 0.106 | 2.037 ± 0.958 | 2.037 ± 0.086 | 1.222 ± 0.575 | 2.445 ± 0.937 | 2.852 ± 1.341 | 6.927 ± 3.256 | 1.63 ± 0.277 | 2.445 ± 0.106 | 1.222 ± 0.469 | 0.815 ± 0.383 | 3.26 ± 0.489 | 5.705 ± 1.492 | 2.852 ± 1.341 | 4.482 ± 2.107 | 2.445 ± 1.149 | 2.037 ± 0.958 | 0.0 ± 0.0 |
| Phe | 6.112 ± 0.786 | 1.222 ± 0.575 | 2.852 ± 1.341 | 1.63 ± 0.766 | 2.037 ± 0.958 | 1.63 ± 0.766 | 1.222 ± 0.575 | 2.037 ± 0.958 | 2.852 ± 0.297 | 4.482 ± 1.064 | 0.815 ± 0.66 | 2.037 ± 0.086 | 1.63 ± 0.766 | 3.667 ± 0.68 | 2.445 ± 0.106 | 3.26 ± 0.489 | 2.445 ± 0.106 | 2.037 ± 0.958 | 0.815 ± 0.383 | 0.815 ± 0.383 | 0.0 ± 0.0 |
| Gly | 4.482 ± 2.066 | 0.815 ± 0.383 | 3.26 ± 1.598 | 2.445 ± 1.149 | 2.852 ± 1.341 | 2.037 ± 2.172 | 1.222 ± 0.469 | 1.63 ± 0.277 | 4.075 ± 0.872 | 2.852 ± 1.789 | 1.222 ± 0.575 | 0.815 ± 0.66 | 1.222 ± 0.469 | 0.815 ± 0.383 | 3.667 ± 1.406 | 3.667 ± 1.406 | 2.445 ± 0.937 | 1.222 ± 1.512 | 1.222 ± 0.469 | 1.222 ± 0.575 | 0.0 ± 0.0 |
| His | 1.63 ± 0.766 | 1.222 ± 0.575 | 2.445 ± 0.937 | 1.63 ± 0.766 | 0.815 ± 0.383 | 1.222 ± 0.575 | 0.815 ± 0.66 | 0.407 ± 0.192 | 2.037 ± 0.958 | 2.037 ± 0.086 | 0.815 ± 0.66 | 1.63 ± 0.277 | 0.407 ± 0.192 | 0.407 ± 0.192 | 1.63 ± 0.277 | 0.815 ± 0.383 | 1.222 ± 1.512 | 1.63 ± 0.766 | 0.407 ± 0.852 | 2.037 ± 0.086 | 0.0 ± 0.0 |
| Ile | 4.89 ± 1.875 | 1.63 ± 0.277 | 2.445 ± 0.106 | 1.63 ± 0.277 | 0.815 ± 0.383 | 2.852 ± 0.297 | 2.037 ± 0.086 | 2.037 ± 1.129 | 1.63 ± 0.277 | 4.075 ± 1.215 | 1.222 ± 0.469 | 4.482 ± 0.02 | 1.222 ± 1.512 | 1.63 ± 0.766 | 3.26 ± 0.489 | 4.482 ± 0.02 | 1.63 ± 0.766 | 3.667 ± 0.68 | 0.0 ± 0.0 | 0.815 ± 0.383 | 0.0 ± 0.0 |
| Lys | 3.667 ± 1.406 | 1.222 ± 0.575 | 1.222 ± 0.469 | 3.667 ± 1.406 | 1.63 ± 0.766 | 2.037 ± 2.172 | 0.815 ± 0.383 | 0.815 ± 0.383 | 2.445 ± 1.149 | 2.445 ± 1.149 | 1.222 ± 0.575 | 2.852 ± 0.297 | 1.63 ± 0.766 | 1.222 ± 0.575 | 7.742 ± 5.751 | 5.297 ± 2.49 | 6.112 ± 0.786 | 2.445 ± 0.106 | 0.0 ± 0.0 | 1.222 ± 0.575 | 0.0 ± 0.0 |
| Leu | 7.335 ± 0.318 | 2.445 ± 0.106 | 7.742 ± 1.578 | 4.075 ± 0.872 | 4.89 ± 2.298 | 4.89 ± 1.875 | 2.037 ± 0.086 | 4.89 ± 0.212 | 7.335 ± 0.318 | 8.15 ± 0.701 | 2.852 ± 0.746 | 4.075 ± 2.258 | 4.89 ± 2.298 | 1.63 ± 0.766 | 5.297 ± 0.64 | 6.927 ± 0.126 | 3.26 ± 0.554 | 5.297 ± 0.403 | 0.815 ± 0.383 | 2.852 ± 1.341 | 0.0 ± 0.0 |
| Met | 2.852 ± 0.297 | 0.407 ± 0.192 | 2.445 ± 0.937 | 1.63 ± 0.766 | 2.445 ± 0.106 | 1.222 ± 1.512 | 0.815 ± 0.383 | 0.815 ± 0.383 | 0.407 ± 0.192 | 2.037 ± 0.086 | 0.407 ± 0.192 | 1.63 ± 0.277 | 1.63 ± 0.277 | 0.815 ± 0.66 | 1.63 ± 0.766 | 1.222 ± 0.469 | 0.407 ± 0.852 | 3.667 ± 0.363 | 0.407 ± 0.192 | 0.407 ± 0.192 | 0.0 ± 0.0 |
| Asn | 3.667 ± 1.724 | 0.815 ± 0.66 | 2.445 ± 1.981 | 3.26 ± 0.554 | 2.852 ± 0.297 | 1.63 ± 1.321 | 1.63 ± 0.277 | 4.075 ± 0.872 | 1.63 ± 0.766 | 4.482 ± 0.02 | 0.815 ± 1.275 | 0.815 ± 0.383 | 2.445 ± 0.937 | 1.222 ± 0.575 | 2.037 ± 0.086 | 2.852 ± 0.297 | 2.852 ± 0.746 | 2.037 ± 0.086 | 1.222 ± 0.575 | 2.445 ± 1.981 | 0.0 ± 0.0 |
| Pro | 3.667 ± 1.406 | 0.0 ± 0.0 | 2.445 ± 0.106 | 3.26 ± 1.532 | 2.445 ± 1.149 | 2.037 ± 0.958 | 2.037 ± 0.086 | 1.63 ± 0.277 | 1.63 ± 0.766 | 3.26 ± 2.641 | 0.407 ± 0.192 | 3.26 ± 1.598 | 4.89 ± 1.255 | 1.63 ± 0.766 | 2.852 ± 1.341 | 3.26 ± 0.554 | 2.037 ± 0.958 | 3.26 ± 1.598 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gln | 1.222 ± 0.469 | 0.407 ± 0.192 | 0.815 ± 0.383 | 1.63 ± 0.766 | 1.63 ± 0.766 | 1.63 ± 0.277 | 0.815 ± 0.383 | 1.63 ± 1.321 | 1.222 ± 0.469 | 3.667 ± 1.724 | 0.0 ± 0.0 | 0.407 ± 0.852 | 0.815 ± 0.383 | 1.63 ± 1.321 | 1.63 ± 0.766 | 2.037 ± 0.958 | 1.222 ± 0.469 | 2.445 ± 0.106 | 0.815 ± 0.383 | 1.63 ± 0.766 | 0.0 ± 0.0 |
| Arg | 4.89 ± 0.212 | 1.222 ± 1.512 | 2.852 ± 1.789 | 4.482 ± 1.064 | 2.445 ± 0.106 | 1.222 ± 0.575 | 1.222 ± 0.575 | 4.89 ± 0.832 | 3.667 ± 1.406 | 6.52 ± 1.109 | 1.222 ± 0.575 | 2.445 ± 0.106 | 3.667 ± 0.363 | 1.222 ± 0.575 | 2.852 ± 1.341 | 5.297 ± 1.447 | 4.075 ± 2.258 | 6.112 ± 0.786 | 0.407 ± 0.192 | 1.222 ± 0.469 | 0.0 ± 0.0 |
| Ser | 9.372 ± 0.232 | 2.037 ± 0.958 | 4.89 ± 1.875 | 5.705 ± 1.638 | 4.482 ± 1.023 | 0.815 ± 0.383 | 0.815 ± 0.383 | 3.667 ± 1.724 | 3.26 ± 2.641 | 8.15 ± 3.831 | 2.852 ± 0.297 | 3.667 ± 0.68 | 3.26 ± 0.489 | 1.63 ± 0.277 | 4.075 ± 0.872 | 6.112 ± 3.387 | 4.89 ± 1.875 | 6.112 ± 0.786 | 0.815 ± 0.383 | 3.26 ± 0.554 | 0.0 ± 0.0 |
| Thr | 4.075 ± 0.171 | 0.815 ± 1.704 | 3.667 ± 0.68 | 2.037 ± 0.958 | 3.26 ± 0.489 | 3.667 ± 1.406 | 2.037 ± 0.958 | 3.26 ± 2.641 | 2.852 ± 1.789 | 6.52 ± 0.065 | 2.445 ± 0.937 | 2.852 ± 1.341 | 3.667 ± 1.406 | 1.222 ± 0.469 | 2.037 ± 0.958 | 4.89 ± 0.212 | 2.445 ± 1.981 | 4.075 ± 0.171 | 0.407 ± 0.192 | 2.037 ± 0.086 | 0.0 ± 0.0 |
| Val | 6.112 ± 0.257 | 1.63 ± 0.277 | 4.482 ± 1.064 | 2.852 ± 0.297 | 3.667 ± 1.724 | 3.26 ± 1.532 | 0.815 ± 0.66 | 3.26 ± 0.489 | 2.852 ± 0.297 | 3.667 ± 1.406 | 1.63 ± 0.277 | 2.852 ± 0.746 | 2.037 ± 0.086 | 0.815 ± 0.383 | 4.482 ± 0.02 | 7.335 ± 1.361 | 5.705 ± 1.638 | 2.852 ± 1.341 | 1.63 ± 0.766 | 5.297 ± 1.447 | 0.0 ± 0.0 |
| Trp | 1.63 ± 0.277 | 0.0 ± 0.0 | 1.222 ± 0.575 | 0.815 ± 0.66 | 0.815 ± 0.383 | 2.037 ± 0.086 | 0.0 ± 0.0 | 0.407 ± 0.192 | 0.407 ± 0.852 | 0.815 ± 0.383 | 0.0 ± 0.0 | 0.815 ± 0.383 | 0.815 ± 0.383 | 0.815 ± 0.383 | 1.222 ± 0.469 | 0.815 ± 0.383 | 0.407 ± 0.192 | 2.037 ± 0.086 | 0.407 ± 0.192 | 0.815 ± 0.383 | 0.0 ± 0.0 |
| Tyr | 3.667 ± 0.68 | 0.407 ± 0.192 | 2.852 ± 1.341 | 1.63 ± 0.277 | 0.815 ± 0.66 | 1.63 ± 0.766 | 2.037 ± 1.129 | 0.407 ± 0.852 | 2.037 ± 0.958 | 4.482 ± 0.02 | 0.407 ± 0.192 | 1.63 ± 0.766 | 1.222 ± 0.469 | 0.815 ± 0.66 | 2.445 ± 1.149 | 2.852 ± 1.341 | 3.667 ± 0.68 | 3.667 ± 1.724 | 0.407 ± 0.852 | 0.407 ± 0.192 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2 proteins (2455 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
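As an illustration of what protein-level bootstrapping means, the sketch below resamples whole proteins with replacement 100 times and reports the spread of the resulting dipeptide frequencies. The page does not state which spread statistic is reported, so the use of the standard deviation here is an assumption, as are the function names, the fixed random seed, and the toy sequences.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    # Assumption: report the standard deviation of the bootstrap estimates.
    return statistics.pstdev(estimates)


toy_proteins = ["MAAALK", "MKGG"]  # hypothetical toy sequences
print(round(pair_permille(toy_proteins, "AA"), 3), "±",
      round(bootstrap_error(toy_proteins, "AA"), 3))
```

With only two proteins, as in this proteome, each resample can contain just three distinct compositions (the first protein twice, the second twice, or one of each), so the estimated errors should be read with that small sample in mind.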

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski