Amino acid dipepetide frequency for Phasi Charoen-like phasivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.15AlaAla: 2.15 ± 3.454
1.075AlaCys: 1.075 ± 1.054
2.956AlaAsp: 2.956 ± 1.217
3.762AlaGlu: 3.762 ± 1.003
0.806AlaPhe: 0.806 ± 0.219
2.956AlaGly: 2.956 ± 0.27
1.881AlaHis: 1.881 ± 1.101
4.031AlaIle: 4.031 ± 1.252
4.031AlaLys: 4.031 ± 0.08
5.375AlaLeu: 5.375 ± 2.632
0.806AlaMet: 0.806 ± 0.419
3.762AlaAsn: 3.762 ± 1.214
0.269AlaPro: 0.269 ± 0.16
1.075AlaGln: 1.075 ± 0.68
2.15AlaArg: 2.15 ± 1.653
5.912AlaSer: 5.912 ± 0.821
3.225AlaThr: 3.225 ± 0.14
4.031AlaVal: 4.031 ± 2.973
0.537AlaTrp: 0.537 ± 0.182
1.344AlaTyr: 1.344 ± 0.595
0.0AlaXaa: 0.0 ± 0.0
Cys
1.344CysAla: 1.344 ± 1.314
0.0CysCys: 0.0 ± 0.0
0.806CysAsp: 0.806 ± 0.788
1.612CysGlu: 1.612 ± 1.577
1.075CysPhe: 1.075 ± 0.68
1.075CysGly: 1.075 ± 0.363
0.806CysHis: 0.806 ± 0.219
0.537CysIle: 0.537 ± 0.526
1.881CysLys: 1.881 ± 0.771
1.881CysLeu: 1.881 ± 0.893
0.269CysMet: 0.269 ± 0.16
0.269CysAsn: 0.269 ± 0.263
1.344CysPro: 1.344 ± 0.595
0.269CysGln: 0.269 ± 0.263
1.344CysArg: 1.344 ± 0.94
2.687CysSer: 2.687 ± 2.628
0.806CysThr: 0.806 ± 0.788
1.075CysVal: 1.075 ± 0.68
0.0CysTrp: 0.0 ± 0.0
0.537CysTyr: 0.537 ± 0.526
0.0CysXaa: 0.0 ± 0.0
Asp
3.494AspAla: 3.494 ± 0.287
1.344AspCys: 1.344 ± 0.94
3.225AspAsp: 3.225 ± 1.267
4.569AspGlu: 4.569 ± 1.23
4.031AspPhe: 4.031 ± 0.41
1.612AspGly: 1.612 ± 0.545
1.075AspHis: 1.075 ± 0.339
5.106AspIle: 5.106 ± 1.728
3.494AspLys: 3.494 ± 1.155
6.45AspLeu: 6.45 ± 0.28
0.537AspMet: 0.537 ± 0.32
2.15AspAsn: 2.15 ± 1.015
1.344AspPro: 1.344 ± 0.37
0.806AspGln: 0.806 ± 0.798
1.344AspArg: 1.344 ± 0.801
2.419AspSer: 2.419 ± 0.359
0.806AspThr: 0.806 ± 0.48
3.494AspVal: 3.494 ± 2.085
0.537AspTrp: 0.537 ± 0.32
2.15AspTyr: 2.15 ± 0.726
0.0AspXaa: 0.0 ± 0.0
Glu
3.762GluAla: 3.762 ± 1.079
0.269GluCys: 0.269 ± 0.263
4.3GluAsp: 4.3 ± 0.659
7.525GluGlu: 7.525 ± 1.788
4.031GluPhe: 4.031 ± 0.802
2.956GluGly: 2.956 ± 1.115
0.806GluHis: 0.806 ± 0.219
5.106GluIle: 5.106 ± 1.768
3.762GluLys: 3.762 ± 0.136
8.331GluLeu: 8.331 ± 2.574
2.15GluMet: 2.15 ± 0.482
3.494GluAsn: 3.494 ± 0.91
1.075GluPro: 1.075 ± 0.641
3.762GluGln: 3.762 ± 1.299
3.225GluArg: 3.225 ± 1.69
4.031GluSer: 4.031 ± 1.446
4.3GluThr: 4.3 ± 0.937
5.644GluVal: 5.644 ± 1.527
1.344GluTrp: 1.344 ± 0.37
1.881GluTyr: 1.881 ± 0.669
0.0GluXaa: 0.0 ± 0.0
Phe
1.075PheAla: 1.075 ± 1.054
0.0PheCys: 0.0 ± 0.0
2.15PheAsp: 2.15 ± 0.58
3.225PheGlu: 3.225 ± 1.562
1.344PhePhe: 1.344 ± 0.595
2.419PheGly: 2.419 ± 1.377
0.806PheHis: 0.806 ± 0.219
1.881PheIle: 1.881 ± 1.121
3.762PheLys: 3.762 ± 1.299
3.225PheLeu: 3.225 ± 1.158
1.344PheMet: 1.344 ± 0.663
3.762PheAsn: 3.762 ± 1.577
1.881PhePro: 1.881 ± 3.533
1.344PheGln: 1.344 ± 0.85
4.569PheArg: 4.569 ± 0.247
3.762PheSer: 3.762 ± 1.095
2.956PheThr: 2.956 ± 0.795
1.881PheVal: 1.881 ± 0.669
1.344PheTrp: 1.344 ± 0.37
1.881PheTyr: 1.881 ± 0.771
0.0PheXaa: 0.0 ± 0.0
Gly
1.344GlyAla: 1.344 ± 0.482
1.344GlyCys: 1.344 ± 1.314
2.956GlyAsp: 2.956 ± 1.42
2.419GlyGlu: 2.419 ± 1.267
3.225GlyPhe: 3.225 ± 0.878
2.419GlyGly: 2.419 ± 0.359
0.806GlyHis: 0.806 ± 0.48
2.687GlyIle: 2.687 ± 0.919
4.569GlyLys: 4.569 ± 1.309
5.912GlyLeu: 5.912 ± 1.165
1.075GlyMet: 1.075 ± 0.363
2.956GlyAsn: 2.956 ± 1.217
1.075GlyPro: 1.075 ± 1.051
2.419GlyGln: 2.419 ± 0.359
2.687GlyArg: 2.687 ± 1.326
2.956GlySer: 2.956 ± 0.603
3.762GlyThr: 3.762 ± 4.035
4.837GlyVal: 4.837 ± 1.899
0.537GlyTrp: 0.537 ± 0.32
1.075GlyTyr: 1.075 ± 0.339
0.0GlyXaa: 0.0 ± 0.0
His
1.075HisAla: 1.075 ± 0.363
1.075HisCys: 1.075 ± 0.363
1.075HisAsp: 1.075 ± 0.363
1.612HisGlu: 1.612 ± 0.439
0.537HisPhe: 0.537 ± 0.182
2.419HisGly: 2.419 ± 0.818
0.269HisHis: 0.269 ± 0.263
0.806HisIle: 0.806 ± 0.219
1.344HisLys: 1.344 ± 0.37
1.344HisLeu: 1.344 ± 0.482
0.0HisMet: 0.0 ± 0.0
0.269HisAsn: 0.269 ± 0.16
1.344HisPro: 1.344 ± 0.482
1.344HisGln: 1.344 ± 0.37
1.344HisArg: 1.344 ± 0.482
1.881HisSer: 1.881 ± 0.548
0.806HisThr: 0.806 ± 0.788
1.344HisVal: 1.344 ± 0.37
0.269HisTrp: 0.269 ± 0.16
1.612HisTyr: 1.612 ± 0.439
0.0HisXaa: 0.0 ± 0.0
Ile
2.687IleAla: 2.687 ± 0.556
0.806IleCys: 0.806 ± 0.48
4.569IleAsp: 4.569 ± 1.493
5.106IleGlu: 5.106 ± 0.686
2.15IlePhe: 2.15 ± 1.653
5.106IleGly: 5.106 ± 1.613
1.344IleHis: 1.344 ± 0.37
6.45IleIle: 6.45 ± 1.157
4.837IleLys: 4.837 ± 0.299
8.331IleLeu: 8.331 ± 1.105
1.075IleMet: 1.075 ± 0.641
4.569IleAsn: 4.569 ± 1.748
3.762IlePro: 3.762 ± 0.532
2.419IleGln: 2.419 ± 0.359
3.494IleArg: 3.494 ± 0.948
6.45IleSer: 6.45 ± 2.179
4.3IleThr: 4.3 ± 0.659
3.762IleVal: 3.762 ± 2.031
0.269IleTrp: 0.269 ± 0.16
1.344IleTyr: 1.344 ± 0.901
0.0IleXaa: 0.0 ± 0.0
Lys
2.956LysAla: 2.956 ± 0.309
2.15LysCys: 2.15 ± 2.103
3.494LysAsp: 3.494 ± 0.133
5.375LysGlu: 5.375 ± 2.21
2.15LysPhe: 2.15 ± 0.726
3.225LysGly: 3.225 ± 0.878
1.075LysHis: 1.075 ± 0.641
5.375LysIle: 5.375 ± 1.478
2.687LysLys: 2.687 ± 0.763
9.944LysLeu: 9.944 ± 2.578
1.344LysMet: 1.344 ± 0.85
1.612LysAsn: 1.612 ± 0.439
3.225LysPro: 3.225 ± 1.016
1.344LysGln: 1.344 ± 0.482
2.687LysArg: 2.687 ± 1.261
3.494LysSer: 3.494 ± 1.61
4.569LysThr: 4.569 ± 1.309
3.762LysVal: 3.762 ± 0.927
1.344LysTrp: 1.344 ± 0.37
2.956LysTyr: 2.956 ± 1.42
0.0LysXaa: 0.0 ± 0.0
Leu
9.406LeuAla: 9.406 ± 3.513
1.612LeuCys: 1.612 ± 1.202
2.956LeuAsp: 2.956 ± 0.309
4.837LeuGlu: 4.837 ± 1.635
5.106LeuPhe: 5.106 ± 2.192
4.837LeuGly: 4.837 ± 0.882
2.687LeuHis: 2.687 ± 0.763
8.331LeuIle: 8.331 ± 1.276
4.3LeuLys: 4.3 ± 1.596
8.331LeuLeu: 8.331 ± 1.415
2.15LeuMet: 2.15 ± 0.472
6.181LeuAsn: 6.181 ± 1.429
3.762LeuPro: 3.762 ± 0.532
3.762LeuGln: 3.762 ± 1.577
5.644LeuArg: 5.644 ± 1.517
11.825LeuSer: 11.825 ± 1.877
7.256LeuThr: 7.256 ± 1.227
4.837LeuVal: 4.837 ± 1.571
1.344LeuTrp: 1.344 ± 0.801
4.3LeuTyr: 4.3 ± 0.566
0.0LeuXaa: 0.0 ± 0.0
Met
1.612MetAla: 1.612 ± 1.583
0.806MetCys: 0.806 ± 0.48
0.806MetAsp: 0.806 ± 0.219
1.075MetGlu: 1.075 ± 0.716
1.075MetPhe: 1.075 ± 0.339
1.612MetGly: 1.612 ± 0.439
0.537MetHis: 0.537 ± 0.182
2.15MetIle: 2.15 ± 0.472
1.344MetLys: 1.344 ± 0.85
2.15MetLeu: 2.15 ± 1.477
0.806MetMet: 0.806 ± 0.219
1.881MetAsn: 1.881 ± 0.893
0.537MetPro: 0.537 ± 0.32
1.075MetGln: 1.075 ± 0.81
1.075MetArg: 1.075 ± 0.641
1.075MetSer: 1.075 ± 0.363
0.537MetThr: 0.537 ± 0.32
0.806MetVal: 0.806 ± 0.219
0.0MetTrp: 0.0 ± 0.0
0.537MetTyr: 0.537 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
2.687AsnAla: 2.687 ± 0.421
1.612AsnCys: 1.612 ± 1.202
3.494AsnAsp: 3.494 ± 3.081
4.031AsnGlu: 4.031 ± 1.125
2.956AsnPhe: 2.956 ± 0.656
1.881AsnGly: 1.881 ± 1.101
2.15AsnHis: 2.15 ± 0.58
3.225AsnIle: 3.225 ± 0.394
3.494AsnLys: 3.494 ± 0.91
5.912AsnLeu: 5.912 ± 0.624
0.537AsnMet: 0.537 ± 0.32
2.687AsnAsn: 2.687 ± 0.556
2.15AsnPro: 2.15 ± 0.677
2.15AsnGln: 2.15 ± 0.468
1.881AsnArg: 1.881 ± 0.607
4.031AsnSer: 4.031 ± 0.802
3.494AsnThr: 3.494 ± 0.678
4.569AsnVal: 4.569 ± 0.922
1.612AsnTrp: 1.612 ± 0.439
1.881AsnTyr: 1.881 ± 0.789
0.0AsnXaa: 0.0 ± 0.0
Pro
0.537ProAla: 0.537 ± 0.526
0.269ProCys: 0.269 ± 0.263
2.687ProAsp: 2.687 ± 0.919
2.687ProGlu: 2.687 ± 1.261
1.612ProPhe: 1.612 ± 0.579
1.344ProGly: 1.344 ± 0.482
1.075ProHis: 1.075 ± 0.363
2.419ProIle: 2.419 ± 0.658
0.806ProLys: 0.806 ± 0.48
3.494ProLeu: 3.494 ± 0.948
0.806ProMet: 0.806 ± 0.48
0.269ProAsn: 0.269 ± 0.263
0.537ProPro: 0.537 ± 0.32
1.344ProGln: 1.344 ± 0.663
3.494ProArg: 3.494 ± 2.158
3.494ProSer: 3.494 ± 1.357
0.806ProThr: 0.806 ± 0.422
2.956ProVal: 2.956 ± 1.372
0.537ProTrp: 0.537 ± 0.182
1.344ProTyr: 1.344 ± 0.801
0.0ProXaa: 0.0 ± 0.0
Gln
1.881GlnAla: 1.881 ± 0.502
1.344GlnCys: 1.344 ± 0.94
1.344GlnAsp: 1.344 ± 0.595
2.15GlnGlu: 2.15 ± 1.281
1.075GlnPhe: 1.075 ± 0.363
2.15GlnGly: 2.15 ± 1.495
1.075GlnHis: 1.075 ± 0.641
2.419GlnIle: 2.419 ± 0.818
2.15GlnLys: 2.15 ± 0.58
1.612GlnLeu: 1.612 ± 1.666
1.344GlnMet: 1.344 ± 0.663
2.956GlnAsn: 2.956 ± 1.259
0.0GlnPro: 0.0 ± 0.0
1.075GlnGln: 1.075 ± 0.641
1.881GlnArg: 1.881 ± 0.502
4.031GlnSer: 4.031 ± 0.802
1.881GlnThr: 1.881 ± 0.502
1.075GlnVal: 1.075 ± 0.339
0.0GlnTrp: 0.0 ± 0.0
0.806GlnTyr: 0.806 ± 0.48
0.0GlnXaa: 0.0 ± 0.0
Arg
2.956ArgAla: 2.956 ± 1.414
1.075ArgCys: 1.075 ± 0.363
1.612ArgAsp: 1.612 ± 0.439
3.225ArgGlu: 3.225 ± 0.394
2.956ArgPhe: 2.956 ± 1.217
4.031ArgGly: 4.031 ± 2.937
0.806ArgHis: 0.806 ± 0.788
4.837ArgIle: 4.837 ± 0.882
4.569ArgLys: 4.569 ± 1.275
4.031ArgLeu: 4.031 ± 1.221
1.344ArgMet: 1.344 ± 1.649
4.569ArgAsn: 4.569 ± 1.432
1.344ArgPro: 1.344 ± 0.85
1.612ArgGln: 1.612 ± 0.579
3.494ArgArg: 3.494 ± 0.133
4.569ArgSer: 4.569 ± 0.824
2.687ArgThr: 2.687 ± 1.326
4.837ArgVal: 4.837 ± 1.154
0.537ArgTrp: 0.537 ± 0.32
1.344ArgTyr: 1.344 ± 0.37
0.0ArgXaa: 0.0 ± 0.0
Ser
5.375SerAla: 5.375 ± 0.641
2.15SerCys: 2.15 ± 2.103
2.956SerAsp: 2.956 ± 0.893
6.181SerGlu: 6.181 ± 1.748
3.494SerPhe: 3.494 ± 2.158
4.569SerGly: 4.569 ± 1.309
1.612SerHis: 1.612 ± 0.545
4.837SerIle: 4.837 ± 0.299
6.45SerLys: 6.45 ± 1.755
6.987SerLeu: 6.987 ± 2.31
1.612SerMet: 1.612 ± 0.845
5.106SerAsn: 5.106 ± 0.637
2.956SerPro: 2.956 ± 1.115
2.15SerGln: 2.15 ± 0.945
5.375SerArg: 5.375 ± 1.569
6.181SerSer: 6.181 ± 2.05
4.3SerThr: 4.3 ± 2.865
5.912SerVal: 5.912 ± 1.205
0.269SerTrp: 0.269 ± 0.16
3.225SerTyr: 3.225 ± 0.878
0.0SerXaa: 0.0 ± 0.0
Thr
3.225ThrAla: 3.225 ± 1.158
0.806ThrCys: 0.806 ± 0.788
3.225ThrAsp: 3.225 ± 0.14
3.762ThrGlu: 3.762 ± 0.136
1.612ThrPhe: 1.612 ± 0.961
3.494ThrGly: 3.494 ± 0.287
0.537ThrHis: 0.537 ± 0.32
3.762ThrIle: 3.762 ± 1.787
2.956ThrLys: 2.956 ± 0.795
7.256ThrLeu: 7.256 ± 2.117
1.344ThrMet: 1.344 ± 0.901
4.031ThrAsn: 4.031 ± 1.076
2.15ThrPro: 2.15 ± 0.677
1.344ThrGln: 1.344 ± 1.649
3.762ThrArg: 3.762 ± 1.214
4.569ThrSer: 4.569 ± 1.496
3.225ThrThr: 3.225 ± 0.777
5.106ThrVal: 5.106 ± 2.735
0.0ThrTrp: 0.0 ± 0.0
1.075ThrTyr: 1.075 ± 0.363
0.0ThrXaa: 0.0 ± 0.0
Val
3.762ValAla: 3.762 ± 2.004
1.075ValCys: 1.075 ± 1.051
2.419ValAsp: 2.419 ± 0.658
4.569ValGlu: 4.569 ± 0.247
2.687ValPhe: 2.687 ± 1.371
1.612ValGly: 1.612 ± 1.583
1.075ValHis: 1.075 ± 0.363
6.45ValIle: 6.45 ± 1.658
3.494ValLys: 3.494 ± 1.072
7.256ValLeu: 7.256 ± 1.439
1.344ValMet: 1.344 ± 0.595
3.494ValAsn: 3.494 ± 0.287
2.419ValPro: 2.419 ± 1.694
2.15ValGln: 2.15 ± 0.677
5.106ValArg: 5.106 ± 1.684
6.45ValSer: 6.45 ± 1.896
5.375ValThr: 5.375 ± 0.842
5.375ValVal: 5.375 ± 1.71
0.537ValTrp: 0.537 ± 0.32
1.075ValTyr: 1.075 ± 0.363
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.269TrpCys: 0.269 ± 0.16
0.537TrpAsp: 0.537 ± 0.182
1.344TrpGlu: 1.344 ± 0.482
0.806TrpPhe: 0.806 ± 0.219
0.537TrpGly: 0.537 ± 0.526
0.0TrpHis: 0.0 ± 0.0
0.806TrpIle: 0.806 ± 0.48
0.806TrpLys: 0.806 ± 0.219
1.881TrpLeu: 1.881 ± 0.789
0.0TrpMet: 0.0 ± 0.0
0.537TrpAsn: 0.537 ± 0.32
0.537TrpPro: 0.537 ± 0.182
0.269TrpGln: 0.269 ± 0.16
0.537TrpArg: 0.537 ± 0.32
0.806TrpSer: 0.806 ± 0.219
0.537TrpThr: 0.537 ± 0.32
0.806TrpVal: 0.806 ± 0.219
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.075TyrAla: 1.075 ± 0.68
0.537TyrCys: 0.537 ± 0.526
2.419TyrAsp: 2.419 ± 0.818
2.687TyrGlu: 2.687 ± 0.964
1.881TyrPhe: 1.881 ± 0.539
0.806TyrGly: 0.806 ± 0.219
1.344TyrHis: 1.344 ± 0.801
1.612TyrIle: 1.612 ± 0.439
4.031TyrLys: 4.031 ± 1.097
3.762TyrLeu: 3.762 ± 0.655
1.344TyrMet: 1.344 ± 0.351
1.881TyrAsn: 1.881 ± 0.502
0.806TyrPro: 0.806 ± 0.219
0.537TyrGln: 0.537 ± 0.87
1.344TyrArg: 1.344 ± 0.595
1.075TyrSer: 1.075 ± 0.339
1.612TyrThr: 1.612 ± 0.439
1.612TyrVal: 1.612 ± 0.545
0.0TyrTrp: 0.0 ± 0.0
0.806TyrTyr: 0.806 ± 0.219
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3722 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski