Amino acid dipepetide frequency for Delphinus delphis polyomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.304AlaAla: 6.304 ± 2.435
0.573AlaCys: 0.573 ± 0.379
3.438AlaAsp: 3.438 ± 0.98
2.865AlaGlu: 2.865 ± 0.77
2.292AlaPhe: 2.292 ± 1.028
2.292AlaGly: 2.292 ± 1.474
0.0AlaHis: 0.0 ± 0.0
5.731AlaIle: 5.731 ± 1.986
4.011AlaLys: 4.011 ± 0.93
10.888AlaLeu: 10.888 ± 3.879
1.719AlaMet: 1.719 ± 0.982
5.158AlaAsn: 5.158 ± 0.863
3.438AlaPro: 3.438 ± 1.476
2.292AlaGln: 2.292 ± 0.918
6.304AlaArg: 6.304 ± 2.531
6.304AlaSer: 6.304 ± 1.391
2.292AlaThr: 2.292 ± 0.999
5.731AlaVal: 5.731 ± 1.539
1.719AlaTrp: 1.719 ± 0.694
1.146AlaTyr: 1.146 ± 0.731
0.0AlaXaa: 0.0 ± 0.0
Cys
0.573CysAla: 0.573 ± 0.379
0.0CysCys: 0.0 ± 0.0
1.719CysAsp: 1.719 ± 1.136
1.719CysGlu: 1.719 ± 0.607
1.719CysPhe: 1.719 ± 2.141
1.146CysGly: 1.146 ± 0.502
0.573CysHis: 0.573 ± 0.714
4.011CysIle: 4.011 ± 2.651
2.292CysLys: 2.292 ± 0.944
3.438CysLeu: 3.438 ± 2.757
0.0CysMet: 0.0 ± 0.0
2.865CysAsn: 2.865 ± 1.576
0.573CysPro: 0.573 ± 0.379
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.146CysSer: 1.146 ± 0.502
0.573CysThr: 0.573 ± 0.379
1.146CysVal: 1.146 ± 0.757
0.573CysTrp: 0.573 ± 0.556
1.719CysTyr: 1.719 ± 0.689
0.0CysXaa: 0.0 ± 0.0
Asp
5.731AspAla: 5.731 ± 1.01
2.292AspCys: 2.292 ± 1.174
3.438AspAsp: 3.438 ± 1.058
4.011AspGlu: 4.011 ± 2.279
2.865AspPhe: 2.865 ± 1.894
2.292AspGly: 2.292 ± 1.004
0.573AspHis: 0.573 ± 0.379
1.719AspIle: 1.719 ± 0.694
3.438AspLys: 3.438 ± 1.842
2.865AspLeu: 2.865 ± 0.571
2.292AspMet: 2.292 ± 1.028
2.292AspAsn: 2.292 ± 0.999
5.731AspPro: 5.731 ± 1.386
0.573AspGln: 0.573 ± 0.379
0.573AspArg: 0.573 ± 0.556
1.146AspSer: 1.146 ± 0.502
2.292AspThr: 2.292 ± 0.829
2.292AspVal: 2.292 ± 0.492
0.0AspTrp: 0.0 ± 0.0
2.865AspTyr: 2.865 ± 1.289
0.0AspXaa: 0.0 ± 0.0
Glu
5.158GluAla: 5.158 ± 2.693
2.292GluCys: 2.292 ± 1.004
3.438GluAsp: 3.438 ± 1.388
5.158GluGlu: 5.158 ± 1.647
1.146GluPhe: 1.146 ± 0.502
4.011GluGly: 4.011 ± 1.309
0.0GluHis: 0.0 ± 0.0
0.573GluIle: 0.573 ± 0.477
4.011GluLys: 4.011 ± 1.255
8.023GluLeu: 8.023 ± 1.211
0.0GluMet: 0.0 ± 0.0
5.158GluAsn: 5.158 ± 0.77
0.573GluPro: 0.573 ± 0.556
2.865GluGln: 2.865 ± 0.799
2.865GluArg: 2.865 ± 0.799
5.158GluSer: 5.158 ± 1.482
6.304GluThr: 6.304 ± 1.171
4.011GluVal: 4.011 ± 1.302
2.865GluTrp: 2.865 ± 1.289
1.719GluTyr: 1.719 ± 0.607
0.0GluXaa: 0.0 ± 0.0
Phe
1.719PheAla: 1.719 ± 0.694
3.438PheCys: 3.438 ± 1.266
1.719PheAsp: 1.719 ± 0.607
2.865PheGlu: 2.865 ± 1.342
0.573PhePhe: 0.573 ± 0.556
4.585PheGly: 4.585 ± 1.196
1.719PheHis: 1.719 ± 0.989
1.146PheIle: 1.146 ± 0.502
2.292PheLys: 2.292 ± 1.174
2.865PheLeu: 2.865 ± 0.969
1.146PheMet: 1.146 ± 0.731
1.146PheAsn: 1.146 ± 0.757
4.011PhePro: 4.011 ± 0.696
0.573PheGln: 0.573 ± 0.714
1.146PheArg: 1.146 ± 0.502
1.146PheSer: 1.146 ± 0.757
1.719PheThr: 1.719 ± 0.819
0.573PheVal: 0.573 ± 0.556
1.146PheTrp: 1.146 ± 0.731
0.573PheTyr: 0.573 ± 0.714
0.0PheXaa: 0.0 ± 0.0
Gly
9.169GlyAla: 9.169 ± 4.36
1.146GlyCys: 1.146 ± 0.716
4.011GlyAsp: 4.011 ± 0.557
2.865GlyGlu: 2.865 ± 0.92
0.573GlyPhe: 0.573 ± 0.477
5.731GlyGly: 5.731 ± 0.831
0.0GlyHis: 0.0 ± 0.0
5.158GlyIle: 5.158 ± 1.62
4.011GlyLys: 4.011 ± 1.597
10.888GlyLeu: 10.888 ± 1.807
1.146GlyMet: 1.146 ± 0.757
1.719GlyAsn: 1.719 ± 0.752
2.292GlyPro: 2.292 ± 1.525
3.438GlyGln: 3.438 ± 2.131
2.292GlyArg: 2.292 ± 1.038
2.865GlySer: 2.865 ± 0.62
2.865GlyThr: 2.865 ± 2.071
8.023GlyVal: 8.023 ± 1.54
0.0GlyTrp: 0.0 ± 0.0
1.719GlyTyr: 1.719 ± 0.752
0.0GlyXaa: 0.0 ± 0.0
His
1.719HisAla: 1.719 ± 0.689
0.573HisCys: 0.573 ± 0.714
0.0HisAsp: 0.0 ± 0.0
1.719HisGlu: 1.719 ± 0.895
0.573HisPhe: 0.573 ± 0.379
0.0HisGly: 0.0 ± 0.0
1.146HisHis: 1.146 ± 0.502
0.573HisIle: 0.573 ± 0.477
1.719HisLys: 1.719 ± 0.689
1.719HisLeu: 1.719 ± 0.694
1.146HisMet: 1.146 ± 0.991
0.573HisAsn: 0.573 ± 0.379
2.292HisPro: 2.292 ± 0.811
0.0HisGln: 0.0 ± 0.0
1.146HisArg: 1.146 ± 0.757
1.719HisSer: 1.719 ± 0.694
0.0HisThr: 0.0 ± 0.0
1.146HisVal: 1.146 ± 0.757
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.719IleAla: 1.719 ± 1.061
0.573IleCys: 0.573 ± 0.379
3.438IleAsp: 3.438 ± 1.066
1.719IleGlu: 1.719 ± 0.78
1.146IlePhe: 1.146 ± 0.503
1.719IleGly: 1.719 ± 0.488
1.146IleHis: 1.146 ± 0.502
3.438IleIle: 3.438 ± 0.608
3.438IleLys: 3.438 ± 1.375
8.023IleLeu: 8.023 ± 1.944
0.573IleMet: 0.573 ± 0.379
2.865IleAsn: 2.865 ± 1.15
2.292IlePro: 2.292 ± 1.079
2.292IleGln: 2.292 ± 0.918
0.0IleArg: 0.0 ± 0.0
2.865IleSer: 2.865 ± 0.948
6.304IleThr: 6.304 ± 2.26
2.865IleVal: 2.865 ± 0.82
0.0IleTrp: 0.0 ± 0.0
1.719IleTyr: 1.719 ± 1.136
0.0IleXaa: 0.0 ± 0.0
Lys
2.292LysAla: 2.292 ± 0.7
1.146LysCys: 1.146 ± 0.757
2.292LysAsp: 2.292 ± 0.999
2.292LysGlu: 2.292 ± 0.811
1.146LysPhe: 1.146 ± 0.502
4.585LysGly: 4.585 ± 1.476
2.865LysHis: 2.865 ± 1.894
2.292LysIle: 2.292 ± 1.174
5.158LysLys: 5.158 ± 1.852
5.731LysLeu: 5.731 ± 2.426
1.719LysMet: 1.719 ± 1.136
1.719LysAsn: 1.719 ± 0.989
1.719LysPro: 1.719 ± 1.378
0.573LysGln: 0.573 ± 0.379
6.304LysArg: 6.304 ± 1.422
4.585LysSer: 4.585 ± 0.914
8.023LysThr: 8.023 ± 0.962
1.146LysVal: 1.146 ± 0.502
0.0LysTrp: 0.0 ± 0.0
2.292LysTyr: 2.292 ± 1.174
0.0LysXaa: 0.0 ± 0.0
Leu
6.304LeuAla: 6.304 ± 1.392
1.719LeuCys: 1.719 ± 0.689
6.877LeuAsp: 6.877 ± 1.345
9.169LeuGlu: 9.169 ± 1.925
5.731LeuPhe: 5.731 ± 1.939
6.304LeuGly: 6.304 ± 1.732
0.573LeuHis: 0.573 ± 0.379
5.731LeuIle: 5.731 ± 1.491
4.011LeuLys: 4.011 ± 1.089
8.023LeuLeu: 8.023 ± 2.083
5.731LeuMet: 5.731 ± 2.58
6.877LeuAsn: 6.877 ± 1.815
5.731LeuPro: 5.731 ± 2.186
3.438LeuGln: 3.438 ± 1.506
4.585LeuArg: 4.585 ± 1.214
2.865LeuSer: 2.865 ± 1.497
9.169LeuThr: 9.169 ± 1.633
4.585LeuVal: 4.585 ± 0.914
2.292LeuTrp: 2.292 ± 1.553
5.158LeuTyr: 5.158 ± 1.107
0.0LeuXaa: 0.0 ± 0.0
Met
3.438MetAla: 3.438 ± 1.262
1.146MetCys: 1.146 ± 0.716
3.438MetAsp: 3.438 ± 2.147
1.719MetGlu: 1.719 ± 0.607
1.146MetPhe: 1.146 ± 0.502
1.719MetGly: 1.719 ± 0.488
0.0MetHis: 0.0 ± 0.0
0.573MetIle: 0.573 ± 0.379
1.146MetLys: 1.146 ± 0.716
2.865MetLeu: 2.865 ± 1.405
0.0MetMet: 0.0 ± 0.0
0.573MetAsn: 0.573 ± 0.556
0.0MetPro: 0.0 ± 0.0
2.292MetGln: 2.292 ± 1.31
1.146MetArg: 1.146 ± 0.716
1.146MetSer: 1.146 ± 0.502
1.719MetThr: 1.719 ± 0.989
0.573MetVal: 0.573 ± 0.714
0.573MetTrp: 0.573 ± 0.556
0.573MetTyr: 0.573 ± 0.379
0.0MetXaa: 0.0 ± 0.0
Asn
4.011AsnAla: 4.011 ± 1.255
1.146AsnCys: 1.146 ± 0.731
1.719AsnAsp: 1.719 ± 0.694
2.865AsnGlu: 2.865 ± 2.071
2.865AsnPhe: 2.865 ± 1.283
2.292AsnGly: 2.292 ± 1.004
2.292AsnHis: 2.292 ± 0.492
1.719AsnIle: 1.719 ± 1.136
2.292AsnLys: 2.292 ± 0.751
5.731AsnLeu: 5.731 ± 1.468
1.146AsnMet: 1.146 ± 0.503
1.719AsnAsn: 1.719 ± 0.895
1.146AsnPro: 1.146 ± 1.112
2.292AsnGln: 2.292 ± 0.918
1.719AsnArg: 1.719 ± 0.694
2.865AsnSer: 2.865 ± 1.467
2.865AsnThr: 2.865 ± 0.972
6.304AsnVal: 6.304 ± 1.139
2.865AsnTrp: 2.865 ± 0.799
0.573AsnTyr: 0.573 ± 0.379
0.0AsnXaa: 0.0 ± 0.0
Pro
4.585ProAla: 4.585 ± 0.42
0.0ProCys: 0.0 ± 0.0
8.023ProAsp: 8.023 ± 2.153
5.158ProGlu: 5.158 ± 1.798
1.146ProPhe: 1.146 ± 0.757
4.585ProGly: 4.585 ± 1.851
0.573ProHis: 0.573 ± 0.556
1.146ProIle: 1.146 ± 1.112
2.292ProLys: 2.292 ± 0.999
5.731ProLeu: 5.731 ± 1.884
1.146ProMet: 1.146 ± 0.757
1.719ProAsn: 1.719 ± 0.694
4.011ProPro: 4.011 ± 0.678
1.719ProGln: 1.719 ± 0.607
2.292ProArg: 2.292 ± 1.14
2.865ProSer: 2.865 ± 2.208
2.865ProThr: 2.865 ± 0.403
1.719ProVal: 1.719 ± 0.989
0.0ProTrp: 0.0 ± 0.0
0.573ProTyr: 0.573 ± 0.556
0.0ProXaa: 0.0 ± 0.0
Gln
2.865GlnAla: 2.865 ± 0.799
0.573GlnCys: 0.573 ± 0.379
0.0GlnAsp: 0.0 ± 0.0
3.438GlnGlu: 3.438 ± 0.608
1.146GlnPhe: 1.146 ± 0.757
4.585GlnGly: 4.585 ± 3.202
1.719GlnHis: 1.719 ± 1.378
1.719GlnIle: 1.719 ± 1.061
0.573GlnLys: 0.573 ± 0.379
2.865GlnLeu: 2.865 ± 1.467
0.573GlnMet: 0.573 ± 0.354
0.573GlnAsn: 0.573 ± 0.714
2.292GlnPro: 2.292 ± 1.14
2.865GlnGln: 2.865 ± 0.856
5.158GlnArg: 5.158 ± 1.875
1.719GlnSer: 1.719 ± 0.78
3.438GlnThr: 3.438 ± 0.98
3.438GlnVal: 3.438 ± 0.975
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.292ArgAla: 2.292 ± 1.462
0.573ArgCys: 0.573 ± 0.714
0.573ArgAsp: 0.573 ± 0.379
2.292ArgGlu: 2.292 ± 1.462
2.292ArgPhe: 2.292 ± 1.14
4.585ArgGly: 4.585 ± 0.983
2.292ArgHis: 2.292 ± 0.829
2.292ArgIle: 2.292 ± 0.7
3.438ArgLys: 3.438 ± 0.673
3.438ArgLeu: 3.438 ± 1.058
1.719ArgMet: 1.719 ± 0.781
2.292ArgAsn: 2.292 ± 0.492
3.438ArgPro: 3.438 ± 1.14
2.865ArgGln: 2.865 ± 1.577
8.023ArgArg: 8.023 ± 4.012
5.731ArgSer: 5.731 ± 2.579
2.865ArgThr: 2.865 ± 0.972
1.719ArgVal: 1.719 ± 0.78
1.146ArgTrp: 1.146 ± 0.731
1.719ArgTyr: 1.719 ± 0.989
0.0ArgXaa: 0.0 ± 0.0
Ser
4.011SerAla: 4.011 ± 1.385
5.158SerCys: 5.158 ± 1.736
1.719SerAsp: 1.719 ± 1.136
4.585SerGlu: 4.585 ± 1.878
1.146SerPhe: 1.146 ± 0.757
2.865SerGly: 2.865 ± 0.82
0.573SerHis: 0.573 ± 0.556
4.585SerIle: 4.585 ± 1.161
4.585SerLys: 4.585 ± 0.765
5.731SerLeu: 5.731 ± 1.866
1.719SerMet: 1.719 ± 0.989
1.719SerAsn: 1.719 ± 1.136
1.146SerPro: 1.146 ± 0.716
3.438SerGln: 3.438 ± 1.358
1.146SerArg: 1.146 ± 0.757
4.585SerSer: 4.585 ± 1.939
5.158SerThr: 5.158 ± 1.652
2.865SerVal: 2.865 ± 1.405
0.0SerTrp: 0.0 ± 0.0
0.573SerTyr: 0.573 ± 0.714
0.0SerXaa: 0.0 ± 0.0
Thr
6.304ThrAla: 6.304 ± 2.555
1.146ThrCys: 1.146 ± 0.716
1.719ThrAsp: 1.719 ± 0.694
4.011ThrGlu: 4.011 ± 1.36
2.865ThrPhe: 2.865 ± 1.07
8.596ThrGly: 8.596 ± 2.832
0.0ThrHis: 0.0 ± 0.0
2.292ThrIle: 2.292 ± 1.038
0.573ThrLys: 0.573 ± 0.379
5.731ThrLeu: 5.731 ± 2.995
0.0ThrMet: 0.0 ± 0.0
7.45ThrAsn: 7.45 ± 3.505
6.304ThrPro: 6.304 ± 0.96
5.158ThrGln: 5.158 ± 1.123
4.011ThrArg: 4.011 ± 1.89
3.438ThrSer: 3.438 ± 1.058
5.158ThrThr: 5.158 ± 1.123
2.865ThrVal: 2.865 ± 1.467
2.292ThrTrp: 2.292 ± 1.553
0.573ThrTyr: 0.573 ± 0.556
0.0ThrXaa: 0.0 ± 0.0
Val
4.585ValAla: 4.585 ± 1.469
0.573ValCys: 0.573 ± 0.379
0.573ValAsp: 0.573 ± 0.379
6.304ValGlu: 6.304 ± 1.803
2.292ValPhe: 2.292 ± 0.7
3.438ValGly: 3.438 ± 1.976
0.573ValHis: 0.573 ± 0.379
2.292ValIle: 2.292 ± 0.918
4.011ValLys: 4.011 ± 1.887
6.877ValLeu: 6.877 ± 1.936
0.573ValMet: 0.573 ± 0.379
1.146ValAsn: 1.146 ± 0.757
3.438ValPro: 3.438 ± 0.673
1.719ValGln: 1.719 ± 0.607
3.438ValArg: 3.438 ± 1.14
2.865ValSer: 2.865 ± 0.77
5.158ValThr: 5.158 ± 1.2
2.292ValVal: 2.292 ± 1.14
0.573ValTrp: 0.573 ± 0.714
2.292ValTyr: 2.292 ± 1.14
0.0ValXaa: 0.0 ± 0.0
Trp
1.719TrpAla: 1.719 ± 0.78
0.0TrpCys: 0.0 ± 0.0
1.146TrpAsp: 1.146 ± 0.757
0.573TrpGlu: 0.573 ± 0.556
1.146TrpPhe: 1.146 ± 0.716
1.719TrpGly: 1.719 ± 0.982
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.146TrpLys: 1.146 ± 0.757
1.146TrpLeu: 1.146 ± 0.731
2.292TrpMet: 2.292 ± 2.14
0.573TrpAsn: 0.573 ± 0.379
0.573TrpPro: 0.573 ± 0.556
0.0TrpGln: 0.0 ± 0.0
2.292TrpArg: 2.292 ± 1.462
0.573TrpSer: 0.573 ± 0.714
0.0TrpThr: 0.0 ± 0.0
1.146TrpVal: 1.146 ± 0.731
0.573TrpTrp: 0.573 ± 0.379
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
1.719TyrCys: 1.719 ± 0.895
0.0TyrAsp: 0.0 ± 0.0
0.0TyrGlu: 0.0 ± 0.0
2.292TyrPhe: 2.292 ± 0.492
2.865TyrGly: 2.865 ± 1.576
1.146TyrHis: 1.146 ± 0.716
1.146TyrIle: 1.146 ± 0.757
3.438TyrLys: 3.438 ± 0.952
2.292TyrLeu: 2.292 ± 0.999
0.573TyrMet: 0.573 ± 0.379
2.292TyrAsn: 2.292 ± 1.462
1.146TyrPro: 1.146 ± 1.112
1.146TyrGln: 1.146 ± 0.502
1.146TyrArg: 1.146 ± 0.731
1.719TyrSer: 1.719 ± 1.104
1.719TyrThr: 1.719 ± 0.607
0.573TyrVal: 0.573 ± 0.556
0.573TyrTrp: 0.573 ± 0.379
1.719TyrTyr: 1.719 ± 0.607
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1746 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski