Amino acid dipepetide frequency for Avian metapneumovirus (isolate Canada goose/Minnesota/15a/2001) (AMPV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.336AlaAla: 5.336 ± 2.439
1.556AlaCys: 1.556 ± 0.472
2.001AlaAsp: 2.001 ± 0.402
5.558AlaGlu: 5.558 ± 1.442
1.112AlaPhe: 1.112 ± 0.582
4.002AlaGly: 4.002 ± 1.419
0.445AlaHis: 0.445 ± 0.203
4.224AlaIle: 4.224 ± 1.123
4.891AlaLys: 4.891 ± 0.987
5.336AlaLeu: 5.336 ± 1.659
1.556AlaMet: 1.556 ± 0.522
2.223AlaAsn: 2.223 ± 0.81
2.89AlaPro: 2.89 ± 1.102
1.556AlaGln: 1.556 ± 0.608
3.335AlaArg: 3.335 ± 1.432
3.112AlaSer: 3.112 ± 1.643
4.891AlaThr: 4.891 ± 1.392
4.891AlaVal: 4.891 ± 1.443
0.222AlaTrp: 0.222 ± 0.259
1.112AlaTyr: 1.112 ± 0.525
0.0AlaXaa: 0.0 ± 0.0
Cys
0.222CysAla: 0.222 ± 0.147
0.445CysCys: 0.445 ± 0.203
1.556CysAsp: 1.556 ± 0.527
1.112CysGlu: 1.112 ± 0.355
0.445CysPhe: 0.445 ± 0.359
1.112CysGly: 1.112 ± 0.52
0.445CysHis: 0.445 ± 0.293
1.112CysIle: 1.112 ± 0.356
2.446CysLys: 2.446 ± 0.681
0.445CysLeu: 0.445 ± 0.22
0.222CysMet: 0.222 ± 0.147
1.112CysAsn: 1.112 ± 0.293
0.222CysPro: 0.222 ± 0.287
1.334CysGln: 1.334 ± 0.444
1.112CysArg: 1.112 ± 0.596
1.334CysSer: 1.334 ± 1.067
1.556CysThr: 1.556 ± 0.517
0.445CysVal: 0.445 ± 0.293
0.667CysTrp: 0.667 ± 0.346
1.112CysTyr: 1.112 ± 0.485
0.0CysXaa: 0.0 ± 0.0
Asp
2.668AspAla: 2.668 ± 0.56
0.667AspCys: 0.667 ± 0.472
2.668AspAsp: 2.668 ± 1.221
1.779AspGlu: 1.779 ± 1.043
2.223AspPhe: 2.223 ± 0.726
1.779AspGly: 1.779 ± 0.785
0.889AspHis: 0.889 ± 0.359
3.779AspIle: 3.779 ± 1.047
3.112AspLys: 3.112 ± 0.785
4.669AspLeu: 4.669 ± 0.967
1.112AspMet: 1.112 ± 0.501
4.002AspAsn: 4.002 ± 0.6
1.334AspPro: 1.334 ± 0.393
2.668AspGln: 2.668 ± 1.213
2.001AspArg: 2.001 ± 0.854
2.446AspSer: 2.446 ± 0.785
2.668AspThr: 2.668 ± 0.89
2.446AspVal: 2.446 ± 0.607
0.445AspTrp: 0.445 ± 0.293
1.112AspTyr: 1.112 ± 0.53
0.0AspXaa: 0.0 ± 0.0
Glu
5.336GluAla: 5.336 ± 1.892
0.889GluCys: 0.889 ± 0.355
2.668GluAsp: 2.668 ± 1.212
7.781GluGlu: 7.781 ± 3.643
2.001GluPhe: 2.001 ± 0.815
4.224GluGly: 4.224 ± 1.093
1.556GluHis: 1.556 ± 0.6
3.112GluIle: 3.112 ± 1.159
5.113GluLys: 5.113 ± 1.477
7.559GluLeu: 7.559 ± 0.491
1.556GluMet: 1.556 ± 0.709
3.557GluAsn: 3.557 ± 0.971
1.334GluPro: 1.334 ± 0.536
2.668GluGln: 2.668 ± 0.336
2.223GluArg: 2.223 ± 1.003
7.114GluSer: 7.114 ± 1.768
4.669GluThr: 4.669 ± 1.648
4.891GluVal: 4.891 ± 1.321
0.445GluTrp: 0.445 ± 0.293
1.112GluTyr: 1.112 ± 0.356
0.0GluXaa: 0.0 ± 0.0
Phe
1.556PheAla: 1.556 ± 0.463
0.667PheCys: 0.667 ± 0.259
1.556PheAsp: 1.556 ± 0.552
1.779PheGlu: 1.779 ± 0.26
1.556PhePhe: 1.556 ± 0.757
2.001PheGly: 2.001 ± 0.692
0.889PheHis: 0.889 ± 0.406
2.001PheIle: 2.001 ± 0.422
2.668PheLys: 2.668 ± 0.78
2.223PheLeu: 2.223 ± 0.905
0.889PheMet: 0.889 ± 0.37
1.334PheAsn: 1.334 ± 0.431
2.223PhePro: 2.223 ± 0.43
1.112PheGln: 1.112 ± 0.418
1.334PheArg: 1.334 ± 0.88
2.223PheSer: 2.223 ± 0.903
0.889PheThr: 0.889 ± 0.279
3.112PheVal: 3.112 ± 0.694
0.667PheTrp: 0.667 ± 0.44
0.667PheTyr: 0.667 ± 0.275
0.0PheXaa: 0.0 ± 0.0
Gly
4.224GlyAla: 4.224 ± 1.413
0.667GlyCys: 0.667 ± 0.259
2.446GlyAsp: 2.446 ± 0.748
3.557GlyGlu: 3.557 ± 0.586
1.779GlyPhe: 1.779 ± 0.71
2.223GlyGly: 2.223 ± 0.735
1.112GlyHis: 1.112 ± 0.382
3.779GlyIle: 3.779 ± 1.028
5.113GlyLys: 5.113 ± 0.577
5.78GlyLeu: 5.78 ± 1.12
0.889GlyMet: 0.889 ± 0.715
2.668GlyAsn: 2.668 ± 0.44
2.446GlyPro: 2.446 ± 0.895
1.112GlyGln: 1.112 ± 0.736
2.001GlyArg: 2.001 ± 0.384
5.558GlySer: 5.558 ± 1.053
2.668GlyThr: 2.668 ± 0.672
4.669GlyVal: 4.669 ± 1.861
0.889GlyTrp: 0.889 ± 0.439
0.889GlyTyr: 0.889 ± 0.246
0.0GlyXaa: 0.0 ± 0.0
His
2.001HisAla: 2.001 ± 0.662
0.0HisCys: 0.0 ± 0.0
0.445HisAsp: 0.445 ± 0.245
0.222HisGlu: 0.222 ± 0.147
0.445HisPhe: 0.445 ± 0.293
1.334HisGly: 1.334 ± 0.584
0.667HisHis: 0.667 ± 0.313
0.889HisIle: 0.889 ± 0.587
0.445HisLys: 0.445 ± 0.293
0.667HisLeu: 0.667 ± 0.275
0.445HisMet: 0.445 ± 0.293
0.889HisAsn: 0.889 ± 0.578
1.112HisPro: 1.112 ± 0.313
0.445HisGln: 0.445 ± 0.339
0.667HisArg: 0.667 ± 0.341
0.222HisSer: 0.222 ± 0.147
1.779HisThr: 1.779 ± 1.132
1.112HisVal: 1.112 ± 0.371
0.445HisTrp: 0.445 ± 0.293
0.445HisTyr: 0.445 ± 0.293
0.0HisXaa: 0.0 ± 0.0
Ile
2.89IleAla: 2.89 ± 1.005
2.001IleCys: 2.001 ± 0.869
3.335IleAsp: 3.335 ± 0.758
2.89IleGlu: 2.89 ± 0.638
2.446IlePhe: 2.446 ± 1.032
2.89IleGly: 2.89 ± 0.599
0.889IleHis: 0.889 ± 0.261
3.335IleIle: 3.335 ± 1.278
4.891IleLys: 4.891 ± 0.579
6.225IleLeu: 6.225 ± 1.06
2.223IleMet: 2.223 ± 0.462
2.89IleAsn: 2.89 ± 0.703
2.001IlePro: 2.001 ± 0.506
1.112IleGln: 1.112 ± 0.385
2.89IleArg: 2.89 ± 0.586
5.336IleSer: 5.336 ± 1.36
4.669IleThr: 4.669 ± 1.352
3.112IleVal: 3.112 ± 0.984
0.222IleTrp: 0.222 ± 0.147
1.112IleTyr: 1.112 ± 0.605
0.0IleXaa: 0.0 ± 0.0
Lys
6.003LysAla: 6.003 ± 0.689
1.112LysCys: 1.112 ± 1.048
3.779LysAsp: 3.779 ± 0.786
6.225LysGlu: 6.225 ± 0.774
2.223LysPhe: 2.223 ± 0.726
4.224LysGly: 4.224 ± 1.451
1.334LysHis: 1.334 ± 0.565
4.669LysIle: 4.669 ± 1.165
5.336LysLys: 5.336 ± 1.059
7.114LysLeu: 7.114 ± 2.156
2.001LysMet: 2.001 ± 0.675
4.891LysAsn: 4.891 ± 0.7
2.446LysPro: 2.446 ± 0.863
2.668LysGln: 2.668 ± 0.734
2.668LysArg: 2.668 ± 0.508
5.336LysSer: 5.336 ± 0.89
8.004LysThr: 8.004 ± 2.61
5.336LysVal: 5.336 ± 0.755
0.445LysTrp: 0.445 ± 0.263
1.112LysTyr: 1.112 ± 0.34
0.0LysXaa: 0.0 ± 0.0
Leu
3.779LeuAla: 3.779 ± 1.077
2.668LeuCys: 2.668 ± 0.548
3.557LeuAsp: 3.557 ± 0.872
8.226LeuGlu: 8.226 ± 1.383
2.446LeuPhe: 2.446 ± 0.911
6.892LeuGly: 6.892 ± 1.987
1.112LeuHis: 1.112 ± 0.399
6.225LeuIle: 6.225 ± 1.124
8.893LeuLys: 8.893 ± 1.283
8.893LeuLeu: 8.893 ± 1.282
1.556LeuMet: 1.556 ± 0.632
4.669LeuAsn: 4.669 ± 1.208
2.89LeuPro: 2.89 ± 0.689
2.668LeuGln: 2.668 ± 0.911
5.78LeuArg: 5.78 ± 1.316
9.337LeuSer: 9.337 ± 1.918
7.114LeuThr: 7.114 ± 0.832
4.669LeuVal: 4.669 ± 0.865
1.112LeuTrp: 1.112 ± 0.54
2.89LeuTyr: 2.89 ± 0.874
0.0LeuXaa: 0.0 ± 0.0
Met
0.889MetAla: 0.889 ± 0.279
0.0MetCys: 0.0 ± 0.0
0.667MetAsp: 0.667 ± 0.44
2.446MetGlu: 2.446 ± 0.55
0.889MetPhe: 0.889 ± 0.587
0.889MetGly: 0.889 ± 0.427
0.667MetHis: 0.667 ± 0.44
1.556MetIle: 1.556 ± 0.642
0.889MetLys: 0.889 ± 0.738
3.335MetLeu: 3.335 ± 0.799
0.667MetMet: 0.667 ± 0.462
1.779MetAsn: 1.779 ± 0.411
0.667MetPro: 0.667 ± 0.259
1.556MetGln: 1.556 ± 0.573
0.667MetArg: 0.667 ± 0.44
2.89MetSer: 2.89 ± 0.557
0.889MetThr: 0.889 ± 0.539
2.668MetVal: 2.668 ± 0.874
0.0MetTrp: 0.0 ± 0.0
0.889MetTyr: 0.889 ± 0.261
0.0MetXaa: 0.0 ± 0.0
Asn
3.335AsnAla: 3.335 ± 0.81
0.889AsnCys: 0.889 ± 0.591
2.001AsnAsp: 2.001 ± 0.497
3.779AsnGlu: 3.779 ± 0.743
2.001AsnPhe: 2.001 ± 0.647
2.446AsnGly: 2.446 ± 0.549
0.667AsnHis: 0.667 ± 0.737
3.557AsnIle: 3.557 ± 1.157
4.446AsnLys: 4.446 ± 0.824
5.336AsnLeu: 5.336 ± 1.755
1.334AsnMet: 1.334 ± 0.618
3.112AsnAsn: 3.112 ± 0.738
2.001AsnPro: 2.001 ± 0.651
2.446AsnGln: 2.446 ± 0.635
3.112AsnArg: 3.112 ± 0.637
3.112AsnSer: 3.112 ± 0.886
4.002AsnThr: 4.002 ± 1.923
2.446AsnVal: 2.446 ± 0.581
1.112AsnTrp: 1.112 ± 0.341
2.223AsnTyr: 2.223 ± 1.058
0.0AsnXaa: 0.0 ± 0.0
Pro
3.335ProAla: 3.335 ± 1.019
1.556ProCys: 1.556 ± 0.727
1.779ProAsp: 1.779 ± 0.551
2.668ProGlu: 2.668 ± 0.92
0.445ProPhe: 0.445 ± 0.293
0.445ProGly: 0.445 ± 0.203
0.222ProHis: 0.222 ± 0.147
2.001ProIle: 2.001 ± 0.922
4.224ProLys: 4.224 ± 0.563
2.446ProLeu: 2.446 ± 0.716
1.779ProMet: 1.779 ± 0.448
1.779ProAsn: 1.779 ± 0.486
2.446ProPro: 2.446 ± 0.63
0.889ProGln: 0.889 ± 0.261
1.334ProArg: 1.334 ± 0.382
3.112ProSer: 3.112 ± 0.635
5.558ProThr: 5.558 ± 2.623
3.112ProVal: 3.112 ± 0.921
0.889ProTrp: 0.889 ± 0.587
0.889ProTyr: 0.889 ± 0.759
0.0ProXaa: 0.0 ± 0.0
Gln
2.668GlnAla: 2.668 ± 1.109
0.222GlnCys: 0.222 ± 0.147
1.556GlnAsp: 1.556 ± 0.506
1.779GlnGlu: 1.779 ± 0.627
1.556GlnPhe: 1.556 ± 0.609
2.223GlnGly: 2.223 ± 0.695
0.889GlnHis: 0.889 ± 0.246
1.112GlnIle: 1.112 ± 0.261
1.556GlnLys: 1.556 ± 0.601
6.003GlnLeu: 6.003 ± 0.845
0.445GlnMet: 0.445 ± 0.24
2.001GlnAsn: 2.001 ± 0.675
1.112GlnPro: 1.112 ± 0.63
0.667GlnGln: 0.667 ± 0.272
1.779GlnArg: 1.779 ± 0.611
2.89GlnSer: 2.89 ± 0.61
3.112GlnThr: 3.112 ± 1.514
1.556GlnVal: 1.556 ± 0.385
0.0GlnTrp: 0.0 ± 0.0
0.889GlnTyr: 0.889 ± 0.572
0.0GlnXaa: 0.0 ± 0.0
Arg
2.223ArgAla: 2.223 ± 0.426
0.445ArgCys: 0.445 ± 0.263
2.668ArgAsp: 2.668 ± 0.815
3.112ArgGlu: 3.112 ± 0.488
2.223ArgPhe: 2.223 ± 0.71
2.668ArgGly: 2.668 ± 0.64
0.667ArgHis: 0.667 ± 0.256
3.335ArgIle: 3.335 ± 0.76
2.89ArgLys: 2.89 ± 0.737
3.335ArgLeu: 3.335 ± 0.974
0.667ArgMet: 0.667 ± 0.312
3.779ArgAsn: 3.779 ± 0.83
1.556ArgPro: 1.556 ± 0.709
2.89ArgGln: 2.89 ± 0.658
3.557ArgArg: 3.557 ± 0.616
4.669ArgSer: 4.669 ± 0.879
4.446ArgThr: 4.446 ± 0.727
4.002ArgVal: 4.002 ± 1.336
1.112ArgTrp: 1.112 ± 0.324
1.112ArgTyr: 1.112 ± 0.46
0.0ArgXaa: 0.0 ± 0.0
Ser
4.002SerAla: 4.002 ± 1.364
1.334SerCys: 1.334 ± 0.482
3.112SerAsp: 3.112 ± 0.785
4.224SerGlu: 4.224 ± 1.08
2.223SerPhe: 2.223 ± 0.274
4.002SerGly: 4.002 ± 1.036
0.889SerHis: 0.889 ± 0.32
4.224SerIle: 4.224 ± 1.196
7.781SerLys: 7.781 ± 0.803
8.226SerLeu: 8.226 ± 1.979
1.779SerMet: 1.779 ± 0.477
3.557SerAsn: 3.557 ± 1.221
4.002SerPro: 4.002 ± 1.079
1.779SerGln: 1.779 ± 0.411
4.669SerArg: 4.669 ± 1.146
7.559SerSer: 7.559 ± 1.404
7.337SerThr: 7.337 ± 1.976
6.225SerVal: 6.225 ± 1.529
1.112SerTrp: 1.112 ± 0.371
3.779SerTyr: 3.779 ± 0.678
0.0SerXaa: 0.0 ± 0.0
Thr
5.336ThrAla: 5.336 ± 2.745
1.112ThrCys: 1.112 ± 0.312
4.002ThrAsp: 4.002 ± 0.8
5.336ThrGlu: 5.336 ± 1.852
1.779ThrPhe: 1.779 ± 0.878
3.112ThrGly: 3.112 ± 0.927
0.445ThrHis: 0.445 ± 0.245
3.779ThrIle: 3.779 ± 0.819
6.447ThrLys: 6.447 ± 2.478
6.67ThrLeu: 6.67 ± 1.49
1.779ThrMet: 1.779 ± 0.697
3.779ThrAsn: 3.779 ± 0.933
6.003ThrPro: 6.003 ± 2.643
2.668ThrGln: 2.668 ± 0.536
5.336ThrArg: 5.336 ± 1.823
5.113ThrSer: 5.113 ± 1.449
14.229ThrThr: 14.229 ± 8.993
4.669ThrVal: 4.669 ± 1.627
0.667ThrTrp: 0.667 ± 0.285
1.779ThrTyr: 1.779 ± 0.682
0.0ThrXaa: 0.0 ± 0.0
Val
2.89ValAla: 2.89 ± 1.51
1.112ValCys: 1.112 ± 0.527
2.668ValAsp: 2.668 ± 0.883
5.558ValGlu: 5.558 ± 0.674
2.223ValPhe: 2.223 ± 0.718
4.891ValGly: 4.891 ± 0.784
0.222ValHis: 0.222 ± 0.421
2.223ValIle: 2.223 ± 0.61
3.779ValLys: 3.779 ± 1.119
6.892ValLeu: 6.892 ± 1.208
2.223ValMet: 2.223 ± 0.965
3.557ValAsn: 3.557 ± 0.796
2.223ValPro: 2.223 ± 0.814
3.112ValGln: 3.112 ± 1.022
3.779ValArg: 3.779 ± 1.157
7.114ValSer: 7.114 ± 1.415
3.779ValThr: 3.779 ± 0.713
4.224ValVal: 4.224 ± 1.29
0.667ValTrp: 0.667 ± 0.493
3.557ValTyr: 3.557 ± 0.785
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.445TrpCys: 0.445 ± 0.293
0.667TrpAsp: 0.667 ± 0.313
0.0TrpGlu: 0.0 ± 0.0
0.667TrpPhe: 0.667 ± 0.299
0.889TrpGly: 0.889 ± 0.321
0.0TrpHis: 0.0 ± 0.0
0.889TrpIle: 0.889 ± 0.406
0.889TrpLys: 0.889 ± 0.485
1.112TrpLeu: 1.112 ± 0.733
0.667TrpMet: 0.667 ± 0.44
0.222TrpAsn: 0.222 ± 0.147
0.445TrpPro: 0.445 ± 0.274
0.222TrpGln: 0.222 ± 0.147
0.889TrpArg: 0.889 ± 0.308
1.112TrpSer: 1.112 ± 0.531
0.445TrpThr: 0.445 ± 0.407
1.112TrpVal: 1.112 ± 0.491
0.0TrpTrp: 0.0 ± 0.0
0.667TrpTyr: 0.667 ± 0.469
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.556TyrAla: 1.556 ± 0.922
0.667TyrCys: 0.667 ± 0.493
1.334TyrAsp: 1.334 ± 0.491
2.001TyrGlu: 2.001 ± 0.68
0.667TyrPhe: 0.667 ± 0.44
2.223TyrGly: 2.223 ± 0.642
0.667TyrHis: 0.667 ± 0.44
1.556TyrIle: 1.556 ± 0.457
1.112TyrLys: 1.112 ± 0.491
3.112TyrLeu: 3.112 ± 0.811
0.889TyrMet: 0.889 ± 0.298
1.556TyrAsn: 1.556 ± 0.367
1.556TyrPro: 1.556 ± 0.609
0.445TyrGln: 0.445 ± 0.418
2.223TyrArg: 2.223 ± 0.776
2.223TyrSer: 2.223 ± 0.516
1.334TyrThr: 1.334 ± 0.681
2.001TyrVal: 2.001 ± 0.696
0.222TyrTrp: 0.222 ± 0.298
1.112TyrTyr: 1.112 ± 0.261
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4499 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski