Forum | Documentation | Website | Blog

Skip to content
Snippets Groups Projects
  1. Dec 28, 2021
  2. Dec 27, 2021
  3. Dec 22, 2021
  4. Dec 17, 2021
    • Alex Deucher's avatar
      drm/amdgpu: add support for IP discovery gc_info table v2 · 5e713c6a
      Alex Deucher authored
      Used on gfx9 based systems. Fixes incorrect CU counts reported
      in the kernel log.
      
      Bug: https://gitlab.freedesktop.org/drm/amd/-/issues/1833
      
      
      Reviewed-by: default avatarHawking Zhang <Hawking.Zhang@amd.com>
      Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
      Cc: stable@vger.kernel.org
      5e713c6a
    • chen gong's avatar
      drm/amdgpu: When the VCN(1.0) block is suspended, powergating is explicitly enabled · b7865173
      chen gong authored
      Play a video on the raven (or PCO, raven2) platform, and then do the S3
      test. When resume, the following error will be reported:
      
      amdgpu 0000:02:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring
      vcn_dec test failed (-110)
      [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block
      <vcn_v1_0> failed -110
      amdgpu 0000:02:00.0: amdgpu: amdgpu_device_ip_resume failed (-110).
      PM: dpm_run_callback(): pci_pm_resume+0x0/0x90 returns -110
      
      [why]
      When playing the video: The power state flag of the vcn block is set to
      POWER_STATE_ON.
      
      When doing suspend: There is no change to the power state flag of the
      vcn block, it is still POWER_STATE_ON.
      
      When doing resume: Need to open the power gate of the vcn block and set
      the power state flag of the VCN block to POWER_STATE_ON.
      But at this time, the power state flag of the vcn block is already
      POWER_STATE_ON. The power status flag check in the "8f2cdef drm/amd/pm:
      avoid duplicate powergate/ungate setting" patch will return the
      amdgpu_dpm_set_powergating_by_smu function directly.
      As a result, the gate of the power was not opened, causing the
      subsequent ring test to fail.
      
      [how]
      In the suspend function of the vcn block, explicitly change the power
      state flag of the vcn block to POWER_STATE_OFF.
      
      BugLink: https://gitlab.freedesktop.org/drm/amd/-/issues/1828
      
      
      Signed-off-by: default avatarchen gong <curry.gong@amd.com>
      Reviewed-by: default avatarEvan Quan <evan.quan@amd.com>
      Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
      Cc: stable@vger.kernel.org
      b7865173
    • Lijo Lazar's avatar
      drm/amd/pm: Fix xgmi link control on aldebaran · 19e66d51
      Lijo Lazar authored
      
      Fix the message argument.
      	0: Allow power down
      	1: Disallow power down
      
      Signed-off-by: default avatarLijo Lazar <lijo.lazar@amd.com>
      Reviewed-by: default avatarHawking Zhang <Hawking.Zhang@amd.com>
      Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
      19e66d51
    • Huang Rui's avatar
      drm/amdgpu: introduce new amdgpu_fence object to indicate the job embedded fence · bf67014d
      Huang Rui authored
      
      The job embedded fence donesn't initialize the flags at
      dma_fence_init(). Then we will go a wrong way in
      amdgpu_fence_get_timeline_name callback and trigger a null pointer panic
      once we enabled the trace event here. So introduce new amdgpu_fence
      object to indicate the job embedded fence.
      
      [  156.131790] BUG: kernel NULL pointer dereference, address: 00000000000002a0
      [  156.131804] #PF: supervisor read access in kernel mode
      [  156.131811] #PF: error_code(0x0000) - not-present page
      [  156.131817] PGD 0 P4D 0
      [  156.131824] Oops: 0000 [#1] PREEMPT SMP PTI
      [  156.131832] CPU: 6 PID: 1404 Comm: sdma0 Tainted: G           OE     5.16.0-rc1-custom #1
      [  156.131842] Hardware name: Gigabyte Technology Co., Ltd. Z170XP-SLI/Z170XP-SLI-CF, BIOS F20 11/04/2016
      [  156.131848] RIP: 0010:strlen+0x0/0x20
      [  156.131859] Code: 89 c0 c3 0f 1f 80 00 00 00 00 48 01 fe eb 0f 0f b6 07 38 d0 74 10 48 83 c7 01 84 c0 74 05 48 39 f7 75 ec 31 c0 c3 48 89 f8 c3 <80> 3f 00 74 10 48 89 f8 48 83 c0 01 80 38 00 75 f7 48 29 f8 c3 31
      [  156.131872] RSP: 0018:ffff9bd0018dbcf8 EFLAGS: 00010206
      [  156.131880] RAX: 00000000000002a0 RBX: ffff8d0305ef01b0 RCX: 000000000000000b
      [  156.131888] RDX: ffff8d03772ab924 RSI: ffff8d0305ef01b0 RDI: 00000000000002a0
      [  156.131895] RBP: ffff9bd0018dbd60 R08: ffff8d03002094d0 R09: 0000000000000000
      [  156.131901] R10: 000000000000005e R11: 0000000000000065 R12: ffff8d03002094d0
      [  156.131907] R13: 000000000000001f R14: 0000000000070018 R15: 0000000000000007
      [  156.131914] FS:  0000000000000000(0000) GS:ffff8d062ed80000(0000) knlGS:0000000000000000
      [  156.131923] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [  156.131929] CR2: 00000000000002a0 CR3: 000000001120a005 CR4: 00000000003706e0
      [  156.131937] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      [  156.131942] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
      [  156.131949] Call Trace:
      [  156.131953]  <TASK>
      [  156.131957]  ? trace_event_raw_event_dma_fence+0xcc/0x200
      [  156.131973]  ? ring_buffer_unlock_commit+0x23/0x130
      [  156.131982]  dma_fence_init+0x92/0xb0
      [  156.131993]  amdgpu_fence_emit+0x10d/0x2b0 [amdgpu]
      [  156.132302]  amdgpu_ib_schedule+0x2f9/0x580 [amdgpu]
      [  156.132586]  amdgpu_job_run+0xed/0x220 [amdgpu]
      
      v2: fix mismatch warning between the prototype and function name (Ray, kernel test robot)
      
      Signed-off-by: default avatarHuang Rui <ray.huang@amd.com>
      Reviewed-by: default avatarChristian König <christian.koenig@amd.com>
      Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
      bf67014d
    • Dave Airlie's avatar
      Merge tag 'amd-drm-fixes-5.16-2021-12-15' of... · a2fbfd51
      Dave Airlie authored
      Merge tag 'amd-drm-fixes-5.16-2021-12-15' of https://gitlab.freedesktop.org/agd5f/linux
      
       into drm-fixes
      
      amd-drm-fixes-5.16-2021-12-15:
      
      amdgpu:
      - Fix RLC register offset
      - GMC fix
      - Properly cache SMU FW version on Yellow Carp
      - Fix missing callback on DCN3.1
      - Reset DMCUB before HW init
      - Fix for GMC powergating on PCO
      - Fix a possible memory leak in GPU metrics table handling on RN
      
      Signed-off-by: default avatarDave Airlie <airlied@redhat.com>
      From: Alex Deucher <alexander.deucher@amd.com>
      Link: https://patchwork.freedesktop.org/patch/msgid/20211216035239.5787-1-alexander.deucher@amd.com
      a2fbfd51
  5. Dec 16, 2021
  6. Dec 15, 2021
  7. Dec 14, 2021
  8. Dec 13, 2021
  9. Dec 12, 2021
  10. Dec 11, 2021
    • Linus Torvalds's avatar
      Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi · a763d5a5
      Linus Torvalds authored
      Pull SCSI fixes from James Bottomley:
       "Four fixes, all in drivers.
      
        Three are small and obvious, the qedi one is a bit larger but also
        pretty obvious"
      
      * tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi:
        scsi: qla2xxx: Format log strings only if needed
        scsi: scsi_debug: Fix buffer size of REPORT ZONES command
        scsi: qedi: Fix cmd_cleanup_cmpl counter mismatch issue
        scsi: pm80xx: Do not call scsi_remove_host() in pm8001_alloc()
      a763d5a5
    • Linus Torvalds's avatar
      Merge tag 'xfs-5.16-fixes-3' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux · e034d9cb
      Linus Torvalds authored
      Pull xfs fix from Darrick Wong:
       "This fixes a race between a readonly remount process and other
        processes that hold a file IOLOCK on files that previously experienced
        copy on write, that could result in severe filesystem corruption if
        the filesystem is then remounted rw.
      
        I think this is fairly rare (since the only reliable reproducer I have
        that fits the second criteria is the experimental xfs_scrub program),
        but the race is clear, so we still need to fix this.
      
        Summary:
      
         - Fix a data corruption vector that can result from the ro remount
           process failing to clear all speculative preallocations from files
           and the rw remount process not noticing the incomplete cleanup"
      
      * tag 'xfs-5.16-fixes-3' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux:
        xfs: remove all COW fork extents when remounting readonly
      e034d9cb
    • Linus Torvalds's avatar
      Merge branch 'for-5.16-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/dennis/percpu · 8f97a35a
      Linus Torvalds authored
      Pull percpu fixes from Dennis Zhou:
       "This contains a fix for SMP && !MMU archs for percpu which has been
        tested by arm and sh. It seems in the past they have gotten away with
        it due to mapping of vm functions to km functions, but this fell apart
        a few releases ago and was just reported recently.
      
        The other is just a minor dependency clean up.
      
        I think queued up right now by Andrew is a fix in percpu that papers
        of what seems to be a bug in hotplug for a special situation with
        memoryless nodes. Michal Hocko is digging into it further"
      
      * 'for-5.16-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/dennis/percpu:
        percpu_ref: Replace kernel.h with the necessary inclusions
        percpu: km: ensure it is used with NOMMU (either UP or SMP)
      8f97a35a
    • Linus Torvalds's avatar
      Merge tag 'perf-tools-fixes-for-v5.16-2021-12-11' of... · bbdff6d5
      Linus Torvalds authored
      Merge tag 'perf-tools-fixes-for-v5.16-2021-12-11' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux
      
      Pull perf tools fixes from Arnaldo Carvalho de Melo:
      
       - Prevent out-of-bounds access to per sample registers.
      
       - Fix NULL vs IS_ERR_OR_NULL() checking on the python binding.
      
       - Intel PT fixes, half of those are one-liners:
            - Fix some PGE (packet generation enable/control flow packets) usage.
            - Fix sync state when a PSB (synchronization) packet is found.
            - Fix intel_pt_fup_event() assumptions about setting state type.
            - Fix state setting when receiving overflow (OVF) packet.
            - Fix next 'err' value, walking trace.
            - Fix missing 'instruction' events with 'q' option.
            - Fix error timestamp setting on the decoder error path.
      
      * tag 'perf-tools-fixes-for-v5.16-2021-12-11' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux:
        perf python: Fix NULL vs IS_ERR_OR_NULL() checking
        perf intel-pt: Fix error timestamp setting on the decoder error path
        perf intel-pt: Fix missing 'instruction' events with 'q' option
        perf intel-pt: Fix next 'err' value, walking trace
        perf intel-pt: Fix state setting when receiving overflow (OVF) packet
        perf intel-pt: Fix intel_pt_fup_event() assumptions about setting state type
        perf intel-pt: Fix sync state when a PSB (synchronization) packet is found
        perf intel-pt: Fix some PGE (packet generation enable/control flow packets) usage
        perf tools: Prevent out-of-bounds access to registers
      bbdff6d5